Convert wide dataframe to long dataframe with specific conditions and addition of new columns(使用特定条件和添加新列将宽数据帧转换为长数据帧)
本文介绍了使用特定条件和添加新列将宽数据帧转换为长数据帧的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!
问题描述
我有如下所示的示例数据帧。
import pandas as pd
import numpy as np
NaN = np.nan
data = {'ID':['A','A','A','A','A','A','A','A','A','C','C','C','C','C','C','C','C'],
'Week': ['Week1','Week1','Week1','Week1','Week2','Week2','Week2','Week2','Week3',
'Week1','Week1','Week1','Week1','Week2','Week2','Week2','Week2'],
'Risk':['High','','','','','','','','','High','','','','','','',''],
'Testing':[NaN,'Pos',NaN,'Neg',NaN,NaN,NaN,NaN,'Pos', NaN,
NaN,NaN,'Negative',NaN,NaN,NaN,'Positive'],
'Week1_adher':['Yes',NaN,NaN,NaN,NaN,NaN,NaN,NaN,NaN,'No',NaN,NaN,NaN,NaN,NaN,NaN,NaN],
'Week2_adher':['No',NaN,NaN,NaN,NaN,NaN,NaN,NaN,NaN,'No',NaN,NaN,NaN,NaN,NaN,NaN,NaN],
'Week3_adher':['No',NaN,NaN,NaN,NaN,NaN,NaN,NaN,NaN,'No',NaN,NaN,NaN,NaN,NaN,NaN,NaN]}
df1 = pd.DataFrame(data)
df1
最终数据帧必须是这样的,即每个参与者必须有与周数一样多的行。周列转换为行后,应该有相应的值。
此外,还应将每个参与者每周的"测试"列中的NOTNA值添加到"测试值的数量"中。
最终数据帧应如下图所示。
推荐答案
通过创建两个新列预处理数据帧,然后按ID
和Week
分组,最后聚合新列:
df1['SurveyAdherence'] = df1.filter(regex=r'Weekd+_adher').eq('Yes').any(axis=1)
df1['#Tests'] = df1['Testing'].notna()
mi = pd.MultiIndex.from_product([df1['ID'].unique(), df1['Week'].unique()],
names=['ID', 'Week'])
out = df1.groupby(['ID', 'Week'])
.agg({'SurveyAdherence': 'max', '#Tests': 'sum'})
out = out.reindex(mi)
.fillna({'SurveyAdherence': False, '#Tests': 0})
.astype({'SurveyAdherence': bool, '#Tests': int})
.reset_index()
输出:
>>> df1
ID Week SurveyAdherence #Tests
0 A Week1 True 2
1 A Week2 False 0
2 A Week3 False 1
3 C Week1 False 1
4 C Week2 False 1
5 C Week3 False 0
这篇关于使用特定条件和添加新列将宽数据帧转换为长数据帧的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持编程学习网!
沃梦达教程
本文标题为:使用特定条件和添加新列将宽数据帧转换为长数据帧


基础教程推荐
猜你喜欢
- 合并具有多索引的两个数据帧 2022-01-01
- 使 Python 脚本在 Windows 上运行而不指定“.py";延期 2022-01-01
- 使用 Google App Engine (Python) 将文件上传到 Google Cloud Storage 2022-01-01
- 症状类型错误:无法确定关系的真值 2022-01-01
- 如何在 Python 中检测文件是否为二进制(非文本)文 2022-01-01
- 将 YAML 文件转换为 python dict 2022-01-01
- 哪些 Python 包提供独立的事件系统? 2022-01-01
- 如何在Python中绘制多元函数? 2022-01-01
- Python 的 List 是如何实现的? 2022-01-01
- 使用Python匹配Stata加权xtil命令的确定方法? 2022-01-01