Commit

deidentified data
Hamza Elhamdadi authored and Hamza Elhamdadi committed Oct 21, 2023
1 parent 659336e commit 5e6091f
Showing 16 changed files with 3,103 additions and 3,074 deletions.
29 changes: 29 additions & 0 deletions study_data/deidentify.py
@@ -0,0 +1,29 @@
import pandas as pd
import os


paths_to_clean = [
    os.path.join('full_Study','complete_study_mar16.csv'),
    os.path.join('full_Study','study.csv'),
    os.path.join('full_Study','data_patch.csv'),
    os.path.join('full_Study','data_clean.csv'),
    os.path.join('full_Study','data_clean_june30.csv'),
    os.path.join('full_Study','condition_count.csv'),
    os.path.join('pilot2','Complexity vs. Trust in Vis_March 6, 2023_07.54.csv'),
    os.path.join('pilot3','pilot3.csv'),
    os.path.join('pilot3','pilot3_clean.csv'),
    os.path.join('pilot3','Complexity vs. Trust in Vis_March 12, 2023_16.46.csv'),
    os.path.join('pilot3','Complexity vs. Trust in Vis_March 12, 2023_16.47.csv'),
    os.path.join('pilot4','data.csv'),
    os.path.join('pilot4','data_patch.csv'),
    os.path.join('replicationStudy','data.csv'),
    os.path.join('replicationStudy','data_clean.csv')
]
for path in paths_to_clean:
    df = pd.read_csv(path)
    print('Interview_1_TEXT' in df.columns)
    df = df.drop('Interview_1_TEXT',axis=1)
    df.to_csv(path,index=False)
    # for col in df.columns:
    #     print(col)
    # print(df['Interview_1_TEXT'])
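The script above rewrites each CSV in place after dropping `Interview_1_TEXT`. One way to confirm afterwards that the column is really gone is to read only the header row of every file. The sketch below is not part of the commit: the helper names `header_has_column` and `files_still_containing` are hypothetical, and it assumes the files are UTF-8 (with or without a byte-order mark) and that the first row holds the column names.

```python
import csv
from typing import Iterable, List

# Column name taken from deidentify.py; everything else here is illustrative.
SENSITIVE_COLUMN = 'Interview_1_TEXT'


def header_has_column(path: str, column: str = SENSITIVE_COLUMN) -> bool:
    """Return True if the first row of the CSV at `path` contains `column`."""
    # 'utf-8-sig' strips a leading byte-order mark if one is present,
    # so the first column name is compared without the BOM attached.
    with open(path, newline='', encoding='utf-8-sig') as handle:
        header = next(csv.reader(handle), [])
    return column in header


def files_still_containing(paths: Iterable[str],
                           column: str = SENSITIVE_COLUMN) -> List[str]:
    """List the paths whose header row still includes `column`."""
    return [path for path in paths if header_has_column(path, column)]
```

Calling `files_still_containing(paths_to_clean)` from the `study_data` directory after the script has run should return an empty list if every file was cleaned. Because only the header is read, the check stays cheap even for the larger study files.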
1,274 changes: 637 additions & 637 deletions study_data/full_Study/complete_study_mar16.csv

Large diffs are not rendered by default.

308 changes: 154 additions & 154 deletions study_data/full_Study/condition_count.csv

1,090 changes: 545 additions & 545 deletions study_data/full_Study/data_clean.csv

1,100 changes: 550 additions & 550 deletions study_data/full_Study/data_clean_june30.csv

122 changes: 61 additions & 61 deletions study_data/full_Study/data_patch.csv

296 changes: 148 additions & 148 deletions study_data/full_Study/study.csv

202 changes: 101 additions & 101 deletions study_data/pilot2/Complexity vs. Trust in Vis_March 6, 2023_07.54.csv

study_data/pilot3/Complexity vs. Trust in Vis_March 12, 2023_16.46.csv

study_data/pilot3/Complexity vs. Trust in Vis_March 12, 2023_16.47.csv

124 changes: 62 additions & 62 deletions study_data/pilot3/pilot3.csv

126 changes: 63 additions & 63 deletions study_data/pilot3/pilot3_clean.csv

130 changes: 65 additions & 65 deletions study_data/pilot4/data.csv

122 changes: 61 additions & 61 deletions study_data/pilot4/data_patch.csv

594 changes: 297 additions & 297 deletions study_data/replicationStudy/data.csv

528 changes: 264 additions & 264 deletions study_data/replicationStudy/data_clean.csv