Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Convert raw spreadsheets to standardized/tidy data #1

Open
jsvine opened this issue Aug 24, 2023 · 0 comments
Open

Convert raw spreadsheets to standardized/tidy data #1

jsvine opened this issue Aug 24, 2023 · 0 comments
Labels
enhancement New feature or request help wanted Extra attention is needed

Comments

@jsvine
Copy link
Contributor

jsvine commented Aug 24, 2023

The current raw data is spread across three Excel files. Each has a main table of individual survey responses, but each with a slightly different set of columns, and a slightly different way of representing the response values.

It'd be great to convert those spreadsheets into a more standardized (and CSV-formatted) dataset, where all respondents and responses are represented in the same way.

In terms of structure: Although we might want to ultimately aim for a more "tidy" approach, I think a good first direction would be to keep the same general principle as the raw spreadsheets: One row per respondent, with columns representing the survey instance (respondent ID, date given, platform) as well as the question responses.

One thing to keep track of: Which data-points are not available in all three spreadsheets? How do we want to represent them (if at all) in the standardized file?

@jsvine jsvine added enhancement New feature or request help wanted Extra attention is needed labels Aug 24, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

1 participant