Skip to content

Notes about data cleaning comparison

Suzanne Childress edited this page Apr 11, 2019 · 2 revisions

I wrote a simple script here https://github.com/psrc/travel-studies/blob/master/2017/summary/hh_survey_simple_summary.py that generates top level metrics for raw trip household travel survey data.

To use it, you will need to point to your data (either in excel, csv, or database format). The paths are hard-coded at the top of the file.

The output is an excel file. I made a comparer file as well, where you can copy paste values into it to see how the data is changing here: https://github.com/psrc/travel-studies/blob/master/2017/summary/output/travel_survey_simple_compare.xlsx