Reading list:
- Creating columns
- How to calculate summary statistics?
- How to reshape the layout of tables?
- How to combine data from multiple tables?
Python Data Science Handbook by Jake Van derPlas:
Mckinney's book - Python for data analysis : Data wrangling with pandas, numpy, and ipython:
- Chapter 7:
- Section 7.1 - Handling missing data
- Section 7.2 - First 4 susbsections (Removing duplicates, transforming data, replacing values, detecting and filtering outliers)
- Chapter 8:
- Section 8.2 - combining and merging datasets (except combining data w/overlap)
Again, it is important to write code as you read and learn about pandas. For this section, I recommend running the code interactively in a Jupyter notebook where you can execute the code in steps, append notes and save it for later.