Skip to content

Latest commit

 

History

History
25 lines (18 loc) · 1.49 KB

week-04-part-2.md

File metadata and controls

25 lines (18 loc) · 1.49 KB

Week 4 - Data wrangling with pandas

Reading list:

Python Data Science Handbook by Jake Van derPlas:

Mckinney's book - Python for data analysis : Data wrangling with pandas, numpy, and ipython:

  • Chapter 7:
    • Section 7.1 - Handling missing data
    • Section 7.2 - First 4 susbsections (Removing duplicates, transforming data, replacing values, detecting and filtering outliers)
  • Chapter 8:
    • Section 8.2 - combining and merging datasets (except combining data w/overlap)

Again, it is important to write code as you read and learn about pandas. For this section, I recommend running the code interactively in a Jupyter notebook where you can execute the code in steps, append notes and save it for later.