In this class, we will focus most of our attention on the following books:
- McKinney, Wes, Python for Data Analysis, 3rd Edition, 2022.
- Irizarry, Rafael, Introduction to Data Science, 2020.
Access the textbooks using the links above. Alternatively, you can purchase these books for reasonable prices on Amazon or other textbook websites.
You will also need access to DataCamp. The link for our DataCamp class is on the home page, and invitations will be sent to you in the first week of classes.
There are other fantastic resources which we will refer to occasionally this semester, or use more completely in future semesters:
- * Grus, Joel, Data Science from Scratch, 2nd Edition, 2019.
- Phillips, Jeff, Mathematical Foundations for Data Analysis, v0.6, 2019.
- Brunton, Steven L. and J. Nathan Kutz, Data-Driven Science and Engineering, 2019. (Book for MATH 3280)
- Companion website: http://databookuw.com.
- Leskovec, Jure et al., Mining of Massive Datasets, 3rd Edition, 2020. (Book for MATH 3280)
- * Geron, Aurelien, Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow, 3rd Edition, 2022. (Book for MATH 3480)
*These books are available for free. The Data Science from Scratch textbook is available on the O'Reilly publishing website. Snow College has a license with O'Reilly, which gives you free access to this book and many others. Here is how to access O'Reilly textbooks:
- Navigate to https://www.oreilly.com/library/view/temporary-access/?orpqLinks.
- Select "Institution not listed?"
- Enter you Snow College email address
- Search for and read the textbook
Other Resources that may be helpful:
- Intro to Data Science: YouTube Videos by Steve Brunton
- University of Utah Lectures by Jeff Phillips
Python Reference Sheets:
- DataCamp: Python Basics - Cheat Sheet
- DataCamp: NumPy - Cheat Sheet
- DataCamp: SciPy - Cheat Sheet
- DataCamp: Importing Data - Cheat Sheet
- DataCamp: Pandas Basics - Cheat Sheet
- DataCamp: Data Wrangling in Pandas - Cheat Sheet
- DataCamp: Data Visualization - Cheat Sheet
- DataCamp: Matplotlib - Cheat Sheet
- DataCamp: Seaborn - Cheat Sheet
- DataCamp: PySpark RDD - Cheat Sheet
- DataCamp: PySpark SQL - Cheat Sheet
- DataCamp: Top Machine Learning Algorithms - Cheat Sheet
- DataCamp: Keras - Cheat Sheet