The main goal of this lecture:
- introduce new students in the class (name, program, goal for data analytics)
for all students
-
Pair T-test: a. example question, concepts, data analysis
-
Chi Square Test: a. example question, concepts, data analysis
-
Lineaer Regression: a. example question, concepts, data analysis b. what if we double the sample data, how would the linear regression result change? c. what if we replace the missing data with average, how would the linear regression result change?
for the new students only (a makeup 30min section after class)
- terminal operation: call jupyter notebook, learn about 'pip install XXXXX'
- notebook from week1: intro to Pandas, load dataset into jupyter notebook, data exploration analysis, data cleaning
- notebook from week1: learn about data structure (Lecture_One_Data_Structure.ipynb)
- make sure all students are the members of ColumbiaPython organization in Github
- every student create individual project in ColumbiaPython (nameing: Project_lastname)
- start writing proposal in github as a readme file
- upload new files into github (reference papers, data & codes)