Introduction to Exploratory Machine Learning
- Experiments with Muller Guido book
- Bishop, Pattern Recognition and Machine Learning
- Environment Setup
- Python 3.7
- VSCode (Free Open Source)
- Jupyter-lab
- Python Programming Revisited.
- Basic Python libraries
PIL
: Python Imaging library (pip3 install pillow
)- Numerical Computation:
numpy
- Visualization:
matplotlib.pyplot
- Several simple problem solving with
numpy
- random sampling functions
- sampling from probability density models
- Python libraries II
pandas
for tabular data maipulation as a replacement of Excel/Spreedsheet- Scientific computing:
scipy
&scikit-learn
- Probability and Statistics, revisted
- ref: chapter 2 of PRML (Pattern Recognition ad Machine Learning by Christopher M. Bishop)
- various probability density models
- problem solving
- Bayes' Theorem
- Intro. to Bayesian Data Analysis(https://www.youtube.com/watch?v=3OJEae7Qb_o)
- Bayesian Data Analysis, short intro & problem solving
- Think Bayes
- Supervisded Learning: Regression
-
Supervised Learning: Classification
-
Unsupervised Learning
- Dimensionality Reduction, Feature Extraction, Manifold Learning
- PCA, NMF, t-SNE
- Unsupervised Learning
- Clustering
- k-Means, Agglomerative Clutstering, DBSCAN
- Feature Engineering
- common sense (New Yong Citi Bike)
- Model Evaluation and Improvement
- Cross validation
- Grid Search
- Accuracy, Precision, Recall, F1
-
Algorithm Chains and Pipelines
-
Project Show-up
-
Datasets
- IRIS
- MNIST
- Seoul City Public Bike Rental Data (https://data.seoul.go.kr/dataList/datasetView.do?infId=OA-15182&serviceKind=1&srvType=F)
- Seoul City Bike Info from data.go.kr: https://www.data.go.kr/dataset/3045310/fileData.do
-
Kaggle