Skip to content

Commit

Permalink
updating final review questions
Browse files Browse the repository at this point in the history
  • Loading branch information
kvarada committed Dec 3, 2024
1 parent 7ed02ac commit 9cc34a8
Showing 1 changed file with 73 additions and 4 deletions.
77 changes: 73 additions & 4 deletions lectures/notes/final-exam-review-guiding-question.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -99,7 +99,8 @@
"- What are the advantages of cross-validation?\n",
"- Why it's important to look at sub-scores of cross-validation?\n",
"- What is the fundamental trade-off in supervised machine learning?\n",
"- What is the Golden rule in supervised machine learning? "
"- What is the Golden rule in supervised machine learning?\n",
"- Scenarios for data leakage "
]
},
{
Expand All @@ -113,8 +114,28 @@
"- KNNs, SVM RBFs\n",
"- Linear models \n",
"- Random forests\n",
"- Grading Boosyinh, LGBM, CatBoost\n",
"- Stacking, averaging "
"- Grading Boosting, LGBM, CatBoost\n",
"- Stacking, averaging\n",
"\n",
"**Comparison of models**\n",
"| **Model** | Parameters and hyperparameters | **Strengths** | **Weaknesses** |\n",
"|------------------|--------------------------------|---------------------------|---------------------------|\n",
"| **Decision Trees** | | | |\n",
"| **KNNs** | | | |\n",
"| **SVM RBF** | | | |\n",
"| **Linear models** | | | | \n",
"| **Random forests** | | | | \n",
"| **Gradient boosting** | | | | \n",
"| **Stacking** | | | | \n",
"| **Averaging** | | | | \n"
]
},
{
"cell_type": "markdown",
"id": "3b43fa4c-5691-4397-a057-a881d1d94179",
"metadata": {},
"source": [
"<br><br>"
]
},
{
Expand All @@ -133,6 +154,22 @@
"- What are various data preprocessing steps such as scaling, OHE, ordinal encoding, and handling missing values. Why and when each step is necessary?"
]
},
{
"cell_type": "markdown",
"id": "46551fbd-cf55-418c-867d-f8c7705fe7d1",
"metadata": {},
"source": [
"**`sklearn` Transformers** \n",
"| **Transformer** | Hyperparameters | **When to use?** |\n",
"|------------------|--------------------------------|---------------------------|\n",
"| `SimpleImputer` | | | \n",
"| `StandardScaler` | | | \n",
"| `OneHotEncoder` | | | \n",
"| `OrdinalEncoder` | | | \n",
"| `CountVectorizer` | | | \n",
"| `TransformedTargetRegressor` | | |\n"
]
},
{
"cell_type": "markdown",
"id": "bf30b454-9f43-481e-9b1c-da43031fc0d8",
Expand Down Expand Up @@ -586,7 +623,14 @@
"\n",
"- What makes hyperparameter optimization a hard problem?\n",
"- What are two different tools provided by sklearn for hyperparameter optimization? \n",
"- What is optimization bias? "
"- What is optimization bias?\n",
"\n",
"\n",
"| **Method** | Strengths/Weaknesses | **When to use?** |\n",
"|------------------|--------------------------------|---------------------------|\n",
"| Nested for loops | | | \n",
"| Grid search | | | \n",
"| Random search | | | "
]
},
{
Expand All @@ -604,6 +648,31 @@
"- What are advantages of RMSE or MAPE over MSE? "
]
},
{
"cell_type": "markdown",
"id": "7e11a3f7-0ec3-4306-a84e-43fe74869e20",
"metadata": {},
"source": [
"**Classification Metrics**\n",
"| **Metric** | How to generate/calculate? | **When to use?** |\n",
"|------------------|--------------------------------|---------------------------|\n",
"| Accuracy | | | \n",
"| Precision | | | \n",
"| Recall | | | \n",
"| F1-score | | | \n",
"| AP score | | | \n",
"| AUC | | | \n",
"\n",
"\n",
"**Regression Metrics**\n",
"| **Metric** | How to generate/calculate? | **When to use?** |\n",
"|------------------|--------------------------------|---------------------------|\n",
"| MSE | | | \n",
"| RMSE | | | \n",
"| r2 score | | | \n",
"| MAPE | | | "
]
},
{
"cell_type": "markdown",
"id": "a1e6c11b-ee26-4d37-87ea-2b6bd3560f60",
Expand Down

0 comments on commit 9cc34a8

Please sign in to comment.