updating final review questions

UBC-CS · Dec 3, 2024 · 9cc34a8
1 parent 7ed02ac
Showing 1 changed file with 73 additions and 4 deletions.
diff --git a/lectures/notes/final-exam-review-guiding-question.ipynb b/lectures/notes/final-exam-review-guiding-question.ipynb
@@ -99,7 +99,8 @@
     "- What are the advantages of cross-validation?\n",
     "- Why it's important to look at sub-scores of cross-validation?\n",
     "- What is the fundamental trade-off in supervised machine learning?\n",
-    "- What is the Golden rule in supervised machine learning? "
+    "- What is the Golden rule in supervised machine learning?\n",
+    "- Scenarios for data leakage "
    ]
   },
   {
@@ -113,8 +114,28 @@
     "- KNNs, SVM RBFs\n",
     "- Linear models \n",
     "- Random forests\n",
-    "- Grading Boosyinh, LGBM, CatBoost\n",
-    "- Stacking, averaging "
+    "- Gradient boosting, LGBM, CatBoost\n",
+    "- Stacking, averaging\n",
+    "\n",
+    "**Comparison of models**\n",
+    "| **Model**        | Parameters and hyperparameters | **Strengths**  | **Weaknesses**     |\n",
+    "|------------------|--------------------------------|---------------------------|---------------------------|\n",
+    "| **Decision Trees**               |  |  |  |\n",
+    "| **KNNs**              |  |  |  |\n",
+    "| **SVM RBF**            |  |  |  |\n",
+    "| **Linear models**         |  |  | | \n",
+    "| **Random forests**         |  |  | | \n",
+    "| **Gradient boosting**         |  |  | | \n",
+    "| **Stacking**         |  |  | | \n",
+    "| **Averaging**         |  |  | | \n"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "3b43fa4c-5691-4397-a057-a881d1d94179",
+   "metadata": {},
+   "source": [
+    "<br><br>"
    ]
   },
   {
@@ -133,6 +154,22 @@
     "- What are various data preprocessing steps such as scaling, OHE, ordinal encoding, and handling missing values. Why and when each step is necessary?"
    ]
   },
+  {
+   "cell_type": "markdown",
+   "id": "46551fbd-cf55-418c-867d-f8c7705fe7d1",
+   "metadata": {},
+   "source": [
+    "**`sklearn` Transformers** \n",
+    "| **Transformer**        | Hyperparameters | **When to use?** |\n",
+    "|------------------|--------------------------------|---------------------------|\n",
+    "| `SimpleImputer`  |  |  | \n",
+    "| `StandardScaler`              |  |  | \n",
+    "| `OneHotEncoder`            |  |  | \n",
+    "| `OrdinalEncoder`         |  |  | \n",
+    "| `CountVectorizer`        |  |  | \n",
+    "| `TransformedTargetRegressor` | | |\n"
+   ]
+  },
   {
    "cell_type": "markdown",
    "id": "bf30b454-9f43-481e-9b1c-da43031fc0d8",
@@ -586,7 +623,14 @@
     "\n",
     "- What makes hyperparameter optimization a hard problem?\n",
     "- What are two different tools provided by sklearn for hyperparameter optimization?  \n",
-    "- What is optimization bias? "
+    "- What is optimization bias?\n",
+    "\n",
+    "\n",
+    "| **Method**        | Strengths/Weaknesses | **When to use?** |\n",
+    "|------------------|--------------------------------|---------------------------|\n",
+    "| Nested for loops |  |  | \n",
+    "| Grid search  |  |  | \n",
+    "| Random search  |  |  | "
    ]
   },
   {
@@ -604,6 +648,31 @@
     "- What are advantages of RMSE or MAPE over MSE? "
    ]
   },
+  {
+   "cell_type": "markdown",
+   "id": "7e11a3f7-0ec3-4306-a84e-43fe74869e20",
+   "metadata": {},
+   "source": [
+    "**Classification Metrics**\n",
+    "| **Metric**        | How to generate/calculate? | **When to use?** |\n",
+    "|------------------|--------------------------------|---------------------------|\n",
+    "| Accuracy  |  |  | \n",
+    "| Precision              |  |  | \n",
+    "| Recall          |  |  | \n",
+    "| F1-score         |  |  | \n",
+    "| AP score        |  |  | \n",
+    "| AUC        |  |  | \n",
+    "\n",
+    "\n",
+    "**Regression Metrics**\n",
+    "| **Metric**        | How to generate/calculate? | **When to use?** |\n",
+    "|------------------|--------------------------------|---------------------------|\n",
+    "| MSE  |  |  | \n",
+    "| RMSE              |  |  | \n",
+    "| $R^2$ score          |  |  | \n",
+    "| MAPE         |  |  | "
+   ]
+  },
   {
    "cell_type": "markdown",
    "id": "a1e6c11b-ee26-4d37-87ea-2b6bd3560f60",