Skip to content

GalvanizeDataScience/DS-Glossary-RPT1

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 

Repository files navigation

Glossary of Data Science Terms

How to update:

  1. Pull the latest version of the glossary on main branch. git pull
  2. Create a branch from main git checkout -b new_branch
  3. Make your changes on your new branch
  4. Commit and push your changes git commit -am 'commit_message'; git push -u origin new_branch
  5. Submit a pull request on github.
  6. Get one other student to review your changes and merge them into main.

Terms

Predictor - see feature, regressor, independent variable. A column that we use for prediction in regression.

Regressor

Independent Variable - same as predictor

Feature

Target

Outcome

Response Variable

Dependant Variable

Error

Residual = true_y - predicted_y

Regression - ML technique used to predict numeric or continuous values

Classification

Supervised

Unsupervised

In-Sample Prediction

Out-of-Sample Prediction

Cross Validation

Training Set

Testing Set

Validation Set

Holdout Set

RMSE = root mean squared error (what you try to minimize) its in the same units as what you are trying to predict.

MAE

R-Squared =

Logistic Regression -

About

student-led glossary of data science terms

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published