data
images
README.md
Topic1-High_Dimensional_Data.ipynb
Topic2-Dimensionality_Reduction.ipynb
Topic3-Clustering.ipynb
Topic4-Generative_Models.ipynb

README.md

Exploratory Data Analysis

When working with large, high-dimensional datasets, it can be difficult to gain insight into the data. A range of techniques and algorithms can assist in this process, many of which are classified as "unsupervised" data analysis algorithms (the data has features/inputs but no labeled outputs). This module explores a few of these approaches and shows how they can be used to visualize and analyze complex datasets.

Associated Notebooks:

Lectures

High Dimensional Data - Introduction to high-dimensional data, inspecting and visualizing features.
Dimensionality Reduction - Performance assessment, principal component analysis, and manifold learning.
Clustering - Expectation-Maximization models (k-means), density-based models (mean shift), and hierarchical models.
Generative Models - Intro to generative models, Gaussian mixture models, kernel density estimation, and not-so-naïve Bayes.
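
As a rough illustration of the Expectation-Maximization style of clustering named in the Clustering lecture, the sketch below implements k-means (Lloyd's algorithm) using only the Python standard library. It is not taken from the notebooks: the `kmeans` helper, its parameters, and the toy two-blob dataset are all assumptions made for illustration, and in practice a library implementation would likely be preferable.

```python
import random


def kmeans(points, k, n_iter=100, seed=0):
    """Cluster points with Lloyd's algorithm (hypothetical helper, not from the notebooks)."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)  # initialize centers from the data itself
    labels = [0] * len(points)
    for _ in range(n_iter):
        # Assignment step (E-like): attach each point to its nearest center.
        labels = [
            min(range(k), key=lambda j: sum((a - b) ** 2 for a, b in zip(p, centers[j])))
            for p in points
        ]
        # Update step (M-like): move each center to the mean of its members.
        new_centers = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                new_centers.append(tuple(sum(c) / len(members) for c in zip(*members)))
            else:
                new_centers.append(centers[j])  # leave an empty cluster's center in place
        if new_centers == centers:  # stop once the centers no longer move
            break
        centers = new_centers
    return centers, labels


# Toy data: two well-separated 2-D blobs (made up for illustration).
rng = random.Random(1)
blob_a = [(rng.gauss(0, 0.5), rng.gauss(0, 0.5)) for _ in range(50)]
blob_b = [(rng.gauss(5, 0.5), rng.gauss(5, 0.5)) for _ in range(50)]
centers, labels = kmeans(blob_a + blob_b, k=2)
```

The two alternating steps mirror the expectation and maximization phases: assignments are recomputed with the centers held fixed, then the centers are recomputed with the assignments held fixed. On this toy data the returned `labels` should separate the two blobs, though k-means in general depends on initialization and can settle in a poor local optimum.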
