In real-world data science, it's extremely rare to have an ideal data set to begin with. Instead data often has to be aggregated from disparate sources and a good amount of data cleaning has to be done before we can do any analysis and modeling work. In this project, we'll go through the first part of a complete data science project which begins with the acquisition of raw data. The primary focus in this project will be on data acquisition, preparation, cleaning, and aggregation followed by exploratory data analyis. We will combine several messy data sets into a single clean one before commencing analysis. For the purposes of this project, we'll be using data about New York City public schools.
-
Notifications
You must be signed in to change notification settings - Fork 6
In this project, we aggregate several messy datasets about schools in New York city acquired from the NYC OpenData website. Afterwards we find correlations to discover some interesting insights.
kshehzad221/Aggregating-and-Analyzing-NYC-High-School-Data
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
In this project, we aggregate several messy datasets about schools in New York city acquired from the NYC OpenData website. Afterwards we find correlations to discover some interesting insights.
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published