Project Proposal
Extract: We will extract data from the following sources:
- Vehicle Data: https://www.kaggle.com/gagandeep16/car-sales
- Belarus Used Cars Catalog: https://www.kaggle.com/lepchenkov/usedcarscatalog
- Japan Used Cars Dataset: https://www.kaggle.com/doaaalsenani/used-cars-dataets
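A minimal sketch of the extract step, assuming each Kaggle dataset has been downloaded manually as a CSV into a local folder. The file names in `SOURCE_FILES` and the `extract` helper are hypothetical placeholders, not the actual names of the Kaggle files.

```python
from pathlib import Path

import pandas as pd

# Hypothetical local file names; the real Kaggle downloads may be named differently.
SOURCE_FILES = {
    "vehicle": "car_sales.csv",
    "belarus": "belarus_used_cars.csv",
    "japan": "japan_used_cars.csv",
}


def extract(data_dir):
    """Read each source CSV found in data_dir into a DataFrame, keyed by source name."""
    frames = {}
    for name, filename in SOURCE_FILES.items():
        path = Path(data_dir) / filename
        if path.exists():  # skip sources that have not been downloaded yet
            frames[name] = pd.read_csv(path)
    return frames
```

Calling `extract("data")` would return a dictionary with one DataFrame per file present in the `data` folder.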
Transform: We will use pandas to transform the datasets. We will select the important columns from each dataset, remove irrelevant data and empty rows, and combine the datasets into the relevant tables. Planned operations:
- Clean
- Join
- Stack
- Drop/Filter
- Append
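The operations listed above might look roughly like this in pandas. This is a sketch on tiny made-up frames, not the real data: the column names (`make`, `model`, `price_usd`), the `clean` helper, and the `make_model_id` key are all assumptions for illustration.

```python
import pandas as pd

# Tiny made-up frames standing in for two of the used-car datasets.
# Column names are assumptions for illustration.
belarus = pd.DataFrame({
    "make": ["Ford ", "toyota", None],
    "model": ["Focus", "Corolla", "Unknown"],
    "price_usd": [5000, 7000, 100],
    "color": ["red", "blue", "grey"],
})
japan = pd.DataFrame({
    "make": ["Toyota", "Honda"],
    "model": ["Corolla", "Civic"],
    "price_usd": [6500, 6000],
})


def clean(df, region):
    """Keep the shared columns, drop rows with no make, and normalise make names."""
    # Drop/filter: keep only the shared columns and rows that have a make.
    out = df[["make", "model", "price_usd"]].dropna(subset=["make"]).copy()
    # Clean: trim whitespace and use consistent capitalisation.
    out["make"] = out["make"].str.strip().str.title()
    out["region"] = region
    return out


# Stack/append: place the cleaned regional frames on top of each other.
used_cars = pd.concat(
    [clean(belarus, "Belarus"), clean(japan, "Japan")], ignore_index=True
)

# Build a make & model key table, then join the key back onto the stacked data.
make_model = used_cars[["make", "model"]].drop_duplicates().reset_index(drop=True)
make_model["make_model_id"] = range(1, len(make_model) + 1)
used_cars = used_cars.merge(make_model, on=["make", "model"], how="left")
```

Normalising the make names before building the key table matters here: without it, "toyota" and "Toyota" would receive different keys and the regional comparison would split one brand into two.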
Load: We will load the data into a PostgreSQL relational database. Expected tables:
- Important subset of data from each dataset (x3)
- Make & model key table
- Combined tables that demonstrate combinations of our tables, such as comparing horsepower data between our used car datasets or examining the car brands available in each region
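A minimal sketch of the load step using `DataFrame.to_sql`. To keep it runnable without a database server, it writes to an in-memory SQLite database as a stand-in for PostgreSQL. The table names, column names, and rows are assumptions for illustration.

```python
import sqlite3

import pandas as pd

# Made-up rows; table and column names are assumptions for illustration.
make_model = pd.DataFrame({
    "make_model_id": [1, 2],
    "make": ["Toyota", "Honda"],
    "model": ["Corolla", "Civic"],
})
japan_used_cars = pd.DataFrame({"make_model_id": [1, 2], "price_usd": [6500, 6000]})

# In-memory SQLite stands in for PostgreSQL so the sketch runs without a server.
conn = sqlite3.connect(":memory:")
make_model.to_sql("make_model", conn, index=False, if_exists="replace")
japan_used_cars.to_sql("japan_used_cars", conn, index=False, if_exists="replace")

# A combined result: join a subset table to the make & model key table.
combined = pd.read_sql_query(
    """
    SELECT m.make, m.model, j.price_usd
    FROM japan_used_cars AS j
    JOIN make_model AS m ON m.make_model_id = j.make_model_id
    ORDER BY m.make
    """,
    conn,
)
```

For the actual PostgreSQL target, the same `to_sql` calls should work with a SQLAlchemy engine (created with `create_engine` and a PostgreSQL connection URL) passed in place of `conn`.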