This repository contains projects related to big data. All projects use real-world data to address real-world problems.
1. Flight Delay Prediction
- The goal of this project is to generate insights from airline data that answer the following questions and help you avoid getting stuck at the airport:
- The 3 airlines with the highest and the 3 airlines with the lowest probability of being on schedule
- The 3 airports with the longest and the 3 airports with the shortest average taxi time per flight
- The busiest airports
- The most common reason for flight cancellations
- Dataset: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/HG7NV7
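The questions above can be sketched in plain Python as a starting point. This is a minimal illustration, not the project's actual pipeline. It assumes the CSV columns are named `UniqueCarrier`, `Origin`, `ArrDelay`, `TaxiOut`, `Cancelled` and `CancellationCode`, that missing values appear as `NA` or an empty string, and that "on schedule" means arriving no more than 15 minutes late. All function names and the sample rows are hypothetical and exist only to show the calculations.

```python
from collections import Counter

MISSING = ("", "NA")  # assumed markers for missing values

# Made-up rows for illustration only; not taken from the dataset.
SAMPLE_ROWS = [
    {"UniqueCarrier": "AA", "Origin": "JFK", "ArrDelay": "5", "TaxiOut": "30",
     "Cancelled": "0", "CancellationCode": ""},
    {"UniqueCarrier": "AA", "Origin": "JFK", "ArrDelay": "40", "TaxiOut": "20",
     "Cancelled": "0", "CancellationCode": ""},
    {"UniqueCarrier": "WN", "Origin": "DAL", "ArrDelay": "-3", "TaxiOut": "10",
     "Cancelled": "0", "CancellationCode": ""},
    {"UniqueCarrier": "WN", "Origin": "DAL", "ArrDelay": "NA", "TaxiOut": "NA",
     "Cancelled": "1", "CancellationCode": "B"},
]


def on_time_rate_by_airline(rows, threshold_minutes=15):
    """Share of completed flights per carrier that arrive within the threshold."""
    totals, on_time = Counter(), Counter()
    for row in rows:
        if row["Cancelled"] == "1" or row["ArrDelay"] in MISSING:
            continue  # cancelled flights have no arrival delay
        carrier = row["UniqueCarrier"]
        totals[carrier] += 1
        if float(row["ArrDelay"]) <= threshold_minutes:
            on_time[carrier] += 1
    return {carrier: on_time[carrier] / totals[carrier] for carrier in totals}


def average_taxi_out_by_origin(rows):
    """Mean taxi-out time in minutes per origin airport."""
    sums, counts = Counter(), Counter()
    for row in rows:
        if row["TaxiOut"] in MISSING:
            continue
        sums[row["Origin"]] += float(row["TaxiOut"])
        counts[row["Origin"]] += 1
    return {airport: sums[airport] / counts[airport] for airport in counts}


def most_common_cancellation_code(rows):
    """Most frequent cancellation code among cancelled flights, or None."""
    codes = Counter(
        row["CancellationCode"]
        for row in rows
        if row["Cancelled"] == "1" and row["CancellationCode"] not in MISSING
    )
    return codes.most_common(1)[0][0] if codes else None


def top_and_bottom(scores, n=3):
    """Return the n highest and n lowest (key, value) pairs, highest first."""
    ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    return ranked[:n], ranked[-n:]
```

To run this against the real files, the rows could be read with `csv.DictReader` from the standard library, which yields one dictionary per line with string values, matching the shape of `SAMPLE_ROWS`. For the full dataset a distributed engine would likely be a better fit than in-memory Python; the functions here only show the logic of each aggregation.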