In this project, we are working on a real-world dataset of zomato, one of the most used food ordering platforms. This project aims on cleaning the dataset, analyze the given dataset, and mining informational quality insights.
- This project will help you understand how a real-world database is analyzed using SQL.
- how to get maximum available insights from the dataset, pre-process the data using python for better upcoming performance.
- how a structured query language helps us retrieve useful information from the database.
- Pre-Processing The Data And Removing Unwanted Columns
- Renaming And Selecting Columns In A Dataset
- Dealing With Null Values In A Dataset
- Identifying Duplicate Data In A Dataset
- Text Cleaning
- Unique Value Check And Irrelevant Value Handling
- Cleaning And Exporting Zomato Dataset
- Data Cleaning And Analysis
- For a high-level overview of the hotels, provide us the top 5 most voted hotels in the delivery category.
- The rating of a hotel is a key identifier in determining a restaurant’s performance. Hence for a particular location called Banashankari find out the top 5 highly rated hotels in the delivery category.
- Compare the ratings of the cheapest and most expensive hotels in Indiranagar.
- Online ordering of food has exponentially increased over time. Compare the total votes of restaurants that provide online ordering services and those that don’t provide online ordering services.
- Number of votes defines how much the customers are involved with the service provided by the restaurants For each Restaurant type, find out the number of restaurants, total votes, and average rating. Display the data with the highest votes on the top( if the first row of output is NA display the remaining rows).
- What is the most liked dish of the most-voted restaurant on Zomato(as the restaurant has a tie-up with Zomato, the restaurant compulsorily provides online ordering and delivery facilities.
- To increase the maximum profit, Zomato is in need to expand its business. For doing so Zomato wants the list of the top 15 restaurants which have min 150 votes, have a rating greater than 3, and is currently not providing online ordering. Display the restaurants with the highest votes on the top.