Skip to content

The project is focused on performing exploratory data analysis (EDA) on the Zomato dataset, which contains comprehensive information about restaurants, cuisines, user ratings, and reviews from the Zomato platform.

License

Notifications You must be signed in to change notification settings

sofia-rajan/Feature-Engineering

Repository files navigation

Feature Engineering

Zomato Dataset - Exploratory Data Analysis (EDA)

This repository contains a Jupyter Notebook file for performing Exploratory Data Analysis (EDA) on the Zomato dataset. The EDA is conducted step by step, covering various aspects of the dataset including data import, data cleaning, analysis of numerical and categorical variables, visualization of relationships between variables, and more.

Overview

  • Project Title: Zomato Dataset - EDA Python Notebook
  • Date: February 25, 2024

Project Structure

The project consists of a single Jupyter Notebook file:

  • Zomato Dataset - EDA Python Notebook: This notebook contains the code and explanations for each step of the Exploratory Data Analysis process.

Contents

The notebook is structured into different sections:

  1. Step 0: Importing Libraries

    • Imports necessary libraries such as Pandas, NumPy, Matplotlib, and Seaborn for data analysis and visualization.
  2. Step 1: Import Data and Preview

    • Reads the Zomato dataset from a CSV file and displays the first few rows to get an overview of the data.
  3. Step 2: Data Analysis Part

    • Conducts various analyses on the dataset, including:
      • Checking for missing values.
      • Exploring numerical variables using descriptive statistics.
      • Exploring categorical variables.
      • Finding relationships between variables.
  4. Observations

    • Summarizes key observations and insights obtained from the data analysis.

Usage

To run the notebook:

  1. Clone this repository to your local machine.
  2. Install Jupyter Notebook and the required dependencies (Pandas, NumPy, Matplotlib, Seaborn).
  3. Open the Jupyter Notebook file in your Jupyter Notebook environment.
  4. Execute each cell in the notebook sequentially to perform the analysis steps.

Data Source

The Zomato dataset used in this project is available on GitHub: Dataset Link

Dependencies

  • Pandas
  • NumPy
  • Matplotlib
  • Seaborn

Author

This EDA project was conducted by Sofia Rajan.

License

This project is licensed under the [GNU General Public License] - see the LICENSE file for details.

About

The project is focused on performing exploratory data analysis (EDA) on the Zomato dataset, which contains comprehensive information about restaurants, cuisines, user ratings, and reviews from the Zomato platform.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published