Restaurant Dataset Analysis

This project focuses on analyzing a dataset of restaurants, including various aspects such as online orders, ratings, and votes. The analysis involves visualizations and statistical exploration using Python libraries like pandas, matplotlib, and seaborn.

Table of Contents

Project Overview

This project aims to provide insights into the restaurant industry through data analysis. The dataset includes information on customer reviews, online orders, ratings, and other key metrics that influence restaurant performance. The analysis explores trends, patterns, and correlations within the data.

Installation

To run the project locally, you'll need to have the following installed:

  • Python 3.x
  • Jupyter Notebook
  • Required Python libraries (can be installed using requirements.txt)

Step-by-Step Setup:

  1. Clone this repository:
    git clone https://github.com/yourusername/restaurant-dataset-analysis.git
  2. Navigate to the project directory:
    cd restaurant-dataset-analysis
  3. Install the required dependencies:
    pip install -r requirements.txt
  4. Start Jupyter Notebook:
    jupyter notebook
  5. Open and run the Restaurants_Analysis.ipynb notebook.

Dataset

The dataset used for this analysis contains restaurant information, including:

  • Restaurant names
  • Online order availability
  • Ratings
  • Votes

Make sure the dataset (restaurants.csv) is located in the correct path when running the notebook.

Analysis

The analysis is divided into several key sections:

  1. Data Cleaning: Handling missing values, correcting data types, and filtering relevant information.
  2. Exploratory Data Analysis (EDA): Visualizations and summary statistics to uncover trends and insights.
  3. Statistical Analysis: Investigating relationships and correlations between different variables.
  4. Conclusion: Summarizing the findings and providing actionable insights.

Results

Some of the key findings from the analysis include:

  • The distribution of online orders across restaurants.
  • Correlation between ratings and votes.
  • Trends in customer preferences.