- Project overview
- Data sources
- Tools used
- Data cleaning / preparation
- Exploratory Data Analysis
- Data analysis
- Results / Findings
- Recommendations
- References
This data analysis project aims to provide insights into weather data (2012), Uber data (2016) & Inventory data. By analyzing various aspects of the weather data, Uber data & Inventory seperately, we seek to identify trends, make data-driven recommendations, and gain a deeper understanding of these factors.
The primary dataset used for this analysis are
- Weather data: kaggle weather
- Uber data: kaggle Uber
- Inventory data: kaggle inventory
- EXCEL - For data cleaning. Download here
- SQL - For data analysing.
- Python - Exploratory data analysis.
In the initial data preparation phase, we performed the following tasks:
- Data loading and inspection.
- Handling missing values.
- Data cleaning and formatting.
EDA involves exploring the datas to answer key questions, such as:
- Weather data
- What is the overall temperature trend?
- How often does it rains?
- What is the wind speed trend?
- Uber data (2016)
- what is the most opted trip category and purpose?
- what are top 10 routes?
- which is the most opted route?
- which is the longest travelled route?
- which month has the most number of trips?
- Inventory data
Include some interesting codes or features worked with
Weather data analysis
- Temperature trend analysis
SELECT DATE(Date_Time)AS date,ROUND(AVG(Temp_C ),0)AS temp_in_degree_celcius
FROM `weather_datas.weather`
GROUP BY date
ORDER BY date ASC;
- precipitation pattern analysis
SELECT weather,COUNT (Weather)AS weather_count
FROM `weather_datas.weather`
GROUP BY Weather
HAVING COUNT (Weather) >100
ORDER BY weather_count DESC;
- Windspeed analysis
SELECT DATE(Date_Time)AS date,ROUND (AVG (Wind_Speed_km_h),0)AS windspeed_km_hr, ROUND(AVG(temp_C),0)AS temp_in_degree_celcius
FROM `weather_datas.weather`
GROUP BY date
ORDER BY date ASC;
Uber data analysis
SELECT START_MONTH AS MONTH,ROUND (SUM(SUM_of_MILES),0)AS MILES_COVERED
FROM `braided-topic-402311.Babynames.uberr`
GROUP BY START_MONTH
ORDER BY START_MONTH ASC;
SELECT month,SUM(count)AS count
FROM `braided-topic-402311.Babynames.ube`
GROUP BY month;
Inventory data analysis
Weather Data analysis results are summarized as follows:
- Coldest month is January and Hottest month is July.
- 3.7% of the days in 2012 are rainy.
- Average windspeed of 15 Km/hr with a maximum value of 42km/hr is observed.
Uber Data analysis results are summarized as follows:
- The most opted trip category and purpose are Business and Meeting.
- The most opted route is Morrisville - Cary.
- The longest travelled route is Latta - Jacksonville.
- December has the most number of trips.