About | Questions Analyzed | Data Source | Install Dependencies | Data Analysis Process | Summary Findings | Contributors
COVID-19 has been one of the most unpredictable and transformative pandemics in recorded history. It has been overwhelming because most businesses could not sustain, and many people lost their jobs. Our team decided to study the overall impacts of this pandemic on various businesses and workforce trends in the US.
This project analyzes COVID-19-related business trends through a detailed assessment of unemployment rates, economic fluctuations, and factors (e.g., demographics, travel, personal habits) that likely contribute to the growth or contraction US during the worldwide pandemic. Correlations and associations between various factors and increased downturn point to specific, consistent vulnerabilities.
COVID-19 impacts and the questions we wanted to analyze as part of this project:
- Unemployment - Which ethnic group, industry and states had the highest and lowest unemployment rates during the pandemic?
- Businesses - Which businesses economically benefitted or lost the most due to COVID-19?
- Airlines & Travel Routes - Which airlines and travel routes were most affected during the pandemic?
- Mobility Trends - Is there any relationship between the total COVID-19 cases and mobility trends (i.e., people staying at home) in the US?
- Safest Travel Destination - In the current situation, which is the US's safest travel destination right now?
-
Unemployment
-
Businesses
- Jupyter Notebook Script was written based on the python script available in kaggle to retrieve all NASDAQ traded stock prices on 2019/12/31(Pre COVID), 2020/03/31, 2020/06/30 and 2020/10/09 into a CSV file for further analysis.
-
Airlines & Travel Routes
- Wrote a Jupyter Notebook Script to split the csv file in the downloads folder available in Kaggle to retrieve all airlines data from January 2020 to June 2020 into nine CSV files (300,000 rows of data in each file) for further analysis.
-
Mobility Trends
-
Safest Travel Destination
- Businesses
pip install yfinance
- Mobility Trends
pip install sodapy
Update APP Token, Key ID, Key Secret from https://data.bts.gov in config.py file
- Safest Travel Destination
conda install plotly
conda install -c plotly plotly-orca
conda install -c plotly plotly-geo
conda install -c plotly plotly_express==0.4.1
The project collects data from multiple sources cited above, performs normalization and aggregations after that, and, based on the data, generates various plots/maps to analyze and determine useful insights and draw conclusions to answer the questions.
Performed following data clean up and wrangling steps wherever required. The cleaned and transformed data was used for the remaining steps.
- Cleaned up the data by removing NaNs and duplicates
- Converted the data to appropriate data types
- Merged multiple sources of data into one dataset and used it for the remaining steps
-
Unemployment
- Chose the US states that were most affected (or) less affected due to the pandemic to generate the plot
-
Businesses
- Calculated the percentage of change in stock price for each specified period
- Top 10 and Bottom 10 performing businesses were determined
-
Airlines & Travel Routes
- The route that has maximum cancellations was determined
- The Route that has maximum delays was determined
- Airline that has maximum cancellations was determined
- Airline that has maximum delays was determined
-
Safest Travel Destination
- Top 15 Safest travel destinations at County Level in the US was determined
-
Unemployment
- Generated a line plot with Unemployment Rates for all US states
- Generated a line plot with Unemployment Rates for the States that were most affected (or) less affected due to the pandemic
- Generated a line plot with Unemployment Rates for various races
- Generate a line plot with Unemployment Rates for ten chosen industries
-
Businesses
- Generated a line plot for top 10 performing businesses with Pre COVID, Mar 2020, June 2020 and Oct 2020 Profit % Changes
- Generated a bar plot for top 10 performing businesses with Oct 2020 Profit % Changes
- Generated a line plot for bottom 10 performing businesses with Pre COVID, Mar 2020, June 2020 and Oct 2020 Profit % Changes
- Generated a bar plot for bottom 10 performing businesses with Oct 2020 Profit % Changes
-
Airlines & Travel Routes
-
Mobility Trends
- Generated a scatter plot with a linear regression model to see if there is a correlation between population staying at home and covid cases at national level.
- Generated a scatter plot with linear regression model to see if there is correlation between population staying at home and covid deaths at national level.
- Generated a line plot to show the mobility and covid trend at national level
-
Safest Travel Destination
- Generated a bar plot showing the Percentage of Population difference between 2018 and 2019 at US County Level
- Generated a bar plot showing the Top 15 Safest Counties to Travel
- Generated a Plotly Map showing the County level Covid Cases
We looked across all the generated figures/tables/maps and noted the observations and inferences. We included these observations at the bottom of the notebook for each question.
-
Unemployment
-
Businesses
-
Airlines & Travel Routes
-
Mobility Trends
-
Safest Travel Destination
Summarized overall Analysis for all the questions in the project in Summary Findings Jupyter Notebook
- Python
- Pandas Library
- Matplotlib Library
- Jupyter Notebook
- Crystal Dalsania (cdalsania)
- Usha Saravanakumar (ushaakumaar)
- Prarthna Ashutoshmunidottir (Prarthna-design)
- Chahnaz Kbaisi (Chahnaz-Kbaisi)
- Susan Thomas (SusanCThomas)
- Jame'l Brown (JayyMaurice2020)