/COVID-19-Worldwide-Data-Analysis-using-SQL

Leveraged SQL to generate insights on the global impact of COVID-19 and track vaccination progress worldwide.

COVID-19 Worldwide Data Analysis using SQL

This project focuses on analyzing data related to the COVID-19 pandemic using SQL queries. We will be working with a dataset that provides information about COVID-19 cases, deaths, population, and vaccinations across different locations and continents. The main objective is to gain valuable insights and answer important questions about the pandemic.

We will be using SQL queries to address different aspects of the COVID-19 data. Here is a summary of the queries and the insights they provide:

  1. Mortality Rate: This query calculates the percentage of deaths in relation to the total number of cases. It helps us understand how severe the virus is in different locations and continents.

  2. Percentage of Population Infected: This query determines the percentage of the population that has contracted COVID-19. By comparing the total number of cases to the population size, we can evaluate the spread of the virus in various regions.

  3. Highest Infection Rate: This query identifies the country with the highest infection rate in proportion to its population. It calculates the percentage of the population that has been infected and highlights the location with the highest value.

  4. Highest Death Rate: This query identifies the country with the highest death rate per population. It calculates the percentage of the population that has died due to COVID-19 and highlights the location with the highest mortality rate.

  5. Country with the Highest Death Count: This query determines the country that has experienced the highest total number of deaths. It provides insights into the countries most impacted by the virus.

  6. Continent with the Highest Death Count: This query identifies the continent with the highest total number of deaths. It helps us understand the overall impact of the pandemic on a larger scale.

  7. Global Cases per Day: This query presents the daily global COVID-19 cases, including the total number of new cases and new deaths. It also calculates the death rate as a percentage of new cases, giving us an indication of the severity of the virus over time.

  8. Rolling Count of People Vaccinated: This query calculates the cumulative count of vaccinated individuals on each day. It tracks the total number of people vaccinated and presents it alongside the population size. Additionally, it shows the percentage of the population that has received the vaccine.

Some SQL techniques used in this project include joins, CTEs, temp tables, window functions, and aggregate functions to derive insights from the data. In addition to the SQL queries, this project involves creating views to store the results of our analyses. These views can be used for further exploration, data visualization, and reporting purposes.

By analyzing the COVID-19 data using SQL, our project aims to provide meaningful insights into the impact of the virus on different regions, assess the effectiveness of vaccination efforts, and understand the severity of the pandemic over time. The findings from this project can contribute to informed decision-making, resource allocation, and the development of effective public health strategies.