ETL Project: Group 8

Members: Raj, Jaimie, Steven P.

Background:

Formula 1 is the highest class of auto racing and, to us, the most thrilling, with drivers racing around the track at speeds above 200 mph. It is sanctioned by the FIA (Fédération Internationale de l'Automobile) and started in 1950. The data below is for Constructors, the manufacturers of the cars being raced.

Process:

Data for our ETL project comes from Kaggle and consists of 4 CSV files. We will load each CSV file into Pandas as a dataframe and then clean the dataframes. Next, we will create a table for each file in pgAdmin 4 and an ERD showing how the tables relate. We will then connect our Jupyter notebook to the database, check the table names, and load the dataframes into the tables. Lastly, we will query the data to confirm that it is present and was uploaded properly.
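The steps above can be sketched roughly as follows. This is a minimal illustration, not our actual notebook: the inline sample rows, the column names, and the cleaning steps are assumptions made for the example, and an in-memory SQLite database stands in for the PostgreSQL database managed through pgAdmin 4 so that the snippet runs without a server.

```python
import io
import sqlite3

import pandas as pd

# Hypothetical stand-in for constructors.csv; the real file comes from Kaggle.
# The column names and rows here are assumptions for illustration.
RAW_CSV = io.StringIO(
    "constructorId,name,nationality\n"
    "1,McLaren,British\n"
    "1,McLaren,British\n"
    "2,Ferrari,\n"
)

# Extract: read the CSV into a dataframe.
constructors = pd.read_csv(RAW_CSV)

# Transform: drop duplicate rows and rename a column (illustrative cleaning only).
constructors = constructors.drop_duplicates().rename(
    columns={"constructorId": "constructor_id"}
)

# Create the table first, mirroring the step done in pgAdmin 4.
# SQLite in memory is an assumption used in place of PostgreSQL.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE constructors ("
    "constructor_id INTEGER PRIMARY KEY, name TEXT, nationality TEXT)"
)

# Load: append the cleaned dataframe into the existing table.
constructors.to_sql("constructors", conn, index=False, if_exists="append")

# Verify: check the table names, then confirm the row count matches the dataframe.
tables = pd.read_sql_query(
    "SELECT name FROM sqlite_master WHERE type='table'", conn
)
loaded_rows = pd.read_sql_query(
    "SELECT COUNT(*) AS n FROM constructors", conn
)["n"].iloc[0]
print(tables["name"].tolist(), loaded_rows == len(constructors))
```

Against a real PostgreSQL database, the same `to_sql` and `read_sql_query` calls would typically be given a SQLAlchemy engine instead of the SQLite connection, and the table-name check would query PostgreSQL's catalog rather than `sqlite_master`.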

Sources:

Source: https://www.kaggle.com/cjgdev/formula-1-race-data-19502017?select=constructors.csv

File names:

-Races

-Constructor Results

-Constructor Standings

-Constructors

Presentation: https://docs.google.com/presentation/d/1jLxB7sAr2oRuy52eunUcMoOhNrDreQ3tUAn99Jf4HSI/edit#slide=id.gb16784c331_0_103