This project is focused on analysing the legacy employee data of Pewlett Hackard, a fictional company, during the 1980s and 1990s. The goal is to perform data modeling, data engineering, and data analysis on the available data from six CSV files.
The project is divided into three main parts:
Inspected the CSV files and created an Entity Relationship Diagram (ERD) of the tables using QuickDBD.
1.Created a table schema for each of the six CSV files, specifying data types, primary keys, foreign keys, and other constraints.
Performed various SQL queries to answer questions about the data, such as employee details, hire dates, managers, department information, and frequency counts of employee last names.
Data Engineering: Table definitions, column data types, primary keys, foreign keys, constraints, and table relationships
The project is deployed to a GitHub repository, including the necessary files for data modeling, data engineering, and data analysis.
The data used in this project was generated by Mockaroo, LLC (2022), a realistic data generator.