/Payroll_Database_ETL

An ETL project for Public Payroll Data.

Primary LanguagePython

EPayroll Database ETL

The EParyroll Database ETL project is an Extract Transform Load job for the payroll data from data.gov. The project contains a data model, Python Scripts, and requirement files.

Install

There are two different ways you can prepare your environment to run the program. To recreate the conda environement, you can run conda env create -f environment.yaml . The second method is using pip and the requirement.txt file pip install -r requirements.txt . There are several places in the create_tables.py and etl.py files that you will need to fill out. You can find them using the following marks ### Insert ...

What's next:

The next steps for the project will be to create some test and refactor where necessary.

Data source:

https://www.data.gov/