Coding exercises for Data Engineer role.
Consists in PostgreSQL scripts and Bash scripts (to use relative paths).
- Tasks
-
Which are the top 3 exchange with the most transactions in the file?
-
In August 2017, which 2 companyNames had the highest combined valueEUR?
-
For 2017, only considering transactions with tradeSignificance 3, what is the percentage of transactions per month?
Python scripts to read, analyze and extract data from .csv
- Tasks
-
Which are the top 3 source with the highest ratio of Buy to Sell transactions weighted by the number of shares per transaction?
-
Which are the top 3 currency by the total numerical value of trades in that currency?
-
What is the total number of transactions where inputdate was more than 2 weeks after tradedate?
Software used:
- PostgreSQL 13
- Python 3.8.0
In order to run the Python script:
- "cd" inside "PART 2" folder.
- Install required libraries using
pip install -r requirements.txt
- Run
python main.py
- at the end, you'll be asked if you want to export function results as .csv files.