- This repo contains several projects written in Python that use SQL and NoSQL databases such as PostgreSQL, Apache Cassandra, and SQLite3.
- The main focus of these projects is to extract data from different sources and load it into a database chosen for the nature and use case of each project.
- We use APIs to connect to the data and transform it with Python libraries and functions.
- Some projects use AWS services such as S3, Redshift, IAM, Glue, and EMR.
- Schemas are transformed from 3NF to a star schema to simplify queries and improve query performance.
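
The extract, transform, and load flow described above can be sketched with the standard-library `sqlite3` module. This is a minimal illustration, not code taken from the projects: the sample records, the `dim_song` and `fact_play` tables, and the function names are all hypothetical, and an in-memory SQLite database stands in for PostgreSQL, Cassandra, or Redshift.

```python
import sqlite3

# Hypothetical source records, standing in for data pulled from an API or a file.
RAW_EVENTS = [
    {"user": " Alice ", "song": "Blue", "artist": "Joni", "seconds": "210"},
    {"user": "bob", "song": "Blue", "artist": "Joni", "seconds": "180"},
    {"user": "alice", "song": "Hurt", "artist": "Cash", "seconds": "200"},
]


def transform(event):
    """Normalize whitespace, casing, and types for one raw record."""
    return {
        "user": event["user"].strip().lower(),
        "song": event["song"].strip(),
        "artist": event["artist"].strip(),
        "seconds": int(event["seconds"]),
    }


def load(conn, events):
    """Load records into a small star schema: one dimension, one fact table."""
    conn.executescript(
        """
        CREATE TABLE IF NOT EXISTS dim_song (
            song_id INTEGER PRIMARY KEY,
            title   TEXT NOT NULL,
            artist  TEXT NOT NULL,
            UNIQUE (title, artist)
        );
        CREATE TABLE IF NOT EXISTS fact_play (
            play_id   INTEGER PRIMARY KEY,
            user_name TEXT NOT NULL,
            song_id   INTEGER NOT NULL REFERENCES dim_song (song_id),
            seconds   INTEGER NOT NULL
        );
        """
    )
    for e in events:
        # Insert the dimension row once; repeated songs are skipped.
        conn.execute(
            "INSERT OR IGNORE INTO dim_song (title, artist) VALUES (?, ?)",
            (e["song"], e["artist"]),
        )
        song_id = conn.execute(
            "SELECT song_id FROM dim_song WHERE title = ? AND artist = ?",
            (e["song"], e["artist"]),
        ).fetchone()[0]
        # The fact row references the dimension by its surrogate key.
        conn.execute(
            "INSERT INTO fact_play (user_name, song_id, seconds) VALUES (?, ?, ?)",
            (e["user"], song_id, e["seconds"]),
        )
    conn.commit()


def seconds_per_artist(conn):
    """Typical star-schema query: one join from the fact table to a dimension."""
    return conn.execute(
        """
        SELECT d.artist, SUM(f.seconds)
        FROM fact_play AS f
        JOIN dim_song AS d ON d.song_id = f.song_id
        GROUP BY d.artist
        ORDER BY d.artist
        """
    ).fetchall()


conn = sqlite3.connect(":memory:")
load(conn, [transform(e) for e in RAW_EVENTS])
result = seconds_per_artist(conn)
print(result)
```

Keeping descriptive attributes in `dim_song` and measurements in `fact_play` means an aggregate like this one needs a single join, which is the simplification the star schema is meant to provide.
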
amod26/DataEngineeringWithPython
Various Projects on Python related to Data Engineering
Jupyter Notebook