PHIDELIST/AWS-CDK-micro-ETL-pipeline

This project presents serverless event driven ETL pipeline leveraging Python Data Wrangler to process raw files uploaded to an input s3 bucket using a Lambda function and store the processed files in an output s3 bucket. It has a glue crawler that update the processed file’s metadata to glue catalogue and uses Athena to query processed files.

Python

Watchers

PHIDELIST
Nairobi