Building Serverless Data Lakes on AWS

Note: this fork uses US West (Oregon)

Author: Unni Pillai | Amazon Web Services | Twitter | Linkedin

Updated by: Vikas Omer | Amazon Web Services | Linkedin

Design serverless data lake architecture
Build a data processing pipeline and Data Lake using Amazon S3 for storing data
Use Amazon Kinesis for real-time streaming data
Use AWS Glue to automatically catalog datasets
Run interactive ETL scripts in an Amazon SageMaker Jupyter notebook connected to an AWS Glue development endpoint
Query data using Amazon Athena & visualize it using Amazon QuickSight

Pre-requisites:

Please do check on the pre-requisites for each module before starting the activities within the module.

Also, do not forget to clean up the resources at the end of the workshop!