This repository consists a Cloudformation template and pyspark code sample for Glue streaming job to implement following ETL pipeline :
Related AWS Blog : https://aws.amazon.com/blogs/big-data/build-a-serverless-pipeline-to-analyze-streaming-data-using-aws-glue-apache-hudi-and-amazon-s3/
See CONTRIBUTING for more information.
This library is licensed under the MIT-0 License. See the LICENSE file.