glue-crawler
There are 3 repositories under the glue-crawler topic.
SourabhSinghRana/real-time_crypto_data_pipeline_using_kafka
I use a Confluent Kafka cluster to produce and consume scraped data. In this project, I've created a real-time data pipeline that uses Kafka to scrape, process, and load data onto S3 in JSON format. Using a producer-consumer architecture, I apply minor transformations so that the data is in the right format before it is loaded onto S3.
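As a rough illustration of the "minor transformations" step described above, here is a minimal sketch of what a consumer might do to each message before writing it to S3. It is not taken from the repository: the field names (`symbol`, `price`, `ts`), the function names, and the S3 key layout are all hypothetical, and the Kafka consume loop and the S3 upload are left out so the example stays self-contained.

```python
import json
from datetime import datetime, timezone
from typing import Tuple


def transform(raw: dict) -> dict:
    """Normalize one scraped record. All field names here are assumed."""
    return {
        # Trim whitespace and upper-case the ticker symbol.
        "symbol": raw["symbol"].strip().upper(),
        # Strip currency formatting such as "$64,250.10" and parse as float.
        "price_usd": float(str(raw["price"]).replace("$", "").replace(",", "")),
        # Convert an epoch timestamp (seconds) to an ISO 8601 UTC string.
        "scraped_at": datetime.fromtimestamp(raw["ts"], tz=timezone.utc).isoformat(),
    }


def to_s3_object(record: dict, offset: int, prefix: str = "crypto") -> Tuple[str, bytes]:
    """Build a (key, body) pair for an upload. The key layout is an assumption."""
    day = record["scraped_at"][:10]  # YYYY-MM-DD part of the ISO timestamp
    key = f"{prefix}/dt={day}/{record['symbol']}-{offset}.json"
    return key, json.dumps(record).encode("utf-8")


if __name__ == "__main__":
    raw_message = {"symbol": " btc ", "price": "$64,250.10", "ts": 0}
    key, body = to_s3_object(transform(raw_message), offset=7)
    print(key)
    print(body.decode("utf-8"))
```

In a real consumer, the returned key and body would typically be handed to an S3 client call such as boto3's `put_object(Bucket=..., Key=key, Body=body)`; the bucket name and credentials are deployment-specific and not shown here.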
DieGit0/windfarm
Data engineering project that streams data produced by Python applications, runs an ETL process, and makes the data available for ad-hoc SQL queries in the AWS cloud
DieGit0/data_realtime_-_batch_analytics
Data streaming and batch processing using AWS services