awsglue

There are 22 repositories under awsglue topic.

prestodb/prestorials
Tutorials and examples of how to deploy Presto and connect it to different data sources
20 10 312
Akanksha-tetwar/YouTube-Trending-video-analysis-ETL-using-AWS-Services
In this project I have used the Trending YouTube Video Statistics data from Kaggle to analyze and prepare it for usage.
1 1 00
TanishkaMarrott/Real-Time-Streaming-Analytics-with-Kinesis-Flink-and-OpenSearch
This project focuses on real-time data streaming with Kinesis, using Flink for advanced processing and OpenSearch for analytics. This architecture has succinctly handled the complete lifecycle of data from ingestion to actionable insights, making it a comprehensive solution.
Language:Java1 1 00
Undisputed-jay/SpotifyAPI-Data-Engineering-Project
This projects uses ETL (Extract, Transform and Load) pipeline to extract data from Spotify using its API and loads the data to a data source(AWS Athena). The entire pipeline will be built using Amazon Web Services (AWS).
Language:Jupyter Notebook1 1 00
bhavanachitragar/Superstore-Data-Analysis-using-AWS
This project builds a pipeline to analyze Superstore sales data using the power of AWS. It transforms the data to make it ready for exploration. Querying the transformed data using SQL queries to uncover trends and patterns. Analyzing results and creates easy-to-understand visualizations, providing clear insights into Superstore sales performance.
0 1 00
catherman/Data-Science-Miscellaneous
AWS S3 & Sentiment Analysis, Basic Plotting with Matplotlib, & Supervised Learning & Machine Learning with Sklearn.
Language:Jupyter Notebook0 2 00
Harikishan-AI/Harikishan-AI
I am dedicated to delivering innovative solutions that align with business objectives while ensuring optimal performance, reliability, and security. My strong analytical skills, attention to detail, and problem-solving abilities drive me to create effective and efficient solutions.
0 1 00
iqrabismii/Big-Data-Projects-
Projects on Big Data Using Pyspark and AWS
Language:Jupyter Notebook0 1 00
Mopheshi/DataEngineeringSpecialization
Data Engineering Specialization offered by Joe Reis in partnership with DeepLearning.AI through Coursera...
Language:Jupyter Notebook0 1 00
nazish555/AWS-Data_Engineering-Spotify_Data
This project showcases a data transformation pipeline utilizing AWS Glue and Amazon Athena to process Spotify data from CSV files. It involves loading, transforming, and storing data in an S3 datawarehouse, enabling seamless querying through Amazon Athena.
Language:Python0 2 00
nischaybikramthapa/dbt-athena-tpch
This project demonstrates how you can build downstream data pipeline using dbt in athena
Language:Python0 1 00
olusimeon/reddit-sentimentanalyses-pipeline
This project sets up a real-time data pipeline to fetch data from Reddit, transform it using AWS Glue, and store it in Amazon S3. This involves data streaming, cloud storage, ETL (Extract, Transform, Load) processes, and orchestration using Apache Airflow.
Language:Python0 2 00
parth2050/aws-data-pipeline
An End-To-End data pipeline integration from Website Source to analytical dashboard in AWS using Python flask, ML models, DynamoDB and other AWS services.
Language:HTML0 1 00
pawanyoda/create_glue_table_using_gitlab_cicd
Create Glue table using CI -CD
00
shaundominic/Kafka-Streaming-Project
Leverages Apache Kafka to facilitate streaming real time data generated by Python to upload data into S3 using s3fs
Language:Python0 1 00
VivekaAryan/Reddit-Data-Pipeline
This project offers a robust data pipeline solution designed to efficiently extract, transform, and load (ETL) Reddit data into a Redshift data warehouse. Leveraging a blend of industry-standard tools and services, the pipeline ensures seamless data processing and integration.
Language:Jupyter Notebook0 2 00
wlopezm-unal/Project-airflow-AWSGlue
In this project we can run an ETL in AWS Glue by Orchestrating it with Airflow. This project we create a Docker Compose to raise the services as Airflow, Redis and PostgreSQL. PostgreSQL was use in this project to save metadata get of Airflow
Language:Python0 1 00
ArthurHenriqueSilva/Demo-AWS-GLUE-CN-2024.1
Repositório para demonstração da ferramenta AWS GLUE - COmputação em Nuvem - DCOMP/UFS - 2024.1
Language:Python1 0
Cuchuflim/ETL-S3-to-Redshift
Incremental Data Load from S3 Bucket to Amazon Redshift Using AWS Glue
Language:Python
riship1095/YouTube-ETL
Transformed YouTube’s raw JSON data to parquet & loaded it in an S3 bucket, used Glue Data Catalog for storing metadata & Athena to query the cleaned data. Developed an ETL process using a Lambda job that would be triggered when raw data is loaded into an S3 bucket, processed, and stored for analytical purposes in an S3 bucket.
Language:Python
shreyask1406/Financial-Market-AWS-Data-Pipeline
AWS Data pipeline
vanibhat02/Big-Data
Big data and Cloud Deployment
Language:Jupyter Notebook1 0

awsglue

prestodb/prestorials

Akanksha-tetwar/YouTube-Trending-video-analysis-ETL-using-AWS-Services

TanishkaMarrott/Real-Time-Streaming-Analytics-with-Kinesis-Flink-and-OpenSearch

Undisputed-jay/SpotifyAPI-Data-Engineering-Project

bhavanachitragar/Superstore-Data-Analysis-using-AWS

catherman/Data-Science-Miscellaneous

Harikishan-AI/Harikishan-AI

iqrabismii/Big-Data-Projects-

Mopheshi/DataEngineeringSpecialization

nazish555/AWS-Data_Engineering-Spotify_Data

nischaybikramthapa/dbt-athena-tpch

olusimeon/reddit-sentimentanalyses-pipeline

parth2050/aws-data-pipeline

pawanyoda/create_glue_table_using_gitlab_cicd

shaundominic/Kafka-Streaming-Project

VivekaAryan/Reddit-Data-Pipeline

wlopezm-unal/Project-airflow-AWSGlue

ArthurHenriqueSilva/Demo-AWS-GLUE-CN-2024.1

Cuchuflim/ETL-S3-to-Redshift

riship1095/YouTube-ETL

shreyask1406/Financial-Market-AWS-Data-Pipeline

vanibhat02/Big-Data