glue-catalog

There are 18 repositories under the glue-catalog topic.

aws/aws-sdk-pandas
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
Language: Python 4k 60 1.2k702
dbt-labs/dbt-athena
The athena adapter plugin for dbt (https://getdbt.com)
Language: Python 236 10 228102
bbenzikry/spark-eks
Examples and custom spark images for working with the spark-on-k8s operator on AWS
Language: Dockerfile 27 2 65
webysther/aws-glue-docker
🐋 Docker image for AWS Glue Spark/Python
Language: Dockerfile 23 3 38
miztiik/s3-to-rds-with-glue
Extract, transform, and load data for analytic processing using AWS Glue
Language: Python 17 3 09
kyopark2014/case-study-wait-for-callback
This is a case study showing how to deploy "Wait-for-Callback" using Step Functions
Language: TypeScript 4 3 01
GabrielDan92/AWS_Terraform_PySpark-ETL_Job
Terraform configuration that creates several AWS services, uploads data to S3, and starts the Glue Crawler and Glue Job.
Language: Python 3 1 00
marwan116/aws-parquet
A toolkit that provides an object-oriented interface for working with Parquet datasets on AWS
Language: Python 3 2 00
PATRICIAJUNQUEIRA/DataLake_PipelineAWS
ETL pipeline on AWS
Language: Python 1 1 00
datahealer/jupyter-s3-parquet-redshift
1️⃣ Querying Parquet file from S3 using AwsWrangler. 2️⃣ Querying from Redshift tables using Glue & AwsWrangler
Language: Jupyter Notebook 0 1 00
edrrezend/ETL_Streaming_DataLake
ETL using application streaming and creating a Data Lake
Language: Jupyter Notebook 0 1 00
infraspecdev/terraform-aws-athena
This Terraform module automates the setup of AWS Athena to query ALB access and connection logs stored in an S3 bucket.
Language: HCL 0 3 00
KRISHNASAIRAJ/AWS-Driven-Sales-Performance-Outlook
The project aims to establish a robust data pipeline for tracking and analyzing sales performance using various AWS services. The process involves creating a DynamoDB database, implementing Change Data Capture (CDC), using Kinesis streams, and finally storing and querying the data in Amazon Athena.
Language: Python 0 1 00
mineshmelvin/aws-forecast-pipeline-iac
IaC (Terraform) of AWS Forecast pipeline using Glue as workflow manager
Language: Python 00
pranav-patil/aws-kinesis-analytics
Gathers metrics (CPU, memory) from various computers, aggregates the Kinesis stream data using Kinesis Analytics (with Flink), and stores the stream data in an AWS S3 bucket, which Amazon Athena uses to run analytics queries, with charts rendered in Grafana.
Language: Java 0 2 00
Shilpaar90/AWS-Capturing-Schema-Changes-In-S3
A pipeline within AWS to capture schema changes in S3 files and to update them in a DB.
0 1 00
gakas14/AWS-Serverless-Data-Lake
This workshop builds a serverless data lake architecture using Amazon Kinesis Firehose for streaming data ingestion, AWS Glue for data integration (ETL, catalog management), Amazon S3 for data lake storage, and Amazon Athena for SQL big data analytics.
Language: Jupyter Notebook 2 0
Zain970/Stock-market-realtime-data-pipeline
Reads data from a source file using Python, produces it to a Kafka broker using a Kafka producer, consumes the messages using a Kafka consumer, uploads the data to an AWS S3 bucket, builds a crawler on top of it, and queries the data using AWS Athena.
Language: Jupyter Notebook
