aws-redshift
There are 117 repositories under aws-redshift topic.
alanchn31/Data-Engineering-Projects
Personal Data Engineering Projects
tokern/piicatcher
Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub
aws/amazon-redshift-python-driver
Redshift Python Connector. It supports Python Database API Specification v2.0.
alanchn31/Movalytics-Data-Warehouse
Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow
Wittline/uber-expenses-tracking
The goal of this project is to track the expenses of Uber Rides and Uber Eats through data Engineering processes using technologies such as Apache Airflow, AWS Redshift and Power BI.
shravan-kuchkula/udacity-data-eng-proj-1
Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation, validation and loading of data from S3 -> Redshift -> S3
awslabs/clickstream-analytics-on-aws
Build clickstream analytics on AWS for your mobile and web applications
KentHsu/Udacity-Data-Engineering-Nanodgree
Udacity Data Engineering Nanodegree Program
jackmleitch/StravaDataPipline
:arrows_counterclockwise: :running: EtLT of my own Strava data using the Strava API, MySQL, Python, S3, Redshift, and Airflow
AnMol12499/Reddit-Analytics-Integration-Platform
Project was based on an interest in Data Engineering, ETL pipeline. It also provided a good opportunity to develop skills and experience in a range of tools. As such, project is more complex than required, utilising dbt, airflow, docker and cloud based storage.
heroku-examples/analytics-with-kafka-redshift-metabase
An example system that captures a large stream of product usage data, or events, and provides both real-time data visualization and SQL-based data analytics.
moritzkoerber/covid-19-data-engineering-pipeline
A Covid-19 data pipeline on AWS featuring PySpark/Glue, Docker, Great Expectations, Airflow, and Redshift, templated in CloudFormation and CDK, deployable via Github Actions.
ismaildawoodjee/aws-data-pipeline
A batch processing data pipeline, using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform, and orchestrated from locally hosted Airflow containers. The end product is a Superset dashboard and a Postgres database, hosted on an EC2 instance at this address (powered down):
tmheo/spring-data-jpa-redshift-sample
spring boot data jpa integration with aws redshift sample
AWS-Big-Data-Projects/Analysing-Census-Data-using-aws
Use aws-emr and aws-redshift to analyse dataset of adult census of USA
vsouza/spark-kinesis-redshift
Example project for consuming AWS Kinesis streamming and save data on Amazon Redshift using Apache Spark
kishlayjeet/Zomato-Twitter-Sentiment-Analysis-Data-Pipeline
This project provides valuable customer sentiment insights for Zomato by tracking and analyzing tweets related to their brand and services.
lenguyenthedat/aws-redshift-to-rds
A simple command-line tool to copy tables from Amazon Redshift to Amazon RDS (PostgreSQL).
essraahmed/Data-Warehouse-With-Redshift
Data Warehouse with AWS Redshift and Visualizing data using Power BI
kishaningithub/rdapp
rdapp - Redshift Data API Postgres Proxy
taise/Spectrometer
AWS Redshift monitoring web console
twistedFantasy/aws
The goal of this repository is to provide good and clear examples of Amazon CLI commands together with Amazon CDK to easily create any AWS services and resources
FedericoSerini/DEND-Project-3-Data-Warehouse-AWS
Project 3 - Data Engineering Nanodegree
eduardofb/redshift-create-manifest
Redshift script to create a MANIFEST file recursively
FedericoSerini/DEND-Project-5-Data-Pipelines
Project 5 - Data Engineering Nanodegree
lregnier/slick-amazon-redshift
A quick example of how to load data from Amazon S3 into Amazon Redshift using Redshift's COPY command through Slick
polo2444172276/Udacity-Data-Engineering-Nanodegree
Completed Udacity's data engineering nano degree. Went through a series of exercises and projects to learn and practice the trendy big data management tools.
DimaKuriptya/RedditETL
This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data warehouse. The pipeline leverages a combination of tools and services including Apache Airflow, Celery, PostgreSQL, Amazon S3, AWS Glue, Amazon Athena, and Amazon Redshift.
eduardofb/redshift-remove-duplicates
Remove duplicates entries from a Redshift cluster
meejahnsnutshell/AWS_ML_Crypto
A Java API that gathers historical cryptocurrency pricing data (via CryptoCompare API) & makes predictions (via AWS Machine Learning API)
micopes/AWS_Datalake
AWS 및 AWS를 이용한 Data Lake 구성 이해
paoliniluis/shift-to-spectrum
An automated SQL script generator to migrate AWS Redshift schemas (or tables) to AWS Redshift Spectrum
PopoPenguin/AWS_ML_Crypto
A Java API that gathers historical cryptocurrency pricing data (via CryptoCompare API) & makes predictions (via AWS Machine Learning API)
SaadAhmedWaqar/Data-Warehousing-Redshift
A Data Warehousing project for retail sales using dimension modelling best practices with SCD type 2 on AWS Redshift. Utilizing AWS Lambda, Glue Workflows and Python Shell jobs to create and automate an ELT pipeline where batch data coming into S3 is loaded onto Redshift and necessary transformations are performed to meet requirements.