amazon-redshift
There are 56 repositories under amazon-redshift topic.
sasha-alias/sqltabs
Rich SQL client for Postgresql, MySQL, MS SQL, Amazon Redshift, Google Firebase (Firestore)
helicalinsight/helicalinsight
Helical Insight software is world’s first Open Source Business Intelligence framework which helps you to make sense out of your data and make well informed decisions.
aws/amazon-redshift-python-driver
Redshift Python Connector. It supports Python Database API Specification v2.0.
aws-samples/aws-dbs-refarch-rdbms
Reference Architectures for Relational Databases on AWS
servian/amazon-redshift-checklist
This checklist aims to be an exhaustive list of all elements you should consider when using Amazon Redshift.
polakowo/yelp-3nf
3NF-normalize Yelp data on S3 with Spark and load it into Redshift - automate the whole thing with Apache Airflow
aws-samples/amazon-redshift-serverless-rsql-etl-framework
Amazon Redshift Serverless RSQL ETL Framework
lenguyenthedat/aws-redshift-to-rds
A simple command-line tool to copy tables from Amazon Redshift to Amazon RDS (PostgreSQL).
nathadriele/redshift-to-s3-unload-dag
This Airflow DAG automates the process of extracting data from an Amazon Redshift database and unloading it to Amazon S3 in Parquet format. It runs daily, exporting data from the previous day based on a specified query.
miztiik/redshift-demo
Simple getting started 1-node redshift cluster stack
Wathon/Data-Warehouse-with-AWS
Udacity Data Engineering Nano Degree Project, ETL for Data Warehouse using S3 and Amazon Redshift.
alicjamazur/data-engineering-case
ETL Redshift-based workflow automated with AWS Step Funtions.
mateensa/SyPy
This git project contains programming excercises written in python for almost everything ranging from Solving Problems using Algorithms and Data structures, webscrapping, managing connectivity to db, ml and plotting graphs
AbdullahMu/Data-Pipelines-with-Airflow
Schedule, automate, and monitor data pipelines using Apache Airflow. Run data quality checks, track data lineage, and work with data pipelines in production.
HemantNegi/django-redshift
Redshift ORM and Migration support for django
mateensa/Redshift_Querys
few important querys required most of time by dev. or dba
MaxGanzII/redshift-observatory.ch
Comprehensive, unified and coherent replacement system tables for Amazon Redshift, from a single author.
AmritPrakash3/Reddit-ETL-in-AWS-using-Airflow
Reddit ETL in AWS using Airflow is a full-stack data engineering project that builds a scalable ETL pipeline using cloud-based tools. It extracts data from Reddit via API, processes it with Apache Airflow, and leverages AWS services like S3, Glue, Athena, and Redshift for transformation, querying, and warehousing.
bdbao/etl-reddit
This project is a data pipeline that ETL Reddit data into a Redshift data warehouse. The pipeline includes a combination of tools and services: Apache Airflow, Celery, PostgreSQL, Amazon S3, AWS Glue, Amazon Athena, and Amazon Redshift.
CarlosUziel/sparkify-redshift
A project from the Data Engineer Nanodegree Program at Udacity to practice Data Warehouses and ETL pipelines on AWS.
Ditectrev/Amazon-Web-Services-Certified-AWS-Certified-Database-DBS-C01-Practice-Tests-Exams-Questions-Answers
⛳️ PASS: Amazon Web Services Certified (AWS Certified) Database Specialty (DBS-C01) by learning based on our Questions & Answers (Q&A) Practice Tests Exams.
donjude/data-pipelines-with-airflow
This data pipeline is an ETL process with Apache Airflow that loads data from Amazon S3 bucket, stage the data workload and insert into Amazon Redshift Datawarehouse
Faisal-AlDhuwayhi/Data-Warehouse
Building Data Warehouse and ETL pipelines using Amazon S3 and Redshift
lauravoicu/udacity_data_engineering
Repository for projects developed in Udacity's Data Engineering Nanodegree.
Naga-Manohar-Y/Airline_Data_Ingestion
An end-to-end event driven data pipeline for airline data, utilizing various AWS services to process and store flight data in redshift with efficient data model..
nasseredine/udacity-dend-p2
Data Warehouse on AWS with Amazon Redshift
Sampsonyu/Data_Warehouse_with_Amazon_Redshift
Data Warehouse with Amazon Redshift
siconge/Tencent-HQ-BIM-Data-Pipeline-with-AWS
This project delivers an end-to-end data pipeline solution designed to employ a comprehensive ETL process to move BIM data from Autodesk Revit model of Tencent Global Headquarters into cloud storage for processing and and analytics. The pipeline leverages tools and services such as Apache Airflow, Amazon S3, AWS Glue, and Amazon Redshift.
vaxdata22/Customer-Churn-Data-Analytics-ETL-Pipeline-by-Airflow-on-EC2
This is an end-to-end AWS Cloud ETL project. This orchestration uses Apache Airflow on AWS EC2 as well as AWS Glue. It demonstrates how to build ETL pipeline that would perform data transform using Glue job/crawler as well as loading into a Redshift table. It also shows how to connect Amazon Athena to Glue Data Catalog, and Power BI to Redshift.
vaxdata22/Zillow-Rapid-API-end-to-end-ETL-data-pipeline-by-Airflow-on-EC2
This is an end-to-end AWS Cloud ETL project. This data pipeline orchestration uses Apache Airflow on AWS EC2 as well as AWS Lambda. It demonstrates how to build ETL data pipeline that would perform data transformation using Lambda function as well as loading into a Redshift cluster table. The data would then be visualized using Amazon QuickSight.
AnnalieseTech/Sparkify
A music streaming company, Sparkify, has decided that it is time to introduce more automation and monitoring to their data warehouse ETL pipelines and come to the conclusion that the best tool to achieve this is Apache Airflow.
Kiran8053/Reddit-ETL-in-AWS-using-Airflow
Reddit ETL in AWS using Airflow is a full-stack data engineering project that builds a scalable ETL pipeline using cloud-based tools. It extracts data from Reddit via API, processes it with Apache Airflow, and leverages AWS services like S3, Glue, Athena, and Redshift for transformation, querying, and warehousing.
Tejesvani/IoT-Data-Streaming-and-Analytics
The Smart City Data Streaming Pipeline processes real-time data from IoT devices using Apache Kafka for ingestion and Apache Spark for processing. Data is stored in AWS S3 and analyzed with Glue, Athena, and Redshift. It enhances traffic management, predictive analytics, and urban planning, making cities smarter and more efficient.