datawarehousing
There are 128 repositories under datawarehousing topic.
Datavault-UK/automate-dv
A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineering tool, registered trademark of dbt Labs)
cynkra/dm
Working with relational data models in R
mara/mara-schema
Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables
MohamedHmini/tweetsOLAPing
implementing an end-to-end tweets ETL/Analysis pipeline.
jazzido/mondrian-rest
A REST interface for Mondrian ROLAP server
CICIFLY/Data_Engineering_Project_Portfolio
Data Engineering, Data Warehouse, Data Mart, Cloud Data, AWS, SAS, Redshift, S3
kromozome2003/Snowflake-Json-DataPipeline
Building Json data pipeline within Snowflake using Streams and Tasks
dangalavan/Optimizing-DataVault-on-Snowflake
Scripts complement the Optimizing a Data Vault data warehouse on the Snowflake Cloud Data Platform webinar
techsparksguru/data_ai_for_all
Data Analysis, Analytics, Science, AI & ML, LLM etc.
victorskl/genomic-bigdata-spark
Genomic BigData Warehousing with Apache Spark and LakeHouse Architecture
Abhi0323/Full-Cycle-ETL-Analytics-with-Google-Analytics-and-Snowflake
Explore the transformative power of data analytics in my portfolio, where Google Analytics and Snowflake converge to provide comprehensive insights. This project leverages advanced ETL techniques and real-time data integration to enhance user engagement and optimize content delivery effectively.
dangalavan/SqlDBM-Snowflake-Hands-on-lab
Data modeling & the Snowflake Data Cloud using SqlDBM Hands-on lab - corresponding scripts.
essraahmed/Data-Warehouse-With-Redshift
Data Warehouse with AWS Redshift and Visualizing data using Power BI
dharm18/stock-datawarehouse
A data warehouse and business intelligence project on Stock market dataset to answer non-trivial BI queries.
jpseverance/DateAndTimeDimensionBuilders
Data warehousing date dimension and time dimension builders written in Python.
MiladNooraei/Quera-Superstore
Performed data pre-processing, optimized data warehousing, applied statistics and machine learning, and used Power BI for insightful visualizations to support informed decisions
praveendecode/YouTube-Data-Harvesting-Warehousing
Efficient YouTube data harvesting and warehousing with Python, SQL, MongoDB, and Streamlit, enabling seamless analysis and visualization for insightful decision-making in content management and audience engagement strategies
aessing/demo-mdwh
Modern Dataware House Demos with Azure Databricks, Azure Data Factory & Azure Dedicated SQL pool (formerly SQL DW)
epilif1017a/bigdatabenchmarks
Code and Documents related to the SSB+ Benchmark
jaehyeon-kim/iceberg-etl-demo
Data Warehousing ETL Demo with Apache Iceberg on EMR Local Environment
nglthu/Datawarehousing
Data Warehousing (DW) Project Building and Analysing a DW for NatureFresh Stores in NZ, built using a high-performance Oracle database 12c, and Index-Nested Loops Join-Oracle.
escobarana/SSIS_DWH
Datawarehouse & ETL using Visual Studio 2019 SSIS
FirasKahlaoui/retail-data-warehouse
This project demonstrates the creation of a Data Warehouse using SQL Server 2022. It includes the design of dimension and fact tables, ETL processes for data integration, Python scripts for synthetic data generation, and SQL queries for KPI analysis to support business decision-making.
hams71/Dbt_Demo
Using dbt to load(seed) and do some transformations and then finally load that data to some Cloud Warehouse
ibromley/sparkify-s3-datalake
Data Warehousing with Spark & Amazon S3
jennaallen/football_schools
Using a dimensional model, data warehouse, and Tableau I explored data from the College Scorecard and NCAA Division I FBS football games :football:
kstrassheim/datawarehouse-crawler
This is a content and schema crawler tool to receive, update and import various kinds of data into a Onprem or Cloud based SQLServer or Azure-Synapse-Analysis (Azure Datawarehouse SQLServer). As source it supports SQLServer Tables, ODATA Endpoints, CSV Files or Excel Files. For multiple sources it can run in parallel mode where it would make a thread for each connection. The speciality of this crawler is that it creates the target tables by himself using the additional info from source.json. In case of Azure-Synapse-Analysis it would estimate the distribution type and keys. The syncing works completely without SQL Transactions by using a consistency correction algorithm for very frequent fact tables. There are 5 Syncing Algorithms (see Manual/Insert) which can be selected as well as one Update Algorithm.
mehroosali/bigquery-sparksql-batch-etl
Batch ETL pipeline project on GCP to load and transform daily flight data using Spark to update tables in BigQuery. The pipeline is automated using Airflow.
oshadhi-vanodhya/Business-Intelligence
Business Intelligence Course work - R Studio (Neural Networks, Deep Learning, Data Warehousing)
Mouhamed-Jinja/Python-Airflow-Postgres-Docker-DWH
This repository contains Apache Airflow Directed Acyclic Graphs (DAGs) and associated scripts for orchestrating an Extract, Transform, Load (ETL) workflow. The workflow is designed to extract data from a source, perform transformations, and load it into a data warehouse.
pgrondein/data_platform_for_data_analytics
This project goal is to design a Data Platform for retail Data Analytics.
pranjals26/Data-Management-Project-Flight-delays
Data Cleaning and Analysis on Flight Delay & Cancellation
rehamessa/Airline-System-DWH-Modeling
A leading airline company engaged our services to support the executive management in their analysis of current business processes and identification of new opportunities for company growth.
Salma-Mamdoh/Datawarehouse_Project
Our project for Datawarehouse Course taken during fall 2024 semester
shahrushit1996/Data-Warehouse-on-Snowflake
A data warehouse project to analyze store performances of a retail chain