data-modelling

There are 128 repositories under the data-modelling topic.

  • raystack/optimus

    Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.

    Language: Go
  • gouline/dbt-metabase

    dbt + Metabase integration

    Language: Python
  • alanchn31/Movalytics-Data-Warehouse

    Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow

    Language: Python
  • rapiddweller/rapiddweller-benerator-ce

    BENERATOR is a leading software solution to generate, obfuscate, pseudonymize and migrate data for development, testing, and training purposes with a model-driven approach.

    Language: Java
  • dmey/synthia

    📈 🐍 Multidimensional synthetic data generation with Copula and fPCA models in Python

    Language: Python
  • hypergol/hypergol

    Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any projects into production with autogenerated code, standardised structure for data and ML and parallel processing out-of-the-box.

    Language: Python
  • vedanthv/data-engineering-portfolio

    Cool DE Projects

    Language: Jupyter Notebook
  • VishanthSurresh/Spotify-Capstone-Project---Data-Engineering

    This repository is a working ETL framework which utilizes user data from Spotify API using ➲Python for Extraction and Transformation ➲SQL for Data Loading and Staging ➲Airflow for Data Orchestration and Monitoring ➲PowerBI for Reporting

    Language: Python
  • goto/optimus

    Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.

    Language: Go
  • ndleah/AWS-ETL-pipeline

    ⚙️ ETL pipeline on AWS using S3 and Redshift

    Language: Python
  • ndleah/dvd-rental-marketing-analytics

    🎥 Email marketing campaign analysis

    Language: SQL
  • UniTo-SEPI/COVID-19_Piedmont

    COVID-19 Surveillance Data Modelling and Management Pipeline in Piedmont.

    Language: Julia
  • xomda/xomda

    Extensible Object Model Data Abstraction

    Language: Java
  • halimocakli/database-design-and-sql-programming

    This repo covers the processes of designing a database by performing logical, conceptual and physical data modelling processes, creating the designed database using DML and DDL on various database server systems and performing SQL queries on the created database.

  • Opikadash/world-bank-powerbi-dashboard

    Developed a 3-page Power BI dashboard (global and Asian overview) using Python scripts to load and clean World Bank data (1960–2020), reducing data processing time by 25%. Containerized the database in Docker, enabling scalable access, and visualized trends (e.g., 3% annual GDP growth in Asia), enhancing stakeholder insights.

    Language: Python
  • Participatory-Image-Archives/pia-data-model

    Data model for the Participatory Knowledge Practices in Analogue and Digital Image Archives (PIA) project

    Language: Python
  • usama-s786/dbs211

    Repository with files that I worked upon during the DBS211 (Introduction to Database Systems) course.

    Language: C++
  • waqarg2001/Formula1-Insights-DE

    Formula 1 race data engineering project that uses Azure services and Databricks to ingest and analyse the data.

    Language: Python
  • AkashSDas/reader

    Social blogging community built with React, Next.js, and Firebase.

    Language: TypeScript
  • bergerache/Micronutrients_Dashboard

    An interactive Tableau dashboard promoting nutritional health through the exploration of micronutrients

    Language: Jupyter Notebook
  • dharc-org/chad-ap

    A CIDOC-CRM-based Application Profile, consisting of a set of entities and properties for representing the digitisation process of cultural heritage objects in a machine-readable format.

    Language: Jupyter Notebook
  • DivineSamOfficial/Snowflake-ELT-Pipeline-with-dbt-and-Airflow

    This project showcases an end-to-end ELT (Extract, Load, Transform) pipeline leveraging the TPCH orders table from Snowflake's sample database. The primary goal is to demonstrate modern data engineering practices using Snowflake, dbt (Data Build Tool), and Apache Airflow.

    Language: Python
  • sankeshyadav98/Accenture-Data-Analytics-and-Visualization

    This is the Accenture Data Analytics virtual experience project with Forage. The goal was to help a company named "Social Buzz" make use of its massive amount of data. Social Buzz has reached huge scale in recent years to become recognized as a global unicorn company.

    Language: Jupyter Notebook
  • vaishnavi-3003/hr-analytics-powerbi

    This repo contains an HR Analytics project that analyzes which factors impact employee attrition, using a dataset for the Atlas Labs company.

  • abhi14112/Ecommerce-Backend-.Net-Core-Web-API

    complete ecommerce backend in asp.net core web api

    Language: C#
  • AkwasiTp/National-Clothing-Chain

    A Udacity Power BI project on an online clothing store

  • Data-Research-Analysis/data-research-analysis-platform

    Tired of complex data platforms slowing you down? Data Research Analysis makes the process of getting data insights easy and simple, so you can make confident, lightning-fast decisions.

    Language: Vue
  • edeng94/AI-Cognizant-Virtual-Internship

    Artificial Intelligence Virtual Experience Program

    Language: Jupyter Notebook
  • evelyn658/airflow-dbt-etl-pipeline

    ETL pipeline on PostgreSQL using Apache Airflow and dbt Cloud

    Language: Python
  • JBris/jackson-data-models-example

    Demonstration of Jackson data models

    Language: Java
  • roti/lut

    A library for data modeling in Scala.

    Language: Scala
  • subalasingh/Atliq_Hardware_Sales_Insights

    Data Visualization for Atliq Hardware sales

  • DivyaRsawant/HR_Analytics_Dashboard

    An HR Analytics dashboard built with Power BI to help an organisation improve employee performance and retention (reduce attrition).

  • LynnColeArt/The-Claudinator

    A simple Chromium plugin for downloading and archiving your Anthropic AI chats with Claude 3 models

    Language: JavaScript
  • mattiasthalen/hook-forge

    Hook Forge is a CLI tool for forging frames and bridges according to the data modeling methodologies Hook & Unified Star Schema

    Language: Clojure
  • VengeARA/C-ncer_datapred

    A prediction model for C@ncer patients. This project contains informative analysis and model prediction. Unfortunately, the code doesn't work past the analysis. It would be great if someone could reach out to me to solve the problem. After clicking "Train model" and doing anything after that, you go back to the "Train model" button.

    Language: Python