analytics-engineering

There are 92 repositories under the analytics-engineering topic.

  • Hiflylabs/awesome-dbt

    A curated list of awesome dbt resources

  • zinggAI/zingg

    Scalable identity resolution, entity resolution, data mastering and deduplication using ML

    Language: Java
  • raystack/optimus

    Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.

    Language: Go
  • StructuredLabs/preswald

    📟 Preswald is a full-stack platform for building, deploying, and managing interactive data applications. It brings ingestion, storage, transformation, and visualization into a simple SDK, minimizing complexity while maintaining flexibility for both prototyping and production-grade use cases.

    Language: Python
  • nordquant/complete-dbt-bootcamp-zero-to-hero

    Supplementary Materials for The Complete dbt (Data Build Tool) Bootcamp Udemy course

    Language: Shell
  • elementary-data/dbt-data-reliability

    dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.

    Language: Python
  • DataRecce/recce

    The data-validation toolkit for enhanced dbt (data build tool) PR review

    Language: TypeScript
  • dbt-msft/dbt-sqlserver

    dbt adapter for SQL Server and Azure SQL

    Language: Python
  • starburstdata/dbt-trino

    The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)

    Language: Python
  • tuva-health/tuva

    Main repo including core data model, data marts, reference data, terminology, and the clinical concept library

    Language: Python
  • dbt-labs/jaffle-shop

    🥪🦘 An open source sandbox project exploring dbt workflows via a fictional sandwich shop's data.

  • anna-geller/dataflow-ops

    Project demonstrating how to automate Prefect 2.0 deployments to AWS ECS Fargate

    Language: Python
  • anna-geller/prefect-dataplatform

    Example repository showing how to build a data platform with Prefect, dbt and Snowflake

    Language: Python
  • gmyrianthous/dbt-airflow

    A Python package that creates fine-grained dbt tasks on Apache Airflow

    Language: Python
  • gmyrianthous/dbt-dummy

    A dbt (data build tool) project you can use for testing purposes or experimentation

    Language: Dockerfile
  • mattiasthalen/arcane-insight

    Arcane Insight is a data analytics project designed to harness the power of SQLMesh & DuckDB to collect, transform, and analyze data from Blizzard’s Hearthstone API. Focused on card statistics and attributes, this project reveals detailed insights into card mechanics, strengths, and trends to support BI and strategic analysis.

    Language: Python
  • dbt-labs/jaffle-shop-generator

    🥪🏭 A simple CLI for generating synthetic Jaffle Shop data.

    Language: Python
  • montara-io/dbt-command-center

    Never sift through endless dbt™ logs again. dbt Command Center is a free, open-source, local web application that provides a user-friendly interface to monitor and manage dbt runs.

    Language: TypeScript
  • jaysobel/dbt-snowflake-queries

    dbt starter code for enterprise Snowflake usage data artifacts

  • kestra-io/examples

    Best practices for data workflows, integrations with the Modern Data Stack (MDS), Infrastructure as Code (IaC), Cloud Provider Services

    Language: HCL
  • TextQLLabs/dbt-documentor

    ✍️ dbt doc generator for advanced data teams

    Language: F#
  • mindyng/analytics-readings

    Readings for Analytics Engineers

  • tuva-health/demo

    A starter dbt project and synthetic claims dataset for trying out the Tuva Project.

  • tuva-health/medicare_cclf_connector

    This connector is a dbt project that maps Medicare CCLF claims data to the Tuva Input Layer.

  • sanchitvj/sports_betting_analytics_engine

    A data and analytics engineering platform designed for real-time sports betting analytics.

    Language: Python
  • tuva-health/docs

    The Tuva Project Docs, i.e., where we write and share our knowledge about healthcare data and analytics.

    Language: JavaScript
  • tuva-health/medicare_lds_connector

    Maps Medicare LDS claims data to the Tuva Input Layer so you can easily run the Tuva Project.

  • zkan/getting-started-with-analytics-engineering

    Getting Started with Analytics Engineering

    Language: Makefile
  • ryanrozich/snowflake-dbml-generator

    Automatically generate DBML files from Snowflake databases to quickly reverse-engineer interactive ER diagrams and documentation from your Snowflake DB. Ideal for data engineers and analysts, it supports custom primary key configurations and relationship inference.

    Language: Python
  • VulknData/eruptr

    Don't ETL or ELT. LET your data be free.

    Language: Python
  • anna-geller/prefect-getting-started

    Get started with Prefect by scheduling your Prefect flows with GitHub Actions

    Language: Python
  • gwenwindflower/copier-dbt

    📝🖨️ A copier template for dbt projects. ⚙️🧡

    Language: Jinja
  • goto/optimus

    Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.

    Language: Go
  • nicosuave/sqlctx

    SQLContext is a tool for generating LLM context from database tables for consumption from IDEs

    Language: Python
  • dbt-labs/jaffle-shop-mesh-platform

    A ✨ meshified ✨ open source sandbox project exploring dbt workflows via a fictional sandwich shop's data. This is the base of the mesh project that contains staging models.

  • dioz95/marketing-analytics-engineering

    This repo contains an end-to-end analytics engineering project within the marketing domain. The project uses the dbt Semantic Layer to standardize the metric definitions that are later used to produce end data products.

    Language: Python