analytics-engineering

There are 141 repositories under the analytics-engineering topic.

  • preswald

    StructuredLabs/preswald

    Preswald is a WASM packager for Python-based interactive data apps: bundle complete, complex data workflows, particularly visualizations, into single files that run entirely in-browser, using Pyodide, DuckDB, Pandas, Plotly, Matplotlib, and more. Build dashboards, reports, and notebooks that run offline, load fast, and share like a document.

    Language: Python
  • Hiflylabs/awesome-dbt

    A curated list of awesome dbt resources

  • raystack/optimus

    Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.

    Language: Go
  • nordquant/complete-dbt-bootcamp-zero-to-hero

    Supplementary materials for The Complete dbt (Data Build Tool) Bootcamp Udemy course

    Language: Python
  • elementary-data/dbt-data-reliability

    dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.

    Language: Python
  • DataRecce/recce

    The data-validation toolkit for enhanced dbt (data build tool) PR review

    Language: Python
  • tuva-health/tuva

    Main repo including core data model, data marts, data quality tests, and terminology sets.

    Language: HTML
  • starburstdata/dbt-trino

    The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)

    Language: Python
  • dbt-msft/dbt-sqlserver

    dbt adapter for SQL Server and Azure SQL

    Language: Python
  • dbt-labs/jaffle-shop

    🥪🦘 An open source sandbox project exploring dbt workflows via a fictional sandwich shop's data.

  • anna-geller/dataflow-ops

    Project demonstrating how to automate Prefect 2.0 deployments to AWS ECS Fargate

    Language: Python
  • anna-geller/prefect-dataplatform

    Example repository showing how to build a data platform with Prefect, dbt and Snowflake

    Language: Python
  • gmyrianthous/dbt-airflow

    A Python package that creates fine-grained dbt tasks on Apache Airflow

    Language: Python
  • dbt-labs/jaffle-shop-generator

    🥪🏭 A simple CLI for generating synthetic Jaffle Shop data.

    Language: Python
  • sanchitvj/sports_betting_analytics_engine

    A data and analytics engineering platform designed for real-time sports betting analytics.

    Language: Python
  • addhen/kanalytics

    Kotlin Multiplatform Analytics with a debug viewer

    Language: Kotlin
  • gmyrianthous/dbt-dummy

    A dbt (data build tool) project you can use for testing purposes or experimentation

    Language: Dockerfile
  • kestra-io/examples

    Best practices for data workflows, integrations with the Modern Data Stack (MDS), Infrastructure as Code (IaC), Cloud Provider Services

    Language: HCL
  • mattiasthalen/arcane-insight

    Arcane Insight is a data analytics project designed to harness the power of SQLMesh & DuckDB to collect, transform, and analyze data from Blizzard’s Hearthstone API. Focused on card statistics and attributes, this project reveals detailed insights into card mechanics, strengths, and trends to support BI and strategic analysis.

    Language: Python
  • montara-io/dbt-command-center

    Never sift through endless dbt™ logs again. dbt Command Center is a free, open-source, local web application that provides a user-friendly interface to monitor and manage dbt runs.

    Language: TypeScript
  • tuva-health/demo

    A starter dbt project and synthetic claims dataset for trying out the Tuva Project.

  • jaysobel/dbt-snowflake-queries

    dbt starter code for enterprise Snowflake usage data artifacts

  • TextQLLabs/dbt-documentor

    ✍️ dbt doc generator for advanced data teams

    Language: F#
  • ryanrozich/snowflake-dbml-generator

    Automatically generate DBML files from Snowflake databases to quickly reverse-engineer interactive ER diagrams and documentation from your Snowflake DB. Ideal for data engineers and analysts, it supports custom primary key configurations and relationship inference.

    Language: Python
  • mindyng/analytics-readings

    Readings for Analytics Engineers

  • tuva-health/medicare_cclf_connector

    This connector is a dbt project that maps Medicare CCLF claims data to the Tuva Input Layer.

  • tuva-health/docs

    The Tuva Project Docs, i.e., where we write and share our knowledge about healthcare data and analytics.

    Language: JavaScript
  • zkan/getting-started-with-analytics-engineering

    Getting Started with Analytics Engineering

    Language: Makefile
  • Geobatpo07/datahut-duckhouse

    DataHut-DuckHouse is a modern, modular, and multi-tenant analytics platform that combines DuckDB, Apache Iceberg, Arrow Flight, dbt, and Trino to build a hybrid, lightweight, and scalable data stack ready for SaaS.

    Language: Python
  • sidequery/sidemantic

    A universal metrics layer. Compatible with definitions in LookML, MetricFlow, and Cube, with DuckDB, Snowflake, ClickHouse, BigQuery & more!

    Language: Python
  • tuva-health/medicare_lds_connector

    Maps Medicare LDS claims data to the Tuva Input Layer so you can easily run the Tuva Project.

  • tuva-health/provider

    A dbt project that transforms messy public provider datasets into usable data for the Tuva Project.

  • VulknData/eruptr

    Don't ETL or ELT. LET your data be free.

    Language: Python
  • anna-geller/prefect-getting-started

    Get started with Prefect by scheduling your Prefect flows with GitHub Actions

    Language: Python
  • goto/optimus

    Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.

    Language: Go
  • nicosuave/sqlctx

    SQLContext is a tool for generating LLM context from database tables for consumption from IDEs

    Language: Python