data-platform

There are 202 repositories under data-platform topic.

  • odd-platform

    opendatadiscovery/odd-platform

    First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business.

    Language:Java1.3k18645128
  • bruin-data/bruin

    Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.

    Language:Go97981043
  • stitchfix/hamilton

    A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton

    Language:Python8601810636
  • meltwater/served

    A C++11 RESTful web server library

    Language:C++7096937172
  • flowerfine/scaleph

    Open data platform based on Kubernetes. Scaleph supports SeaTunnel、Flink and Doris backended by SeaTunnel on Flink engine、Flink Kubernetes Operator and Doris operator.

    Language:Java39010194108
  • pracdata/awesome-open-source-data-engineering

    A curated list of open source tools used in analytics platforms and data engineering ecosystem

  • jay86cn/techui-vue2

    TechUI is a easy to use Dynamic SVG Data Visualization Dashboard development tool, based on vite + vue2 development

    Language:Vue2229150
  • silverton-io/buz

    Serverless multi-protocol + multi-destination event collection system.

    Language:Go207731226
  • src-d/sourced-ce

    source{d} Community Edition (CE)

    Language:Go1921810152
  • linktimecloud/kubernetes-data-platform

    KDP(Kubernetes Data Platform) delivers a modern, hybrid and cloud-native data platform based on Kubernetes.

    Language:CUE191102049
  • Azure/data-management-zone

    Template to deploy the Data Management Zone of Cloud Scale Analytics (former Enterprise-Scale Analytics). The Data Management Zone provides data governance and management capabilities for the data platform of an organization.

    Language:Bicep1772010189
  • Azure/data-landing-zone

    Template to deploy a single Data Landing Zone of the Data Management & Analytics Scenario (former Enterprise-Scale Analytics). The Data Landing Zone is a logical construct and a unit of scale in the architecture that enables data retention and execution of data workloads for generating insights and value with data.

    Language:Bicep170196971
  • atrocore

    atrocore/atrocore

    AtroCore is an open-source Data Platform, Data Management and Master Data Management (MDM) software, which can be used to quickly create any business application.

    Language:JavaScript161812046
  • opendatadiscovery-specification

    opendatadiscovery/opendatadiscovery-specification

    ODD Specification is a universal open standard for collecting metadata.

  • anna-geller/prefect-dataplatform

    Example repository showing how to build a data platform with Prefect, dbt and Snowflake

    Language:Python1043217
  • jay86cn/techui-vue3-lite

    A free, simple, and easy-to-use technology-style UI component, developed based on vue3

    Language:Vue1042412
  • taogeYT/pyetl

    python ETL framework

    Language:Python1047336
  • blueapron/kafka-connect-protobuf-converter

    Protobuf converter plugin for Kafka Connect

    Language:Java94183054
  • ssimunic/Temp-Monitor

    Internet of Things data platform for temperature and humidity sensors with maps

    Language:PHP9016035
  • Azure/data-product-analytics

    Template to deploy a Data Product for analytics and data science use-cases into a Data Landing Zone of the Data Management & Analytics Scenario (former Enterprise-Scale Analytics). The Data Product template can be used by cross-functional teams to create insights and products for external users.

    Language:Bicep80172327
  • fabioms-br/estudados

    Banco de Dados para Estudo

  • Graviti-AI/tensorbay-python-sdk

    Graviti TensorBay Python SDK

    Language:Python7611535
  • Shoonya

    AI4Bharat/Shoonya

    Shoonya - Platform to Annotate and label data at scale.

  • josephmachado/online_store

    End to end data engineering project

    Language:Python543318
  • thanhENC/e2e-data-platform

    End-to-end data platform: A PoC Data Platform project utilizing modern data stack (Spark, Airflow, DBT, Trino, Lightdash, Hive metastore, Minio, Postgres)

    Language:Python443247
  • opendatadiscovery/odd-collector

    Open-source metadata collector based on ODD Specification

    Language:Python4337813
  • Azure/data-product-batch

    Template to deploy a Data Product for Batch data processing into a Data Landing Zone of the Data Management & Analytics Scenario (former Enterprise-Scale Analytics). The Data Product template can be used by cross-functional teams to ingest, provide and create new data assets within the platform.

    Language:Bicep38152023
  • Azure/data-product-streaming

    Template to deploy a Data Product for data stream processing into a Data Landing Zone of the Data Management & Analytics Scenario (former Enterprise-Scale Analytics). The Data Product template can be used by cross-functional teams to ingest, provide and create new data assets within the platform.

    Language:Bicep36141616
  • kaggle-look-alike

    evoluteur/kaggle-look-alike

    Kaggle Data Explorer UI look-alike built in React.

    Language:JavaScript35203
  • chumaky/docker-images

    Postgres database with different foreign data wrapper extensions installed. Datero data platform engine image.

    Language:Shell341213
  • govflow/govflow

    An open, modular work order and workflow management system for local government and residents.

    Language:TypeScript346394
  • luatnc87/open-source-modern-data-stack

    This repo demonstrate a comprehensive modern data stack using popular open-source tools.

    Language:Shell33116
  • data-platforms-tools

    ryandawsonuk/data-platforms-tools

    Guide to data platforms and tools

  • davidgasquez/filecoin-data-portal

    🧮 Open, serverless, and local friendly Data Platform for the Filecoin Ecosystem

    Language:Python2921019
  • rpj/rpi

    RPJiOS: RPJ's RPi OS, a sensor data platform for the Raspberry Pi built with python2.7 and redis.

    Language:Python255181
  • PawMark

    xuwenyihust/PawMark

    PawMark is a platform for developers to build, schedule and monitor data pipelines.

    Language:JavaScript2131840