data-platform
There are 202 repositories under data-platform topic.
opendatadiscovery/odd-platform
First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business.
bruin-data/bruin
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
stitchfix/hamilton
A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton
meltwater/served
A C++11 RESTful web server library
flowerfine/scaleph
Open data platform based on Kubernetes. Scaleph supports SeaTunnel、Flink and Doris backended by SeaTunnel on Flink engine、Flink Kubernetes Operator and Doris operator.
pracdata/awesome-open-source-data-engineering
A curated list of open source tools used in analytics platforms and data engineering ecosystem
jay86cn/techui-vue2
TechUI is a easy to use Dynamic SVG Data Visualization Dashboard development tool, based on vite + vue2 development
silverton-io/buz
Serverless multi-protocol + multi-destination event collection system.
src-d/sourced-ce
source{d} Community Edition (CE)
linktimecloud/kubernetes-data-platform
KDP(Kubernetes Data Platform) delivers a modern, hybrid and cloud-native data platform based on Kubernetes.
Azure/data-management-zone
Template to deploy the Data Management Zone of Cloud Scale Analytics (former Enterprise-Scale Analytics). The Data Management Zone provides data governance and management capabilities for the data platform of an organization.
Azure/data-landing-zone
Template to deploy a single Data Landing Zone of the Data Management & Analytics Scenario (former Enterprise-Scale Analytics). The Data Landing Zone is a logical construct and a unit of scale in the architecture that enables data retention and execution of data workloads for generating insights and value with data.
atrocore/atrocore
AtroCore is an open-source Data Platform, Data Management and Master Data Management (MDM) software, which can be used to quickly create any business application.
opendatadiscovery/opendatadiscovery-specification
ODD Specification is a universal open standard for collecting metadata.
anna-geller/prefect-dataplatform
Example repository showing how to build a data platform with Prefect, dbt and Snowflake
jay86cn/techui-vue3-lite
A free, simple, and easy-to-use technology-style UI component, developed based on vue3
taogeYT/pyetl
python ETL framework
blueapron/kafka-connect-protobuf-converter
Protobuf converter plugin for Kafka Connect
ssimunic/Temp-Monitor
Internet of Things data platform for temperature and humidity sensors with maps
Azure/data-product-analytics
Template to deploy a Data Product for analytics and data science use-cases into a Data Landing Zone of the Data Management & Analytics Scenario (former Enterprise-Scale Analytics). The Data Product template can be used by cross-functional teams to create insights and products for external users.
fabioms-br/estudados
Banco de Dados para Estudo
Graviti-AI/tensorbay-python-sdk
Graviti TensorBay Python SDK
AI4Bharat/Shoonya
Shoonya - Platform to Annotate and label data at scale.
josephmachado/online_store
End to end data engineering project
thanhENC/e2e-data-platform
End-to-end data platform: A PoC Data Platform project utilizing modern data stack (Spark, Airflow, DBT, Trino, Lightdash, Hive metastore, Minio, Postgres)
opendatadiscovery/odd-collector
Open-source metadata collector based on ODD Specification
Azure/data-product-batch
Template to deploy a Data Product for Batch data processing into a Data Landing Zone of the Data Management & Analytics Scenario (former Enterprise-Scale Analytics). The Data Product template can be used by cross-functional teams to ingest, provide and create new data assets within the platform.
Azure/data-product-streaming
Template to deploy a Data Product for data stream processing into a Data Landing Zone of the Data Management & Analytics Scenario (former Enterprise-Scale Analytics). The Data Product template can be used by cross-functional teams to ingest, provide and create new data assets within the platform.
evoluteur/kaggle-look-alike
Kaggle Data Explorer UI look-alike built in React.
chumaky/docker-images
Postgres database with different foreign data wrapper extensions installed. Datero data platform engine image.
govflow/govflow
An open, modular work order and workflow management system for local government and residents.
luatnc87/open-source-modern-data-stack
This repo demonstrate a comprehensive modern data stack using popular open-source tools.
ryandawsonuk/data-platforms-tools
Guide to data platforms and tools
davidgasquez/filecoin-data-portal
🧮 Open, serverless, and local friendly Data Platform for the Filecoin Ecosystem
rpj/rpi
RPJiOS: RPJ's RPi OS, a sensor data platform for the Raspberry Pi built with python2.7 and redis.
xuwenyihust/PawMark
PawMark is a platform for developers to build, schedule and monitor data pipelines.