hudi
There are 42 repositories under hudi topic.
apache/doris
Apache Doris is an easy-to-use, high performance and unified analytics database.
StarRocks/starrocks
The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.
apache/hudi
Upserts, Deletes And Incremental Processing on Big Data.
alldatacenter/alldata
🔥🔥 AllData可定义数据中台,以数据平台为底座,以数据中台为桥梁,以机器学习平台为工厂,以大模型应用为上游产品,提供全链路数字化解决方案。采购商业版、加入技术社区:https://docs.qq.com/doc/DVHlkSEtvVXVCdEFo
collabH/bigdata-growth
大数据知识仓库涉及到数据仓库建模、实时计算、大数据、数据中台、系统设计、Java、算法等。
Mrkuhuo/data-warehouse-learning
【2024最新版】 大数据 数据分析 电商系统 实时数仓 离线数仓 数据湖 建设方案及实战代码,涉及组件 #flink #paimon #doris #seatunnel #dolphinscheduler #datart #dinky #hudi #iceberg。
leesf/hudi-resources
汇总Apache Hudi相关资料
fancyChuan/bigdata-hub
数据建设与大数据技术知识体系,包含hadoop、hive、spark、flink主流框架和系列框架,数据中台、数据湖、数据治理、数仓建设、数据化转型等
apache/hudi-rs
The native Rust implementation for Apache Hudi, with Python API bindings.
izhangzhihao/Real-time-Data-Warehouse
Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi
WeBankFinTech/Streamis
Streaming application development and management system, based on Linkis and DSS, planning to provide the workflow-like graphical drag-and-drop development capability.
apache/doris-website
Apache Doris Website
leesf/hudi-demos
汇总Apache Hudi中的一些Demo,便于快速上手Apache Hudi(Apache Hudi Demos to help beginners know about Hudi)
1ambda/lakehouse
Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)
dacort/modern-data-lake-storage-layers
Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake work
Mrkuhuo/bigdata_learning
大数据组件学习代码
apache/doris-thirdparty
Self-managed thirdparty dependencies for Apache Doris
jaehyeon-kim/dbt-on-aws
dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats
apache/doris-streamloader
Stream Loader for Apache Doris
shangyuantech/hudi-multistream
Consumption and writing to Hudi based on multiple topic
apache/doris-sdk
SDK for Apache Doris
bihaiyang/datalake-example
Data lake implementation demo, include iceberg on flink, iceberg on spark, hudi on flink, hudi on spark
sanhebigdata/bigdatateam
数据中台建设、离线数仓建设、实时数仓建设、数据湖建设、区块链技术应用。组件包括但不限于:flink/spark/hadoop/hive/kafka/doris/kudu/clickhouse
xushiyan/apachehudi-from0to1
Companion code and examples for blog series - Apache Hudi: From Zero To One
ev2900/EMR_Studio_Hudi
Apache Hudi examples designed to be run on AWS Elastic Map Reduce (EMR) via. EMR Studio or EMR Notebooks
MaximeGuinard/HUD-MX
⌨️ A hud addon it allows to have a personalized HUD menu for Garry's mod
Chenzhiling/datalake-metadata-api
this project can help you to get iceberg,delta,hudi table's metadata info by java
JinsYin/awesome-datalake
📚 Awesome list for Data Lake
cevoaustralia/data-lake-demo
Data lake demo using change data capture (CDC) on AWS
guanlisheng/presto-event-stream
Stream events from presto to a kafka topic
jasondavindev/delta-lake-dms-cdc
Example application for DMS CDC with Delta Lake and Apache Hudi
runalddsouza/hudi-kafka
Data ingestion using Hudi DeltaStreamer and Kafka
Data-Kube/tst-datalakehouse-hudi
#Test - Create a Data Lakehouse in Kubernetes
mpouttu/djynn
A pythonic FOSS expert system built by a collaboration of data engineers, devops, and analysts to automate common corporate data use cases. Pronounced "gin" as in engine.
OpenTableFormat/OpenTableFormat.github.io
Website for open table format 🕸