Pinned Repositories
alldata
🔥🔥 BigData 💥 大数据 💥大数据AllData平台,通过二开大数据BigData生态组件,以及大数据BigData采集、大数据BigData存储、大数据BigData计算、大数据BigData开发来建设开源社区大数据BigData平台。联系作者: https://docs.qq.com/doc/DVFVMYUp6cFhSRVJs
amoro
Amoro is a Lakehouse management system built on open data lake formats.
bitsail
BitSail is a distributed high-performance data integration engine which supports batch, streaming and incremental scenarios. BitSail is widely used to synchronize hundreds of trillions of data every day.
ByConity
ByConity is an open source cloud-native data warehouse
cdhproject
hadoop各组件使用,持续更新
dataCompare
Database comparison platform: Hive table data comparison, MySQL data comparison, automatic configuration for data comparison, avoid frequent write SQL processing
ddia
《Designing Data-Intensive Application》DDIA中文翻译
dr-elephant
Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark
gluten
Gluten: Plugin to Double SparkSQL's Performance
incubator-kyuubi
Apache Kyuubi is a distributed multi-tenant JDBC server for large-scale data processing and analytics, built on top of Apache Spark
xufengnian2022's Repositories
xufengnian2022/alldata
🔥🔥 BigData 💥 大数据 💥大数据AllData平台,通过二开大数据BigData生态组件,以及大数据BigData采集、大数据BigData存储、大数据BigData计算、大数据BigData开发来建设开源社区大数据BigData平台。联系作者: https://docs.qq.com/doc/DVFVMYUp6cFhSRVJs
xufengnian2022/dataCompare
Database comparison platform: Hive table data comparison, MySQL data comparison, automatic configuration for data comparison, avoid frequent write SQL processing
xufengnian2022/incubator-kyuubi
Apache Kyuubi is a distributed multi-tenant JDBC server for large-scale data processing and analytics, built on top of Apache Spark
xufengnian2022/amoro
Amoro is a Lakehouse management system built on open data lake formats.
xufengnian2022/bitsail
BitSail is a distributed high-performance data integration engine which supports batch, streaming and incremental scenarios. BitSail is widely used to synchronize hundreds of trillions of data every day.
xufengnian2022/ByConity
ByConity is an open source cloud-native data warehouse
xufengnian2022/cdhproject
hadoop各组件使用,持续更新
xufengnian2022/ddia
《Designing Data-Intensive Application》DDIA中文翻译
xufengnian2022/dr-elephant
Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark
xufengnian2022/gluten
Gluten: Plugin to Double SparkSQL's Performance
xufengnian2022/polynote
A better notebook for Scala (and more)
xufengnian2022/seatunnel
SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.