Pinned Repositories
alibabacloud-dla-demo
alibabacloud-dla-demo
amoro
Amoro is a Lakehouse management system built on open data lake formats.
compass
Compass is a task diagnosis platform for bigdata
CoolplaySpark
酷玩 Spark: Spark 源代码解析、Spark 类库等
dinky
Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.
doris
Apache Doris is an MPP-based interactive SQL data warehousing for reporting and analysis.
doris-flink-connector
Flink Connector for Apache Doris
doris-manager
Cluster manager for Apache Doris
kylin
Apache Kylin
ssb-dbgen
Star Schema Benchmark dbgen
liujinhui1994's Repositories
liujinhui1994/alibabacloud-dla-demo
alibabacloud-dla-demo
liujinhui1994/amoro
Amoro is a Lakehouse management system built on open data lake formats.
liujinhui1994/compass
Compass is a task diagnosis platform for bigdata
liujinhui1994/dinky
Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.
liujinhui1994/doris
Apache Doris is an MPP-based interactive SQL data warehousing for reporting and analysis.
liujinhui1994/doris-flink-connector
Flink Connector for Apache Doris
liujinhui1994/doris-manager
Cluster manager for Apache Doris
liujinhui1994/doris-spark-connector
Spark Connector for Apache Doris
liujinhui1994/flink
Apache Flink
liujinhui1994/emr-hudi-example
emr-hudi-example
liujinhui1994/flink-cos-fs
Flink-cos-fs 是腾讯云对象存储系统COS针对Flink的文件系统实现,并且支持了recoverwriter接口。
liujinhui1994/fluss
Fluss is a streaming storage built for real-time analytics.
liujinhui1994/gluten
Gluten: Plugin to Double SparkSQL's Performance
liujinhui1994/go-ldap-admin
🌉 基于Go+Vue实现的openLDAP后台管理项目
liujinhui1994/gravitino
World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.
liujinhui1994/hertzbeat
Apache HertzBeat(incubating) is a real-time monitoring system with agentless, performance cluster, prometheus-compatible, custom monitoring and status page building capabilities.
liujinhui1994/hudi
Upserts, Deletes And Incremental Processing on Big Data.
liujinhui1994/incubator-celeborn
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
liujinhui1994/incubator-seatunnel
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
liujinhui1994/incubator-uniffle
Uniffle is a high performance, general purpose Remote Shuffle Service.
liujinhui1994/jiron-cloud
liujinhui1994/kyuubi
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
liujinhui1994/LakeView
Monitoring and insights on your data lakehouse tables
liujinhui1994/nessie
Nessie: Transactional Catalog for Data Lakes with Git-like semantics
liujinhui1994/OpenMetadata
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
liujinhui1994/paimon
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
liujinhui1994/Qualitis
Qualitis is a one-stop data quality management platform that supports quality verification, notification, and management for various datasource. It is used to solve various data quality problems caused by data processing. https://github.com/WeBankFinTech/Qualitis
liujinhui1994/sorafm
Sora AI Video Showcases by Sora.FM
liujinhui1994/starrocks
StarRocks is a next-gen sub-second MPP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics and ad-hoc query.
liujinhui1994/supersonic
SuperSonic is the next-generation BI+AI platform that combines Chat BI (powered by LLM) and Headless BI (powered by semantic layer) paradigms.