harveyyue

Shanghai, China

Pinned Repositories

1brc
1️⃣🐝🏎️ The One Billion Row Challenge -- A fun exploration of how quickly 1B rows from a text file can be aggregated with Java
Language:Java00
ansj_seg
ansj分词.ict的真正java实现.分词效果速度都超过开源版的ict. 中文分词,人名识别,词性标注,用户自定义词典
Language:Java00
arrow-ballista
Apache Arrow Ballista Distributed Query Engine
Language:Rust00
blaze
Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.
Language:Rust00
Burrow
Kafka Consumer Lag Checking
Language:Go0 0 00
celeborn
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
Language:Java00
concurrent-map
a thread-safe concurrent map for go
Language:Go0 2 00
config
configuration library for JVM languages using HOCON files
Language:Java00
debezium
Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.
Language:Java00
paimon
Apache Paimon(incubating) is a streaming data lake platform that supports high-speed data ingestion, change data tracking and efficient real-time analytics.
Language:Java02

harveyyue's Repositories

harveyyue/1brc
1️⃣🐝🏎️ The One Billion Row Challenge -- A fun exploration of how quickly 1B rows from a text file can be aggregated with Java
Language:Java00
harveyyue/arrow-ballista
Apache Arrow Ballista Distributed Query Engine
Language:Rust00
harveyyue/blaze
Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.
Language:Rust00
harveyyue/celeborn
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
Language:Java00
harveyyue/debezium
Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.
Language:Java00
harveyyue/hudi
Upserts And Incremental Processing on Big Data
Language:Java01
harveyyue/paimon
Apache Paimon(incubating) is a streaming data lake platform that supports high-speed data ingestion, change data tracking and efficient real-time analytics.
Language:Java02
harveyyue/starrocks
StarRocks is a next-gen sub-second MPP database for full analysis senarios, including multi-dimensional analytics, real-time analytics and ad-hoc query, formerly known as DorisDB.
Language:Java01
harveyyue/datafusion
Apache Arrow DataFusion SQL Query Engine
Language:Rust
harveyyue/datafusion-orc
Implementation of Apache ORC file format use Apache Arrow in-memory format
Language:Rust1
harveyyue/debezium-connector-cassandra
An incubating Debezium CDC connector for Apache Cassandra
Language:Java1 0
harveyyue/debezium-connector-db2
An incubating Debezium connector for Db2
Language:Java
harveyyue/debezium-connector-informix
An incubating Debezium CDC connector for IBM Informix database
Language:Java0 0
harveyyue/debezium-connector-jdbc
An exploration for building a JDBC sink connector aware of the Debezium change event format
Language:Java
harveyyue/debezium-connector-spanner
An incubating Debezium CDC connector for Google Spanner
Language:Java
harveyyue/debezium-connector-vitess
An incubating Debezium CDC connector for Vitess
Language:Java
harveyyue/doris
Apache Doris (Incubating)
Language:Java1
harveyyue/flink
Apache Flink
Language:Java1 01
harveyyue/flink-cdc-connectors
Change Data Capture (CDC) Connectors for Apache Flink
Language:Java1 0
harveyyue/kafka
Mirror of Apache Kafka
harveyyue/kafka-connect-jdbc
Kafka Connect connector for JDBC-compatible databases
Language:Java1 0
harveyyue/kafka-connect-storage-cloud
Kafka Connect suite of connectors for Cloud storage (Amazon S3)
Language:Java
harveyyue/kcctl
A modern and intuitive command line client for Kafka Connect
Language:Java
harveyyue/merkle-proof
Language:Java1
harveyyue/mysql-binlog-connector-java
MySQL Binary Log connector
Language:Java1 0
harveyyue/schema-registry
Confluent Schema Registry for Kafka
harveyyue/seatunnel
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
Language:Java0 01
harveyyue/spark
Apache Spark - A unified analytics engine for large-scale data processing
Language:Scala0 0
harveyyue/tiflow
This repo maintains DM (a data migration platform) and TiCDC (change data capture for TiDB)
Language:Go
harveyyue/web3j
Lightweight Java and Android library for integration with Ethereum clients
Language:Java