Pinned Repositories
1brc
1️⃣🐝🏎️ The One Billion Row Challenge -- A fun exploration of how quickly 1B rows from a text file can be aggregated with Java
ansj_seg
ansj分词.ict的真正java实现.分词效果速度都超过开源版的ict. 中文分词,人名识别,词性标注,用户自定义词典
arrow-ballista
Apache Arrow Ballista Distributed Query Engine
blaze
Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.
Burrow
Kafka Consumer Lag Checking
celeborn
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
concurrent-map
a thread-safe concurrent map for go
config
configuration library for JVM languages using HOCON files
debezium
Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.
paimon
Apache Paimon(incubating) is a streaming data lake platform that supports high-speed data ingestion, change data tracking and efficient real-time analytics.
harveyyue's Repositories
harveyyue/CS-Notes
:books: Computer Science Learning Notes
harveyyue/merkle-tree-java
Bachelor thesis on Merkle Trees at Royal Institute of Technology KTH
harveyyue/ansj_seg
ansj分词.ict的真正java实现.分词效果速度都超过开源版的ict. 中文分词,人名识别,词性标注,用户自定义词典
harveyyue/kylin-mondrian
harveyyue/impala-udf-samples
Sample UDF and UDAs for Impala.
harveyyue/saiku3
The second development version based on the branch release-3.8 of saiku.
harveyyue/hbase.rowkeycomparator
harveyyue/kylin-mondrian-interaction
Some information about Apache Kylin interaction with Pentaho Mondrian
harveyyue/impala-udf-devel
Impala SDK for UDF development