caicancai
@apache Calcite & StreamPark Committer. Don't lose your motivation and keep and keep pushing.
@aftership | @apacheShenzhen, China
Pinned Repositories
calcite
Apache Calcite
airbyte
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Basic-data-structure
cmu15.445
CMU 15-445/645: Intro to Database Systems (Fall 2022). A course on the design and implementation of database management systems.
hive-parser
mini-lsm
A tutorial of building an LSM-Tree storage engine in a week!
MIT6.824
paper_reading_cn
just for fun
Simple_DBMS
risinglight
An educational OLAP database system.
caicancai's Repositories
caicancai/airbyte
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
caicancai/arrow-datafusion
Apache Arrow DataFusion SQL Query Engine
caicancai/risinglight
An educational OLAP database system.
caicancai/caicancai
Config files for my GitHub profile.
caicancai/calcite
Apache Calcite
caicancai/calcite-avatica
Apache Calcite Avatica
caicancai/cancai_learning
caicancai/datafusion-java
Java binding to Apache DataFusion
caicancai/duckdb
DuckDB is an in-process SQL OLAP Database Management System
caicancai/egg
egg is a flexible, high-performance e-graph library
caicancai/feldera
The Feldera Incremental Computation Engine
caicancai/flink
Apache Flink
caicancai/foyer
Hybrid in-memory and disk cache in Rust
caicancai/grpc-java
The Java gRPC implementation. HTTP/2 based RPC
caicancai/iceberg-rust
Apache Iceberg
caicancai/iggy
Iggy is the persistent message streaming platform written in Rust, supporting QUIC, TCP and HTTP transport protocols, capable of processing millions of messages per second.
caicancai/lance
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming..
caicancai/netty
Netty project - an event-driven asynchronous network application framework
caicancai/OpenMetadata
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
caicancai/optd
CMU-DB's Cascades optimizer framework
caicancai/pebble
RocksDB/LevelDB inspired key-value database in Go
caicancai/portable-simd
The testing ground for the future of portable SIMD in Rust
caicancai/prql
PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement
caicancai/pushgateway
Push acceptor for ephemeral and batch jobs.
caicancai/risingwave
Cloud-native SQL stream processing, analytics, and management. KsqlDB and Apache Flink alternative. 🚀 10x more productive. 🚀 10x more cost-efficient.
caicancai/roc
A fast, friendly, functional language.
caicancai/seatunnel
SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.
caicancai/skywalking-banyandb
An observability database aims to ingest, analyze and store Metrics, Tracing and Logging data.
caicancai/slatedb
A cloud native embedded storage engine built on object storage.
caicancai/yunikorn-core
Apache YuniKorn Core