doki23's Stars
excalidraw/excalidraw
Virtual whiteboard for sketching hand-drawn like diagrams
mtdvio/every-programmer-should-know
A collection of (mostly) technical things every software developer should know about
apache/spark
Apache Spark - A unified analytics engine for large-scale data processing
apache/flink
Apache Flink
datafuselabs/databend
𝗗𝗮𝘁𝗮, 𝗔𝗻𝗮𝗹𝘆𝘁𝗶𝗰𝘀 & 𝗔𝗜. Modern alternative to Snowflake. Cost-effective and simple for massive-scale analytics. https://databend.com
apache/iceberg
Apache Iceberg
apache/datafusion
Apache DataFusion SQL Query Engine
real-logic/agrona
High Performance data structures and utility methods for Java
apache/arrow-rs
Official Rust implementation of Apache Arrow
apache/parquet-java
Apache Parquet Java
vigna/fastutil
fastutil extends the Java™ Collections Framework by providing type-specific maps, sets, lists and queues.
Warrenren/inside-rust-std-library
本书已经正式出版,目前正预售,可在京东搜索《深入RUST标准库》即可。本书主要对RUST的标准库代码进行分析,并试图给出RUST标准库代码的分析脉络。This project try to give a venation of how reading the RUST standard library source code.
spiraldb/vortex
An extensible, state-of-the-art columnar file format
openjdk/jol
https://openjdk.org/projects/code-tools/jol
apache/datasketches-java
A software library of stochastic streaming algorithms, a.k.a. sketches.
feldera/feldera
The Feldera Incremental Computation Engine
apache/datafusion-comet
Apache DataFusion Comet Spark Accelerator
apache/ozone
Scalable, reliable, distributed storage system optimized for data analytics and object store workloads.
tonbo-io/tonbo
A portable embedded database using Arrow.
apache/iceberg-rust
Apache Iceberg
LucaCanali/sparkMeasure
This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spark jobs. It focuses on easing the collection and examination of Spark metrics, making it a practical choice for both developers and data engineers.
foyer-rs/foyer
Hybrid in-memory and disk cache in Rust
kangkaisen/olap-performance
OLAP Database Performance Tuning Guide
fast/fastrace
A tracing library 10~100x faster than others
haraldng/omnipaxos
OmniPaxos is a distributed log implemented as a Rust library.
apache/hudi-rs
The native Rust implementation for Apache Hudi, with Python API bindings.
apache/paimon-rust
Apache Paimon Rust The rust implementation of Apache Paimon.
Kimahriman/hdfs-native
Gifted-s/velarixdb
An LSM storage engine designed for high throughput and significant reduction in I/O amplification written in safe rust (Under active development)