kakamessi99's Stars
pola-rs/polars
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
dapr/dapr
Dapr is a portable, event-driven, runtime for building distributed applications across cloud and edge.
pybind/pybind11
Seamless operability between C++11 and Python
linkerd/linkerd2
Ultralight, security-first service mesh for Kubernetes. Main repo for Linkerd 2.x.
redpanda-data/redpanda
Redpanda is a streaming data platform for developers. Kafka API compatible. 10x faster. No ZooKeeper. No JVM!
pycaret/pycaret
An open-source, low-code machine learning library in Python
datafuselabs/databend
𝗗𝗮𝘁𝗮, 𝗔𝗻𝗮𝗹𝘆𝘁𝗶𝗰𝘀 & 𝗔𝗜. Modern alternative to Snowflake. Cost-effective and simple for massive-scale analytics. https://databend.com
facebookincubator/velox
A composable and fully extensible C++ execution engine library for data management systems.
flaneur2020/pua-lang
a dialect of The Monkey Programming Language
ekzhu/datasketch
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
rajasekarv/vega
A new arguably faster implementation of Apache Spark from scratch in Rust
fugue-project/fugue
A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
weaiken/ebook
classic books of computer science!
tensorbase/tensorbase
TensorBase is a new big data warehousing with modern efforts.
jni-rs/jni-rs
Rust bindings to the Java Native Interface — JNI
substrait-io/substrait
A cross platform way to express data transformation, relational algebra, standardized record expression and plans.
apache/datasketches-java
A software library of stochastic streaming algorithms, a.k.a. sketches.
AbsaOSS/spline
Data Lineage Tracking And Visualization Solution
rlink-rs/rlink-rs
High-performance Stream Processing Framework. An alternative to Apache Flink.
linkedin/transport
A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Apache Hive, and Presto.
flock-lab/flock
Flock: A Low-Cost Streaming Query Engine on FaaS Platforms
cloudfuse-io/buzz-rust
Serverless query engine
TU-Berlin-DIMA/scotty-window-processor
This repository provides Scotty, a framework for efficient window aggregations for out-of-order Stream Processing.
HeartSaVioR/spark-state-tools
Spark Structured Streaming State Tools
superjobru/clickhouse-sql-parser
Rust parser for Clickhouse SQL dialect.
sane-lab/Trisk
Trisk on Flink
Roronoa-Zoro/delay-server
delay message system, when message reaches its ready time, will delivery to kafka
hpides/disco
Stream processing engine for distributed window aggregation (EDBT '20)
nebulastream/distributed-scotty
Distributed general stream slicing on top of TU Berlin DIMA's scotty-window-processor
tigerst/rpc-tiger
rpc框架