ethany21's Stars
awesome-spark/awesome-spark
A curated list of awesome Apache Spark packages and resources.
apache/ignite
Apache Ignite
apache/hudi
Upserts, Deletes And Incremental Processing on Big Data.
apache/shardingsphere
Distributed SQL transaction & query engine for data sharding, scaling, encryption, and more - on any database.
apache/iceberg-rust
Apache Iceberg
apache/hudi-rs
A native Rust library for Apache Hudi, with bindings into Python
apache/doris
Apache Doris is an easy-to-use, high performance and unified analytics database.
apache/fury
A blazingly fast multi-language serialization framework powered by JIT and zero-copy.
apache/kvrocks
Apache Kvrocks is a distributed key value NoSQL database that uses RocksDB as storage engine and is compatible with Redis protocol.
apache/opendal
Apache OpenDAL: access data freely.
apache/polaris
The interoperable, open source catalog for Apache Iceberg
apache/paimon-trino
Trino Connector for Apache Paimon.
facebook/rocksdb
A library that provides an embeddable, persistent key-value store for fast storage.
debezium/debezium
Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.
apache/pulsar
Apache Pulsar - distributed pub-sub messaging system
redpanda-data/redpanda
Redpanda is a streaming data platform for developers. Kafka API compatible. 10x faster. No ZooKeeper. No JVM!
apache/kafka
Mirror of Apache Kafka
apache/arrow-adbc
Database connectivity API standard and libraries for Apache Arrow
apache/arrow
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
duckdb/duckdb
DuckDB is an analytical in-process SQL database management system
trinodb/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
facebookincubator/velox
A C++ vectorized database acceleration library aimed to optimizing query engines and data processing systems.
sushant2019/bustub-private
My repository for the code for CMU-DB Intro to Database Course by Andy Pavlo
paradedb/paradedb
Postgres for Search and Analytics
apache/incubator-xtable
Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.
PacktPublishing/In-Memory-Analytics-with-Apache-Arrow-
In-Memory Analytics with Apache Arrow, published by Packt
apache/amoro
Apache Amoro (incubating) is a Lakehouse management system built on open data lake formats.
Eventual-Inc/Daft
Distributed DataFrame for Python designed for the cloud, powered by Rust
voltrondata/sqlflite
An example Flight SQL Server implementation - with DuckDB and SQLite back-ends.
apache/arrow-rs
Official Rust implementation of Apache Arrow