/RBIR

A collection of RBIR projects and posts for anyone interested in joining this journey.

Primary LanguageRust

RBIR

RBIR stands for Rewrite Bigdata in Rust. RBIR aims to create a big data ecosystem using Rust.

This project declares our manifesto and serves as a collection of RBIR projects and posts for anyone interested in joining this journey.

Projects

  • Apache DataFusion Comet github-repo start-contribute

    A high-performance accelerator for Apache Spark, built on top of the powerful Apache DataFusion query engine.

  • Apache HoraeDB (incubating) github-repo start-contribute

    A high-performance, distributed, cloud native time-series database.

  • Arroyo github-repo start-contribute

    A distributed stream processing engine written in Rust, designed to efficiently perform stateful computations on streams of data.

  • BLAZE github-repo start-contribute

    The Blaze accelerator for Apache Spark leverages native vectorized execution to accelerate query processing.

  • Daft github-repo start-contribute

    A distributed query engine for large-scale data processing in Python and is implemented in Rust.

  • Databend github-repo start-contribute

    An open-source cloud data warehouse that serves as a cost-effective alternative to Snowflake

  • Fluvio github-repo start-contribute

    Lean and mean distributed stream processing system written in rust and web assembly. Alternative to Kafka + Flink in one.

  • GlareDB github-repo start-contribute

    An analytics DBMS for distributed data.

  • GreptimeDB github-repo start-contribute

    An open-source, cloud-native, unified time series database for metrics, logs and events with SQL/PromQL supported.

  • LanceDB github-repo start-contribute

    An open-source database for vector-search built with persistent storage, which greatly simplifies retrieval, filtering and management of embeddings.

  • ParadeDB github-repo start-contribute

    An Elasticsearch alternative built on Postgres.

  • Quickwit github-repo start-contribute

    Cloud-native search engine for observability. An open-source alternative to Datadog, Elasticsearch, Loki, and Tempo

  • RisingWave github-repo start-contribute

    A Postgres-compatible SQL database engineered to provide the simplest and most cost-efficient approach for processing, analyzing, and managing real-time event streaming data

  • SlateDB github-repo start-contribute

    A cloud native embedded storage engine built on object storage.

  • TiKV github-repo start-contribute

    Distributed transactional key-value database, originally created to complement TiDB

  • influxdb github-repo start-contribute

    The leading open source time series database for metrics, events, and real-time analytics.

Libraries

Posts