Flamefork's Stars
pola-rs/polars
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
TanStack/table
🤖 Headless UI for building powerful tables & datagrids for TS/JS - React-Table, Vue-Table, Solid-Table, Svelte-Table
seaweedfs/seaweedfs
SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, cross-DC active-active replication, Kubernetes, POSIX FUSE mount, S3 API, S3 Gateway, Hadoop, WebDAV, encryption, Erasure Coding.
juanfont/headscale
An open source, self-hosted implementation of the Tailscale control server
duckdb/duckdb
DuckDB is an analytical in-process SQL database management system
PrefectHQ/prefect
Prefect is a workflow orchestration tool empowering developers to build, observe, and react to data pipelines
airbytehq/airbyte
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
apache/arrow
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.
jackc/pgx
PostgreSQL driver and toolkit for Go
redpanda-data/redpanda
Redpanda is a streaming data platform for developers. Kafka API compatible. 10x faster. No ZooKeeper. No JVM!
vaexio/vaex
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
apache/beam
Apache Beam is a unified programming model for Batch and Streaming data processing.
mage-ai/mage-ai
🧙 Build, run, and manage data pipelines for integrating and transforming data.
ibis-project/ibis
the portable Python dataframe library
redpanda-data/console
Redpanda Console is a developer-friendly UI for managing your Kafka/Redpanda workloads. Console gives you a simple, interactive approach for gaining visibility into your topics, masking data, managing consumer groups, and exploring real-time data with time-travel debugging.
facebookincubator/velox
A C++ vectorized database acceleration library aimed to optimizing query engines and data processing systems.
unionai-oss/pandera
A light-weight, flexible, and expressive statistical data testing library
riverqueue/river
Fast and reliable background jobs in Go
delta-io/delta-rs
A native Rust library for Delta Lake, with bindings into Python
twmb/franz-go
franz-go contains a feature complete, pure Go library for interacting with Kafka from 0.8.0 through 3.6+. Producing, consuming, transacting, administrating, etc.
spotify/voyager
🛰️ An approximate nearest-neighbor search library for Python and Java with a focus on ease of use, simplicity, and deployability.
aio-libs/aiokafka
asyncio client for kafka
hapostgres/pg_auto_failover
Postgres extension and service for automated failover and high-availability
segmentio/topicctl
Tool for declarative management of Kafka topics
delta-io/kafka-delta-ingest
A highly efficient daemon for streaming data from Kafka into Delta Lake
sdatkinson/NeuralAmpModelerCore
Core DSP library for NAM plugins
jhnnsrs/turms
Turms is a pure python implementation of the awesome graphql-codegen library, following a simliar extensible design.
danielgafni/dagster-polars
[Project moved] Polars integration for Dagster
simon3z/webhook-logger
A small server that saves incoming webhook JSON objects and makes them queryable over HTTP.