stream-processing
There are 980 repositories under stream-processing topic.
google-ai-edge/mediapipe
Cross-platform, customizable ML solutions for live and streaming media.
vectordotdev/vector
A high-performance observability data pipeline.
zhisheng17/flink-learning
flink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例,还有 Flink 落地应用的大型项目案例(PVUV、日志存储、百亿数据实时去重、监控告警)分享。欢迎大家支持我的专栏《大数据实时计算引擎 Flink 实战与性能优化》
oxnr/awesome-bigdata
A curated list of awesome big data frameworks, ressources and other awesomeness.
madd86/awesome-system-design
A curated list of awesome System Design (A.K.A. Distributed Systems) resources.
redpanda-data/redpanda
Redpanda is a streaming data platform for developers. Kafka API compatible. 10x faster. No ZooKeeper. No JVM!
redpanda-data/connect
Fancy stream processing made operationally mundane
ThreeDotsLabs/watermill
Building event-driven applications the easy way in Go.
pathwaycom/pathway
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
risingwavelabs/risingwave
Best-in-class stream processing, analytics, and management. Perform continuous analytics, or build event-driven applications, real-time ETL pipelines, and feature stores in minutes. Unified streaming and batch. PostgreSQL compatible.
robinhood/faust
Python Stream Processing
hazelcast/hazelcast
Hazelcast is a unified real-time data platform combining stream processing with a fast data store, allowing customers to act instantly on data-in-motion for real-time insights.
fluent/fluent-bit
Fast and Lightweight Logs and Metrics processor for Linux, BSD, OSX and Windows
MaterializeInc/materialize
The Cloud Operational Data Store: use SQL to transform, deliver, and act on fast-changing data.
apache/hudi
Upserts, Deletes And Incremental Processing on Big Data.
online-ml/river
🌊 Online machine learning in Python
javascriptdata/danfojs
Danfo.js is an open source, JavaScript library providing high performance, intuitive, and easy to use data structures for manipulating and processing structured data.
infinyon/fluvio
Lean and mean distributed stream processing system written in rust and web assembly. Alternative to Kafka + Flink in one.
ArroyoSystems/arroyo
Distributed stream processing engine in Rust
airtai/faststream
FastStream is a powerful and easy-to-use Python framework for building asynchronous services interacting with event streams such as Apache Kafka, RabbitMQ, NATS and Redis.
manuzhang/awesome-streaming
a curated list of awesome streaming frameworks, applications, etc
douban/dpark
Python clone of Spark, a MapReduce alike framework in Python
pipelinedb/pipelinedb
High-performance time-series aggregation for PostgreSQL
memgraph/memgraph
Open-source graph database, tuned for dynamic analytics environments. Easy to adopt, scale and own.
PeerDB-io/peerdb
Fast, Simple and a cost effective tool to replicate data from Postgres to Data Warehouses, Queues and Storage
francoispqt/gojay
high performance JSON encoder/decoder with stream API for Golang
reugn/go-streams
A lightweight stream processing library for Go
numaproj/numaflow
Kubernetes-native platform to run massively parallel data/streaming jobs
yomorun/yomo
🦖 Stateful Serverless Framework for Geo-distributed Edge AI Infra. with function calling support, write once, run on any model.
timeplus-io/proton
A stream processing engine and database, and a fast and lightweight alternative to ksqlDB and Apache Flink, 🚀 powered by ClickHouse
nerevu/riko
A Python stream processing engine modeled after Yahoo! Pipes
bytewax/bytewax
Python Stream Processing
siddhi-io/siddhi
Stream Processing and Complex Event Processing Engine
WallarooLabs/wally
Distributed Stream Processing
quixio/quix-streams
Python stream processing for Kafka
spring-cloud/spring-cloud-dataflow
A microservices-based Streaming and Batch data processing in Cloud Foundry and Kubernetes