streaming-data
There are 426 repositories under streaming-data topic.
oxnr/awesome-bigdata
A curated list of awesome big data frameworks, ressources and other awesomeness.
provectus/kafka-ui
Open-Source Web UI for Apache Kafka Management
johnkerl/miller
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
redpanda-data/connect
Fancy stream processing made operationally mundane
MaterializeInc/materialize
The Cloud Operational Data Store: use SQL to transform, deliver, and act on fast-changing data.
online-ml/river
🌊 Online machine learning in Python
readysettech/readyset
Readyset is a MySQL and Postgres wire-compatible caching layer that sits in front of existing databases to speed up queries and horizontally scale read throughput. Under the hood, ReadySet caches the results of cached select statements and incrementally updates these results over time as the underlying data changes.
infinyon/fluvio
Lean and mean distributed stream processing system written in rust and web assembly. Alternative to Kafka + Flink in one.
piskvorky/smart_open
Utils for streaming large files (S3, HDFS, gzip, bz2...)
memgraph/memgraph
Open-source graph database, tuned for dynamic analytics environments. Easy to adopt, scale and own.
pravega/pravega
Pravega - Streaming as a new software defined storage primitive
reugn/go-streams
A lightweight stream processing library for Go
bytewax/bytewax
Python Stream Processing
microsoft/Trill
Trill is a single-node query processor for temporal or streaming data.
python-streamz/streamz
Real-time stream processing for python
quixio/quix-streams
Python stream processing for Kafka
zpl-c/zpl
📐 Pushing the boundaries of simplicity
joshday/OnlineStats.jl
⚡ Single-pass algorithms for statistics
scikit-multiflow/scikit-multiflow
A machine learning package for streaming data in Python. The other ancestor of River.
hstreamdb/hstream
HStreamDB is an open-source, cloud-native streaming database for IoT and beyond. Modernize your data stack for real-time applications.
kafbat/kafka-ui
Open-Source Web UI for managing Apache Kafka clusters
streamdal/streamdal
Code-Native Data Privacy
infoslack/awesome-kafka
A list about Apache Kafka
Stratio/sparta
Real Time Analytics and Data Pipelines based on Spark Streaming
kLabUM/rrcf
🌲 Implementation of the Robust Random Cut Forest algorithm for anomaly detection on streams
swimos/swim
Full stack application platform for building stateful microservices, streaming APIs, and real-time UIs
guillermo-navas-palencia/optbinning
Optimal binning: monotonic binning with constraints. Support batch & stream optimal binning. Scorecard modelling and counterfactual explanations.
radiantly/you-cant-download-this-image
Downloading images from the web is as easy as right clicking them and selecting "Save image as..", right? Well, not anymore xD
lightbend/cloudflow
Cloudflow enables users to quickly develop, orchestrate, and operate distributed streaming applications on Kubernetes.
microsoft/data-accelerator
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
Chulong-Li/Real-time-Sentiment-Tracking-on-Twitter-for-Brand-Improvement-and-Trend-Recognition
A real-time interactive web app based on data pipelines using streaming Twitter data, automated sentiment analysis, and MySQL&PostgreSQL database (Deployed on Heroku)
keithknott26/datadash
Visualize and graph data in the terminal
goodboy/tractor
A distributed, structured concurrent runtime for Python (and friends)
bbejeck/kafka-streams-in-action
Source code for the Kafka Streams in Action Book
selimfirat/pysad
Streaming Anomaly Detection Framework in Python (Outlier Detection for Streaming Data)
Western-OC2-Lab/PWPAE-Concept-Drift-Detection-and-Adaptation
Data stream analytics: Implement online learning methods to address concept drift and model drift in data streams using the River library. Code for the paper entitled "PWPAE: An Ensemble Framework for Concept Drift Adaptation in IoT Data Streams" published in IEEE GlobeCom 2021.