streaming-data
There are 487 repositories under streaming-data topic.
oxnr/awesome-bigdata
A curated list of awesome big data frameworks, ressources and other awesomeness.
provectus/kafka-ui
Open-Source Web UI for Apache Kafka Management
johnkerl/miller
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
redpanda-data/connect
Fancy stream processing made operationally mundane
MaterializeInc/materialize
Real-time Data Integration and Transformation: use SQL to transform, deliver, and act on fast-changing data.
online-ml/river
🌊 Online machine learning in Python
readysettech/readyset
Readyset is a MySQL and Postgres wire-compatible caching layer that sits in front of existing databases to speed up queries and horizontally scale read throughput. Under the hood, ReadySet caches the results of cached select statements and incrementally updates these results over time as the underlying data changes.
infinyon/fluvio
🦀 event stream processing for developers to collect and transform data in motion to power responsive data intensive applications.
piskvorky/smart_open
Utils for streaming large files (S3, HDFS, gzip, bz2...)
memgraph/memgraph
Open-source graph database, tuned for dynamic analytics environments. Easy to adopt, scale and own.
reugn/go-streams
A lightweight stream processing library for Go
pravega/pravega
Pravega - Streaming as a new software defined storage primitive
bytewax/bytewax
Python Stream Processing
kafbat/kafka-ui
Open-Source Web UI for managing Apache Kafka clusters
quixio/quix-streams
Python Streaming DataFrames for Kafka
python-streamz/streamz
Real-time stream processing for python
microsoft/Trill
Trill is a single-node query processor for temporal or streaming data.
zpl-c/zpl
📐 Pushing the boundaries of simplicity
DoneDeal0/superdiff
Superdiff provides a complete and readable diff for both arrays and objects. Plus, it supports stream and file inputs for handling large datasets efficiently, is battle-tested, has zero dependencies, and is super fast.
joshday/OnlineStats.jl
⚡ Single-pass algorithms for statistics
scikit-multiflow/scikit-multiflow
A machine learning package for streaming data in Python. The other ancestor of River.
hstreamdb/hstream
HStreamDB is an open-source, cloud-native streaming database for IoT and beyond. Modernize your data stack for real-time applications.
streamdal/streamdal
Code-Native Data Privacy
infoslack/awesome-kafka
A list about Apache Kafka
Stratio/sparta
Real Time Analytics and Data Pipelines based on Spark Streaming
kLabUM/rrcf
🌲 Implementation of the Robust Random Cut Forest algorithm for anomaly detection on streams
graphform/swim
Full stack application platform for building stateful microservices, streaming APIs, and real-time UIs
guillermo-navas-palencia/optbinning
Optimal binning: monotonic binning with constraints. Support batch & stream optimal binning. Scorecard modelling and counterfactual explanations.
radiantly/you-cant-download-this-image
Downloading images from the web is as easy as right clicking them and selecting "Save image as..", right? Well, not anymore xD
lightbend/cloudflow
Cloudflow enables users to quickly develop, orchestrate, and operate distributed streaming applications on Kubernetes.
Chulong-Li/Real-time-Sentiment-Tracking-on-Twitter-for-Brand-Improvement-and-Trend-Recognition
A real-time interactive web app based on data pipelines using streaming Twitter data, automated sentiment analysis, and MySQL&PostgreSQL database (Deployed on Heroku)
microsoft/data-accelerator
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
keithknott26/datadash
Visualize and graph data in the terminal
pathwaycom/pathway-benchmarks
Benchmarks for data processing systems: Pathway, Spark, Flink, Kafka Streams
goodboy/tractor
distributed structured concurrency
selimfirat/pysad
Streaming Anomaly Detection Framework in Python (Outlier Detection for Streaming Data)