structured-streaming
There are 93 repositories under the structured-streaming topic.
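All of the repositories below build on Spark's Structured Streaming API. As a quick orientation, a minimal streaming query might look like the following sketch (using the built-in rate source so no external system is needed; it is not taken from any repository on this page):

```scala
import org.apache.spark.sql.SparkSession

object RateToConsole {
  def main(args: Array[String]): Unit = {
    // Local session for demonstration only.
    val spark = SparkSession.builder()
      .appName("rate-to-console")
      .master("local[*]")
      .getOrCreate()

    // The built-in "rate" source emits (timestamp, value) rows continuously,
    // which makes it handy for trying out streaming queries without Kafka.
    val stream = spark.readStream
      .format("rate")
      .option("rowsPerSecond", "5")
      .load()

    // Print each micro-batch to stdout; checkpointing is omitted for brevity.
    stream.writeStream
      .format("console")
      .outputMode("append")
      .start()
      .awaitTermination()
  }
}
```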
lw-lin/CoolplaySpark
Coolplay Spark: Spark source code analysis, Spark libraries, and more
databricks/LearningSparkV2
The GitHub repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
japila-books/spark-structured-streaming-internals
The Internals of Spark Structured Streaming
Azure/azure-event-hubs-spark
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
polomarcus/Spark-Structured-Streaming-Examples
Spark Structured Streaming / Kafka / Cassandra / Elastic
qubole/kinesis-sql
Kinesis Connector for Structured Streaming
streamnative/pulsar-spark
Spark Connector to read and write with Pulsar
chermenin/spark-states
Custom state store providers for Apache Spark
radoslawkrolikowski/financial-market-data-analysis
Real-Time Financial Market Data Processing and Prediction application
IBM/kafka-streaming-click-analysis
Use Kafka and Apache Spark streaming to perform click stream analytics
astrolabsoftware/fink-broker
Astronomy Broker based on Apache Spark
zaleslaw/Spark-Tutorial
How do you build your first Spark application with MLlib, Structured Streaming, GraphFrames, Datasets, and more? The answer is here!
Klarrio/open-stream-processing-benchmark
This repository contains the code base for the Open Stream Processing Benchmark.
HeartSaVioR/spark-sql-kafka-offset-committer
Kafka offset committer for structured streaming query
sankamuk/PysparkCheatsheet
PySpark Cheatsheet
HeartSaVioR/spark-state-tools
Spark Structured Streaming State Tools
AndrewKuzmin/spark-structured-streaming-examples
Spark Structured Streaming examples using version 3.5.1
aamend/spark-gdelt
Binding the GDELT universe in a Spark environment
mozilla/telemetry-streaming
Spark Streaming ETL jobs for Mozilla Telemetry
sev7e0/wow-spark
:high_brightness: A self-study Spark handbook covering Spark Core, Spark SQL, Spark Streaming, Spark-Kafka, and Delta Lake, plus basic Scala exercises, along with source code analyses of topics such as master and shuffle, with summaries and translations.
qubole/s3-sqs-connector
A library for reading data from Amazon S3 with optimized listing via Amazon SQS, using Spark SQL Streaming (Structured Streaming).
qubole/streaminglens
Qubole Streaminglens tool for tuning Spark Structured Streaming Pipelines
qubole/spark-state-store
RocksDB state storage implementation for Structured Streaming.
zekeriyyaa/PySpark-Structured-Streaming-ROS-Kafka-ApacheSpark-Cassandra
Structured Streaming applied to robot data from a ROS-Gazebo simulation environment: data is collected in Kafka, analyzed with Apache Spark, and stored in Cassandra.
Neuw84/structured-streaming-avro-demo
Spark 3.0.0 Structured Streaming Kafka Avro Demo
xiaogp/recsys_structured_streaming
Kafka + Structured Streaming + Phoenix + Elasticsearch: popularity-based and user-preference recommendations built on behavior logs, with a recall-fusion strategy.
aws-samples/iceberg-streaming-examples
This repo contains examples of high-throughput ingestion using Apache Spark and Apache Iceberg. The examples cover IoT and CDC scenarios using best practices. The code can be deployed to any Spark-compatible engine, such as Amazon EMR Serverless or AWS Glue. A fully local developer environment is also provided.
NashTech-Labs/structured-streaming-application
A reference application showing how to integrate Apache Spark Structured Streaming, Apache Cassandra, and Apache Kafka for fast streaming computations on data.
yjshen/spark-connector-test
A tutorial on how to use pulsar-spark-connector
awslabs/aws-cloudwatch-metrics-custom-spark-listener
Example Spark Streaming code with custom listeners that push streaming metrics to Amazon CloudWatch.
epishova/Structured-Streaming-Cassandra-Sink
An example of how to create and use a Cassandra sink in a Spark Structured Streaming application
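The common pattern repos like this implement is to reuse the batch Cassandra writer inside `foreachBatch`. A minimal sketch (not this repo's actual code; it assumes the DataStax spark-cassandra-connector is on the classpath, and the keyspace and table names are placeholders):

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}

object CassandraSinkSketch {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder().appName("cassandra-sink").getOrCreate()

    val events = spark.readStream
      .format("kafka")
      .option("kafka.bootstrap.servers", "localhost:9092")
      .option("subscribe", "events") // placeholder topic
      .load()
      .selectExpr("CAST(key AS STRING) AS key", "CAST(value AS STRING) AS value")

    // foreachBatch hands each micro-batch to ordinary batch code, so the
    // connector's batch writer can serve as a streaming sink.
    // Binding the function to a typed val avoids Scala overload ambiguity.
    val writeToCassandra: (DataFrame, Long) => Unit = (batch, _) =>
      batch.write
        .format("org.apache.spark.sql.cassandra")
        .option("keyspace", "demo")  // placeholder keyspace
        .option("table", "events")   // placeholder table
        .mode("append")
        .save()

    events.writeStream
      .foreachBatch(writeToCassandra)
      .option("checkpointLocation", "/tmp/checkpoints/cassandra-sink")
      .start()
      .awaitTermination()
  }
}
```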
Rishav273/kafkaPysparkAnalytics
Real-time ETL pipeline for financial data (Kafka, PySpark).
chenyyyang/spark-sql-custom-mq-dataSource
Sample code for an MQ data source implemented with the Spark 3.1.x Data Source API
cynthia1wang/jdbcsink
A test program that uses Spark Structured Streaming to receive JSON messages from Kafka and count DNS queries per user over one-minute windows (treating each source IP address as one user). The results are inserted into a MySQL database.
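A query of that shape can be sketched roughly as follows (a hedged sketch, not the repo's code: the Kafka topic, JSON field names, and JDBC credentials are all placeholder assumptions):

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}
import org.apache.spark.sql.functions._

object DnsCountSketch {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder().appName("dns-counts").getOrCreate()
    import spark.implicits._

    val raw = spark.readStream
      .format("kafka")
      .option("kafka.bootstrap.servers", "localhost:9092")
      .option("subscribe", "dns-logs") // placeholder topic
      .load()

    // Parse the JSON payload; the field names are assumptions.
    val parsed = raw
      .selectExpr("CAST(value AS STRING) AS json")
      .select(
        get_json_object($"json", "$.src_ip").as("src_ip"),
        get_json_object($"json", "$.ts").cast("timestamp").as("ts"))

    // One-minute tumbling window per source IP (one IP ~ one user);
    // the watermark bounds state for late data.
    val counts = parsed
      .withWatermark("ts", "2 minutes")
      .groupBy(window($"ts", "1 minute"), $"src_ip")
      .count()

    // Write each micro-batch to MySQL over JDBC (connection details
    // are placeholders).
    val writeToMysql: (DataFrame, Long) => Unit = (batch, _) =>
      batch.write
        .format("jdbc")
        .option("url", "jdbc:mysql://localhost:3306/dns")
        .option("dbtable", "dns_counts")
        .option("user", "spark")
        .option("password", "secret")
        .mode("append")
        .save()

    counts.writeStream
      .outputMode("update")
      .foreachBatch(writeToMysql)
      .option("checkpointLocation", "/tmp/checkpoints/dns-counts")
      .start()
      .awaitTermination()
  }
}
```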
TrainingByPackt/Big-Data-Processing-with-Apache-Spark-eLearning
Efficiently tackle large datasets and perform big data analysis with Spark and Python
thestyleofme/spark-explore
Learning the Spark ecosystem