spark-streaming-kafka
There are 42 repositories under spark-streaming-kafka topic.
zqhxuyuan/kafka-book
《Kafka技术内幕》代码
EthicalML/kafka-spark-streaming-zeppelin-docker
One click deploy docker-compose with Kafka, Spark Streaming, Zeppelin UI and Monitoring (Grafana + Kafka Manager)
tmcgrath/spark-scala
Spark with Scala example projects
artem0/kafka-scala-api
Samples for using Kafka within Spark Streaming and Akka Actors, Akka Streams
manasbundele/big-data-projects
These are a select few projects related to Big Data Analytics and Management. The projects listed are a combination of both small and big projects but interesting ones.
rafaelvp-db/databricks-end-to-end-streaming
End-to-end Kafka Streaming Examples on Databricks with Evolving Avro Schemas.
arpendu11/graph-based-data-lake
An ETL application which is written in Quarkus, Spark SQL Streaming, Neo4j and various types of Databases and stores. It also covers the devops frameworks like Jenkins CI/CD, docker and Kubernetes.
AbdullahMu/Data-Streaming-Nanodegree-Project_02-Evaluate-Human-Balance-with-Spark-Streaming
Design data streaming architecture and API for a real-life application called the Step Trending Electronic Data Interface (STEDI). It is a working application used to assess fall risk for seniors. When a senior takes a test, they are scored using an index which reflects the likelihood of falling, and potentially sustaining an injury in the course of walking. STEDI uses a Redis datastore for risk score and other data. The Data Science team has completed a working graph for population risk at a STEDI clinic. The problem is the data is not populated yet. You will work with Kafka Connect Redis Source events and Business Events to create a Kafka topic containing anonymized risk scores of seniors in the clinic.
dharaneeshvrd/spark-examples
Spark Examples
haozhang-x/log-analysis-spark
Structured Streaming Log Analysis
roksolana-d/spark-streaming-examples
Research on legacy and structured streaming with Spark
chiayongjian/twitter-kafka-sparkstreaming
A working example of Twitter -> Kafka -> Spark Streaming integration by a beginner
faizpuad/DataEngineeringProject-DocumentStreamingWithData
The core objective of this project is to build an end-to-end data streaming pipeline that processes this dataset in real-time. By leveraging modern data engineering tools and techniques, we aim to connect, buffer, process, store, and visualize streaming data. This allows for better understanding of data flows, handling of large-scale real-time data
froblesmartin/BachFinalProject
Project to compare Apache Spark Streaming vs Apache Flink.
ludengke95/spark-streaming-kafka-template
SparkStreaming新手友好向模板,简化SparkStreaming开发
martinywwan/spark-kafka-streaming
Near real-time streaming using Apache Spark and Apache Kafka
pereldegla/twitter-trend-sentiment-analysis-world-cup-use-case
How to get closer to the audience using Twitter: an use case following the France football team run during the 2022 World Cup
Ragadeepthi/Machine-Learning-on-Bigdata--Loading-Data-using-Kafka-and-Flume
Playstore apps rating analysis - Machine Learning on Bigdata- Loading streaming Data using Kafka and Flume
rajeshsantha/MonitoredStructuredStreaming
Repository for Spark structured streaming use case implementations.
sergei-grigorev/spark-streaming-project
In-Stream final project
silverstone1903/stream-101
Intro to streaming data with Kafka, Spark and AWS Glue
vchoudhari45/spark-kafka-integration
spark-kafka-integration
viyadb/viyadb-spark
Data processing ang ingestion backend for ViyaDB based on Spark streaming
daniel-pape/nyc-ml-app
Streaming ML with NYC taxi data
fadhilyori/kaspacore
Mata Elang | Data Preprocessing using Scala and Spark
hieuung/Streaming-Kafka
Using various data processing tool for real time data pipeline with Kafka
michelheil/BigData
Projects related to Big Data technologies
monyedavid/substance-effects-on-reflexes
substance effects on reflexes
trendyol-data-eng-summer-intern-2019/recom-engine-streaming
Streaming component of the project, which is written with Spark Streaming.
abhay6694/PySpark-Component
Collection of spark-components functions for big-data processing
chandreshsutariya/bigdata---train-analysis
batch processing and realtime tains(railway) data analysis to help Station Masters refreshing each 20 seconds
deepcloudlabs/dcl700-2021-jun-21
DCL-700: Big Data Essentials
Faisal-AlDhuwayhi/Evaluate-Human-Balance-with-Spark-Streaming
Design a data streaming pipeline around Apache Spark, Kafka, and Redis for a real-time application
rotemfogel/spark-streaming-app
Spark Streaming Playground