sparkstreaming
There are 37 repositories under the sparkstreaming topic.
LinMingQiang/sparkstreaming
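:boom: :rocket: Wraps sparkstreaming to adjust batch time dynamically (computation runs whenever data is available); :rocket: supports adding and removing topics at runtime; :rocket: wraps sparkstreaming 1.6 with kafka 010 to support SSL.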
amir-rahnama/pyspark-twitter-stream-mining
Real-time Machine Learning with Apache Spark on Twitter Public Stream
liumingmusic/HadoopLearning
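A complete set of introductory big data tutorials, starting from the very basics of centos and maven. The big data material mainly covers hdfs, mr, yarn, hbase, kafka, scala, sparkcore, sparkstreaming, and sparksql. The tutorials include all source code demos and online documentation.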
E-SoulDataGroup/spark_streaming_kafka_offset
Using MySQL to store Kafka offsets in SparkStreaming to guarantee zero data loss
LinMingQiang/spark-utils
:boom: :alien: :hotsprings: :rocket: Encapsulated APIs for combining spark with other components for ease of use, e.g. es, hbase, kudu, kafka, mq, etc.
lei-zuquan/java_spark
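Spark 2.x worked examples: code samples in Scala and in Java 1.8 with lambdas. Covers the core Spark technologies SparkCore, SparkSql, and SparkStreaming. Also covers advanced Spark performance tuning, serialization, broadcast variables, data skew, operator optimization, JVM tuning, troubleshooting, and solutions to data skew. Compiled from years of accumulated work experience!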
Captain-SpongeBob/MovieRecommendSystem
A Spark-based movie recommendation system, including offline recommendation based on ALS and LFM, and real-time recommendation
jgperrin/net.jgp.books.spark.ch10
Spark in Action, 2e - chapter 10 - Ingestion through structured streaming
kaiweiang/Simple-DDOS-Attacks-Detector
A Simple Real-Time Detector of DDOS Attacks with Apache Kafka And Spark Streaming
MahsaShk/ApacheSpark
Apache Spark machine learning project using pyspark
keks51/spark-salesforce
spark salesforce connector
alaahgag/Real-Time-Sales-Data-Analysis-Application
A real-time sales data analysis application using Spark Structured Streaming, with Kafka as the messaging system, PostgreSQL as storage for processed data, and Superset for creating a dashboard.
karthikchaganti/Stockafolio-Insight-Project
A Realtime Stock Portfolio Manager built using Apache Distributed Technologies!
neema233/Real_Time_Kafka_Spark_Streaming
A Dockerized Kafka system for streaming server metrics and load balancer logs, with data processing using Spark and storage in a relational database and Hadoop.
elouardyabderrahim/Development-of-a-Real-Time-Movie-Recommendation-Pipeline
The project aims to design and implement a real-time movie recommendation system using the EK Stack (Elasticsearch and Kibana), Kafka, and a personalized recommendation API to enhance the user experience on Jay-Zz Entertainment's streaming platform.
swarna0712/San-Fransisco-Crime-Classification-using-PySpark
Big Data Project - SSML - Spark Streaming for Machine Learning
yucl80/avrodemo
Write and append Avro to an HDFS file
jrphub/jrphub.github.io
Blog on github pages with Jekyll
likemezhoujie/SparkStreaming_splits
This is a Spark project
ludengke95/spark-streaming-kafka-template
A beginner-friendly SparkStreaming template that simplifies SparkStreaming development
MehdiTAZI/Spark-POCS
Full poc on spark 2, Spark RDD, Spark DStream, Spark SQL, Spark Datasets & DataFrames & Spark Structured Streaming [SCALA][SPARK]
paulatumwine/hashtag-stream
Track trending hashtags on Twitter in real time.
SudhansuTaparia/BigData
This is a repository I created to share some of the knowledge I have gained about Big Data technologies, especially Spark, GraphX, etc.
Z-ingdotnet/bigdata_fromscratch
This repository contains files, code, and markdown documents for the "big data from scratch" writings on my blog (z-ing.net)
ZGG2016/data-warehouse
Data warehouse theory and projects (offline and real-time)
camilobetanieto/BigDataComputing
Big data computing tasks conducted with PySpark. The problems involve MapReduce and Streaming algorithms.
deepanshu-yadav/stream_processing_project_s23
This project gets data from the Spotify API, ingests it into Kafka for streaming, and processes it with Spark Streaming. All of this is done on Azure.
urvashiforreal/Retail-Data-Analysis
Developed a real-time streaming analytics pipeline using Apache Spark to calculate and store KPIs for e-commerce sales data, including total volume of sales, orders per minute, rate of return, and average transaction size. Used Spark Streaming to read data from Kafka, Spark SQL to calculate KPIs, and Spark DataFrame to write KPIs to JSON files.
xjtuchuqiwu/sparkML_project
Hands-on sparkML intelligent customer system project: a full set of notes documenting the learning process in detail
24jmwangi/coingecko-streamapp
A streaming app and a dashboard for visualizing cryptocurrency data fetched from the CoinGecko API. The streaming app retrieves real-time cryptocurrency information using Spark Streaming and stores it in a PostgreSQL database.
ASKRAJPUT5/sparkStreaming
This code helps take a close look at the Spark data streaming structure
neilthaker07/BigDataBusinessRuleEngine
BigDataBusinessRuleEngine in Spark, scala, Drools.
Rogelio-Bustamante/ETL_Pipeline_Development_for_Wel_Logs_Analysis
Development and implementation of an ETL pipeline for processing oil well data using Databricks, Delta Live Tables, Spark Structured Streaming and PySpark. The project focused on automating the ingestion, transformation, and validation of large volumes of well log data.
rupeshtr78/blog
Big Data, Spark, Hadoop, Kafka, Flink, Spark Streaming
sanogotech/SparkPriseEnMain
Spark in Action