airscholar/RealtimeStreamingEngineering
This project is a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP sockets, Apache Spark, an OpenAI LLM, Kafka, and Elasticsearch. It covers each stage: data acquisition, processing, sentiment analysis with ChatGPT, producing to a Kafka topic, and connecting to Elasticsearch.
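As a rough illustration of the data acquisition stage only, the sketch below streams newline-delimited JSON records over a TCP socket and reads them back using nothing but the Python standard library. It is not taken from the repository: the function names `serve_records` and `read_records` and the record fields `review_id` and `text` are hypothetical, and the Spark, OpenAI, Kafka, and Elasticsearch stages are left out.

```python
import json
import socket
import threading


def serve_records(records, host="127.0.0.1", port=0):
    """Start a one-shot TCP server that sends each record as one JSON line.

    Hypothetical helper for illustration. Returns the (host, port) actually
    bound, so port=0 lets the OS pick a free port.
    """
    server = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    server.setsockopt(socket.SOL_SOCKET, socket.SO_REUSEADDR, 1)
    server.bind((host, port))
    server.listen(1)

    def handle():
        # Accept a single client, send every record, then shut down.
        conn, _ = server.accept()
        with conn:
            for record in records:
                conn.sendall((json.dumps(record) + "\n").encode("utf-8"))
        server.close()

    threading.Thread(target=handle, daemon=True).start()
    return server.getsockname()


def read_records(host, port):
    """Connect to the server and yield decoded records until it closes."""
    with socket.create_connection((host, port)) as client:
        with client.makefile("r", encoding="utf-8") as stream:
            for line in stream:
                if line.strip():
                    yield json.loads(line)


if __name__ == "__main__":
    # Hypothetical sample data; a real source would be a dataset or live feed.
    sample = [
        {"review_id": 1, "text": "Great food"},
        {"review_id": 2, "text": "Slow service"},
    ]
    address = serve_records(sample)
    for rec in read_records(*address):
        print(rec)
```

Newline-delimited JSON is used here because a line-oriented stream is easy for a downstream consumer to split into records; that framing is an assumption of this sketch, not something the description specifies.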
Python
Stargazers
- akarce (Istanbul)
- alibaghdadi1368 (Dotin)
- Artod (Montréal)
- behnamyazdan (Iran, Shiraz)
- dimuspav
- duongvgm (VGM.AI - Khoa học dữ liệu không gian)
- enessoztrk (KoçSistem)
- gaurichaudhari9 (Indiana University Bloomington)
- judeleonard (Data2Bots)
- kmlspktaa (Oxford, UK)
- lucas-sfernandes (Florianópolis)
- mediumhust
- mvandermeulen (Fivenynes)
- namanngala
- nishkersh (University Institute Of Engineering And Technology, Panjab University)
- OckJuWon0831 (University of Nottingham)
- P13LIAM
- puneetgani (Bangalore)
- Salamaleko
- simonazy (Boston, USA)
- TawfikYasser (Nile University)
- thanhmcisaiMQ ICT Solutions
- tuanpa2295 (Hanoi)
- tuanpham12215