Kafka Twitter Integration
Problem Statement:
A user wants to get a tragetted tweet (say bitcoin) from Twitter and do the data mining using elastic search.
Data mining includes how many times a tweet is reshared or liked or commented.
- Kafka : 7.10.2 with Multicluster + Replication setup(Active <—>Active) using MirrorMaker
- Kafka Connect for scalable and reliably streaming data between Apache Kafka® and other data systems
- Kafka Streams for transforming data from Kafka Topic to another topic in Real Time
- Debezium for Change Data Capture (CDC)- https://lnkd.in/eyHHdZs
- Kafka Schema Registry for establishing Contracts between Producers and Consumers.
- Kafka Producer using Twitter (https://lnkd.in/ewhDfdN)
- Kafka Consumer and storing data in Elastic Search using https://Bonsai.io