JustKshitijD/Print_Tweets_of_last_10_minutes_of_NY_and_CA
This repo has the code for a project we got in our Cloud Computing Course at the Department of Computer Science at IIT Madras. It has code for spinning 2 ec2 instances on Amazon Cloud, detect tweets on Corona coming from NY and CA for the last 10 minutes; feed these tweets to a kafka producer; apply Spark Streaming on these tweets; run the FP-Growth Algorithm on this streamed data; make an online interface having buttons for choosing CA or NY; and printing the most frequent words and number of tweets in the tweets of that respective state selected
Scala