Live Twitter stream analysis using Spark Streaming
Please refer to the setup file.
Server.ipynb is an IPython notebook that, when run, streams live tweets over a socket on port 9994. It emits geo-located tweets from across the USA.
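To make the server's role concrete, here is a minimal sketch of pushing geo-located tweets over a socket as one JSON object per line. It is not the notebook's code. The names `serve_tweets`, `in_bbox`, and `USA_BBOX` are hypothetical, the Twitter Streaming API client is replaced by a plain iterable of dicts, the tweet shape (`text` plus `[longitude, latitude]` under `coordinates`) is simplified, and the bounding box is only an approximation of the contiguous USA. Only the port number 9994 comes from the description above.

```python
import json
import socket

# Approximate bounding box for the contiguous USA (an assumption, not taken
# from the notebook): west longitude, south latitude, east longitude, north latitude.
USA_BBOX = (-125.0, 24.0, -66.0, 50.0)


def in_bbox(lon, lat, bbox=USA_BBOX):
    """Return True if the point lies inside the bounding box."""
    west, south, east, north = bbox
    return west <= lon <= east and south <= lat <= north


def serve_tweets(tweets, host="localhost", port=9994, ready=None):
    """Send tweets located inside the bounding box to the first client that connects.

    `tweets` is any iterable of dicts; in the real notebook it would be fed by
    the Twitter Streaming API. `ready` is an optional threading.Event that is
    set once the server is listening.
    """
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as server:
        server.setsockopt(socket.SOL_SOCKET, socket.SO_REUSEADDR, 1)
        server.bind((host, port))
        server.listen(1)
        if ready is not None:
            ready.set()
        conn, _addr = server.accept()
        with conn:
            for tweet in tweets:
                lon, lat = tweet["coordinates"]
                if in_bbox(lon, lat):
                    # One JSON document per line, so a line-based reader can split them.
                    conn.sendall((json.dumps(tweet) + "\n").encode("utf-8"))
```

Newline-delimited JSON is chosen here because a socket text stream in Spark Streaming splits its input on newlines, so each tweet arrives as one record.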
We collect tweets with the Twitter Streaming API in Python and forward them to a socket. Spark Streaming then reads the tweets from that socket as a socket stream. We push the analyzed data to an Elasticsearch index that has a predefined mapping. Kibana reads the indexed data and renders the visual analyses.
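The Spark side of this pipeline can be sketched roughly as follows. This is an illustration under stated assumptions, not the project's actual job. The helper names (`parse_tweet`, `analyze`, `to_bulk_body`, `send_partition`, `run`), the index name `tweets`, the Elasticsearch URL `http://localhost:9200`, the 10-second batch interval, and the hashtag extraction standing in for the real analysis are all hypothetical. Tweets are assumed to arrive as one JSON object per line with `text` and `[longitude, latitude]` under `coordinates`. The Spark calls use the DStream API from `pyspark.streaming`, and documents are sent through Elasticsearch's `_bulk` endpoint.

```python
import json
import urllib.request


def parse_tweet(line):
    """Parse one JSON line from the socket; return None for unusable input."""
    try:
        tweet = json.loads(line)
    except ValueError:
        return None
    if not isinstance(tweet, dict) or "text" not in tweet or "coordinates" not in tweet:
        return None
    return tweet


def analyze(tweet):
    """Placeholder analysis (an assumption): extract hashtags and the location."""
    lon, lat = tweet["coordinates"]
    hashtags = [
        word.lstrip("#").lower()
        for word in tweet["text"].split()
        if word.startswith("#") and len(word) > 1
    ]
    # An object with "lat" and "lon" keys is one accepted form of an Elasticsearch geo_point.
    return {"text": tweet["text"], "location": {"lat": lat, "lon": lon}, "hashtags": hashtags}


def to_bulk_body(docs, index="tweets"):
    """Build the newline-delimited JSON body for the Elasticsearch _bulk endpoint."""
    lines = []
    for doc in docs:
        lines.append(json.dumps({"index": {"_index": index}}))
        lines.append(json.dumps(doc))
    return ("\n".join(lines) + "\n") if lines else ""


def send_partition(docs, es_url="http://localhost:9200", index="tweets"):
    """Index one partition of analyzed tweets with a single bulk request."""
    body = to_bulk_body(list(docs), index)
    if not body:
        return
    request = urllib.request.Request(
        es_url + "/_bulk",
        data=body.encode("utf-8"),
        headers={"Content-Type": "application/x-ndjson"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        response.read()


def run(host="localhost", port=9994, batch_seconds=10):
    """Read tweets from the socket, analyze them, and index each micro-batch."""
    # Imported here so the helpers above can be used without Spark installed.
    from pyspark import SparkContext
    from pyspark.streaming import StreamingContext

    sc = SparkContext(appName="TwitterStreamAnalysis")
    ssc = StreamingContext(sc, batch_seconds)
    docs = (
        ssc.socketTextStream(host, port)
        .map(parse_tweet)
        .filter(lambda tweet: tweet is not None)
        .map(analyze)
    )
    docs.foreachRDD(lambda rdd: rdd.foreachPartition(send_partition))
    ssc.start()
    ssc.awaitTermination()
```

Calling `run()` starts the streaming job once the socket server is listening on port 9994. For Kibana to plot tweets on a map, the index mapping would need to declare `location` as a `geo_point` field; the field name is this sketch's assumption, since the actual mapping is not given here.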