Twitter-clustering

In this project, we collect tweets from the Twitter API and use a sentiment algorithm to divide users into two groups: positive and negative. We then use Spark to run k-means clustering within each of these two groups, based on the accounts each user follows. Finally, we calculate the TF-IDF of each user's tweets and run k-means clustering on the resulting TF-IDF vectors.
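
The last step (TF-IDF followed by k-means) can be illustrated with a minimal sketch. This is not the project's code: it uses plain Python instead of Spark so it runs without a cluster, the function names `tfidf_vectors` and `kmeans` are hypothetical, the sample tweets are made up, and the smoothed IDF formula is one common variant that the project may or may not use.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return (vocabulary, TF-IDF vectors) for whitespace-tokenized documents."""
    tokenized = [doc.lower().split() for doc in docs]
    vocab = sorted({tok for toks in tokenized for tok in toks})
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(tok for toks in tokenized for tok in set(toks))
    # Smoothed IDF (an assumption; other variants exist).
    idf = {t: math.log((1 + n_docs) / (1 + df[t])) + 1.0 for t in vocab}
    vectors = []
    for toks in tokenized:
        counts = Counter(toks)
        total = len(toks) or 1
        vectors.append([counts[t] / total * idf[t] for t in vocab])
    return vocab, vectors


def squared_distance(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, k, iterations=20):
    """Lloyd's algorithm; the first k points serve as initial centroids."""
    centroids = [list(p) for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        labels = [
            min(range(k), key=lambda c: squared_distance(p, centroids[c]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


# Made-up sample tweets, one string per user (hypothetical data).
tweets = [
    "spark cluster data spark",
    "football match goal football",
    "data spark cluster pipeline",
    "goal match football team",
]

vocab, vectors = tfidf_vectors(tweets)
labels, centroids = kmeans(vectors, k=2)
print(labels)
```

Taking the first `k` points as initial centroids keeps the sketch deterministic; a real run would more likely use random or k-means++ initialization, as Spark's k-means implementations do by default.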

Primary Language: Python

This repository is not active