tweet-classification: A Jupyter Notebook repository from HermawanHermawan

PROJECT DESCRIPTION

Let's apply our natural language processing knowledge to Twitter. Tweets are notoriously difficult, as they are shorter than most texts and usually have hard-to-parse content like hashtags, mentions, links and emoji.

Despite the difficulties, tweets are fun content, so in this notebook we'll take a look at classifying two prominent North American politicians. Can we determine if it is Donald Trump or Justin Trudeau based on just a tweet? Let's see!

PROJECT TASK

Tweet classification: Trump vs. Trudeau
Transforming our collected data
Vectorize the tweets
Training a multinomial naive Bayes model
Evaluating our model using a confusion matrix
Trying out another classifier: Linear SVC
Introspecting our top model
Writing a Trump or Trudeau tweet

HermawanHermawan/tweet-classification