PROJECT DESCRIPTION
Let's apply our natural language processing knowledge to Twitter. Tweets are notoriously difficult, as they are shorter than most texts and usually have hard-to-parse content like hashtags, mentions, links and emoji.
Despite the difficulties, tweets are fun content, so in this notebook we'll take a look at classifying two prominent North American politicians. Can we determine if it is Donald Trump or Justin Trudeau based on just a tweet? Let's see!
PROJECT TASK
- Tweet classification: Trump vs. Trudeau
- Transforming our collected data
- Vectorize the tweets
- Training a multinomial naive Bayes model
- Evaluating our model using a confusion matrix
- Trying out another classifier: Linear SVC
- Introspecting our top model
- Writing a Trump or Trudeau tweet