Distinguishing between fake and real news by using text mining algorithm
I used text mining algorithm to distinguish fake news by counting word frequency. Preporcessing:
- removing pronouns
- removing word "reuters"
- lemmatizing (stopwords)
- word frequency
Machine learning algorithms
- Logistic Regression
- Naive Bayes
- Random Forest
- SVM
Data visualization
- Histogram
- Word Cloud