/NLP_Pytorch

NLP - CNN & LSTM with PyTorch - Fake news detection

Primary LanguageJupyter Notebook

Project - Machine Learning for Natural Language Procesing

Authors : Bastien Billiot and Rémy Deshayes

Machine Learning for Natural Language Processing 2021 module @ ENSAE 3rd year semester 2.

In this project we focus on fake news and their significant impact on various aspects of our society, let it be damaging someone’s reputation, create controversy and even manipulate political outcomes. With the rise of false or misleading information presented as genuine news, all news publishers, news aggregators and social media platform tried to curtail the phenomena by introducing automatic text classification techniques.

We first explore the ISOT dataset (S. Saad, H. Ahmed, I. Traore. 2018) and then train a LSTM and a CNN with PyTorch to address the above-mentionned binary fake news detection task. Lastly, leveraging the very high precision and recall yielded, we discuss the results of our model in the light of crucial topics such as censorship and information overflow.

Datasets can be found here
Please have a look at our report

References

  1. S. Saad, H. Ahmed, I. Traore. 2018. Detecting opinion spams and fake news using text classification. Journal of Security and Privacy, Volume 1, Issue 1.B.
  2. Kleinberg, A. Lefevre, N. Mihalcea, V. Perez-Rosas.2018. Fake news detector algorithm works betterthan a human. 27th International Conference on Computational Linguistics.
  3. E. Andrews. 2019. How fake news spreads like a realvirus. Stanford Engineering Magazine.
  4. J. Carrie Wong. 2020. Russian agency created fakeleftwing news outlet with fictional editors.TheGuardian UK.