/toxic_twitter

Topic modelling and text classification analysis for 'Toxicity in Twitter' research

Primary LanguagePythonGNU General Public License v3.0GPL-3.0

toxic_twitter

Topic modelling and text classification analysis for 'Toxicity in Twitter' research

The folder contains:

  • final thesis text
  • classified data that was used for model analysis
  • code for Latent Dirichlet Allocation topic modeling
  • code for text classification based on manual sorting of tweets