A Gated Recurrent Neural Network for Supervised Text Classification: detecting hate speech from different online textual genres.
This project was developed for the βHuman Language Technologiesβ course of Professor Giuseppe Attardi.
BiBiNET
βββ src
β βββ data_import.py # process 1/6
β βββ data_prep.py # process 2/6
β βββ preproc.py # process 3/6
β βββ transform.py # process 4/6
β βββ classifiers.py # process 5/6
β βββ test.py # process 6/6
β βββ utilities.py
β βββ main.py # file to run
βββ data
β βββ forum_data
β β βββ all_files.csv # text
β β βββ annotations_metadata.csv # labels
β βββ twitter_1
β β βββ twitter_dataset.csv
β βββ twitter_2
β β βββ train.csv
β β βββ test.csv
β βββ wikipedia_data
β βββ train.csv
β βββ test.csv
βββ glove
β βββ glove.twitter.27B.100d
β βββ glove.twitter.27B.200d
βββ model
β βββ model.h5 # final model
βββ requirements.txt
βββ report.pdf # project guide
βββ README.md
βββ LICENSE
Install Python:
sudo apt install python3
Install pip:
sudo apt install --upgrade python3-pip
Install requirements:
python -m pip install --requirement requirements.txt
Execute main
cd src/
python main.py
Dr. Diletta Goglia
Postgraduate Student in MSc in Artificial Intelligence
Computer Science department, University of Pisa, Italy
d.goglia@studenti.unipi.it
dilettagoglia.netlify.app