/Unintended-Bias-in-Toxicity-Classification

The repo contains notebooks for the Jigsaw Unintend Bias in Toxicity Classification contest hosted on Kaggle

Primary LanguageJupyter NotebookMIT LicenseMIT

Unintended-Bias-in-Toxicity-Classification

The repo contains notebooks for the Jigsaw Unintend Bias in Toxicity Classification contest hosted on Kaggle

Score 1

Score 2

Methodology

Both the attempts use GloVe embeddings. However Attempt2 uses a custom loss function but the AUC-ROC score was less than Attempt1. Also BERT embedding mostly would have givven a superior result, however it wasn't possible to run it on Google Colab as the ram limit exceeded everytime. An alternative was to train on only 40% of the dataset using BERT but we could lose some valuable information.