Toxicity Detection using Scikit-Learn

Project members

All data cleaning and feature engineering steps applied to the dataset are in the master branch.
Download the dataset from https://www.kaggle.com/c/jigsaw-toxic-comment-classification-challenge/data
For deployment we use Heroku with a CI/CD process. The configuration can be found in the backend directory.
Process.ipynb is our final Jupyter notebook, which brings all the processing steps together in one place. It can be found in the backend folder.
All other files in the main branch are individual files, since each of us worked with our own approach.
Each member's individual work can be found in a separate branch named after that group member.
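
For readers who want a feel for the cleaning step before opening the notebook, here is a minimal sketch of what comment cleaning could look like. It is not taken from Process.ipynb: the function name clean_comment and the specific rules (lowercasing, dropping URLs, keeping only letters) are assumptions made for illustration, and the actual steps in the notebook may differ.

```python
import re

# Hypothetical helper for illustration; not the code used in Process.ipynb.
_URL_RE = re.compile(r"https?://\S+|www\.\S+")
_NON_LETTER_RE = re.compile(r"[^a-z\s]")
_WHITESPACE_RE = re.compile(r"\s+")


def clean_comment(text: str) -> str:
    """Normalize a raw comment before feature extraction."""
    text = text.lower()                      # case-fold so "SO" and "so" match
    text = _URL_RE.sub(" ", text)            # drop links
    text = _NON_LETTER_RE.sub(" ", text)     # drop digits and punctuation
    return _WHITESPACE_RE.sub(" ", text).strip()  # collapse extra whitespace


if __name__ == "__main__":
    print(clean_comment("You are SO wrong!!! See http://example.com for proof."))
```

The cleaned strings could then be passed to a scikit-learn vectorizer such as TfidfVectorizer to build features for a classifier.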