Did you offend me? Classification of Offensive Tweets in Hinglish Language
The repo contains the Hinglish profanity data for the paper titled "Did you offend me? Classification of Offensive Tweets in Hinglish Language", accepted at ALW2 Workshop at EMNLP 2018. Since the data corresponds to publicly available tweets of offensive nature. Twitter's policies forbade us from sharing the HOT tweet dataset publicly without author consent. Kindly contact the primary author for access to the data resource.
@InProceedings{W18-5118, author = "Mathur, Puneet and Sawhney, Ramit and Ayyar, Meghna and Shah, Rajiv", title = "Did you offend me? Classification of Offensive Tweets in Hinglish Language", booktitle = "Proceedings of the 2nd Workshop on Abusive Language Online (ALW2)", year = "2018", publisher = "Association for Computational Linguistics", pages = "138--148", location = "Brussels, Belgium", url = "http://aclweb.org/anthology/W18-5118" }
Please cite the paper if you use any datasets for research. Every attempt has been taken to protect the identity of the Twitter users mentioned in the tweet datasets by modifying it appropriately. The dataset is strictly for research purposes and any attempt to violate the privacy of the Twitter users mentioned knowingly or unknowingly will not be liable to the authors of the paper or repository.