Pinned Repositories
hasoc-fire-2020
Bert for multilingual hate speech and offensive content identification on english, german and hindi datasets
Hate-Speech-Detection-on-Code-Mixed-Dataset-using-a-Fusion-of-Custom-and-Pre-Trained-models-with-Pro
With the increase in user-generated content on social media networks, hate speech and offensive language content are also increasing. From the perspective of computer science, automatic detection of such hate speech and offensive language content is an interesting problem to solve. The natural language community has taken a step to identify such content via automated hate speech and offensive content detection. The hate speech content is generated mostly on social media, and automatic hate speech and offensive language detection face many challenges due to non-standard spelling and grammar variations. Specifically, in a multilingual community, the hate content would be in code-mixed form, making the task further challenging. In this article, we propose a model for code-mixed hate speech detection. This model embeds the knowledge from both user-trained and multilingual pre-trained models. The proposed method also calculates the profanity word list and augments it. Experimental results on code-mixed hate speech and offensive language detection benchmarks show that our method outperforms the existing baselines.
Code-Mixed-TOD-Medical-Dataset
Basic-DS
CMMT
Gated conv neural network for code-mixed machine translation
Extracting-a-Challenging-Dataset-from-Dravidian-Code-Mixed-Hate-Speech-Detection-Models
Contains the implementation of paper "A Method for Extracting Challenging Dataset for Deep Study of Dravidian Code-Mixed Automated Hate Speech Detection Models"
fire-2020-Dravidian-CodeMix
A sentiment analysis shared task on the Tamil and Malayalam code-mixed text
Gaussian-Hidden-Markov-Model
nlp
online-hate-speech-recog
An online hate speech recognition system.
suman101112's Repositories
suman101112/Code-Mixed-TOD-Medical-Dataset
suman101112/Extracting-a-Challenging-Dataset-from-Dravidian-Code-Mixed-Hate-Speech-Detection-Models
Contains the implementation of paper "A Method for Extracting Challenging Dataset for Deep Study of Dravidian Code-Mixed Automated Hate Speech Detection Models"
suman101112/Hate-Speech-Detection-on-Code-Mixed-Dataset-using-a-Fusion-of-Custom-and-Pre-Trained-models-with-Pro
With the increase in user-generated content on social media networks, hate speech and offensive language content are also increasing. From the perspective of computer science, automatic detection of such hate speech and offensive language content is an interesting problem to solve. The natural language community has taken a step to identify such content via automated hate speech and offensive content detection. The hate speech content is generated mostly on social media, and automatic hate speech and offensive language detection face many challenges due to non-standard spelling and grammar variations. Specifically, in a multilingual community, the hate content would be in code-mixed form, making the task further challenging. In this article, we propose a model for code-mixed hate speech detection. This model embeds the knowledge from both user-trained and multilingual pre-trained models. The proposed method also calculates the profanity word list and augments it. Experimental results on code-mixed hate speech and offensive language detection benchmarks show that our method outperforms the existing baselines.
suman101112/Basic-DS
suman101112/CMMT
Gated conv neural network for code-mixed machine translation
suman101112/techdofication-with-bert-cnn
technical domain identification with multilingual bert and convolutional models.
suman101112/hasoc-fire-2020
Bert for multilingual hate speech and offensive content identification on english, german and hindi datasets
suman101112/fire-2020-Dravidian-CodeMix
A sentiment analysis shared task on the Tamil and Malayalam code-mixed text
suman101112/online-hate-speech-recog
An online hate speech recognition system.
suman101112/pytorch-seq2seq
Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.
suman101112/Gaussian-Hidden-Markov-Model
suman101112/project
project
suman101112/nlp