/reddit-br-toxicity-dataset

This repository makes available a new dataset for toxicity detection in Brazilian Portuguese from the work accepted by the 16th International Conference on Computational Processing of Portuguese (PROPOR 2024). The data collected is from the most popular Brazilian subreddits in 2022.

MIT LicenseMIT

Stargazers