This is the data repository for the dataset described in EMO-KNOW: A Large Scale Dataset on Emotion and Emotion-cause.
EMO-KNOW is a dataset of 700k tweets labeled with emotions and emotion causes. The emotion labels are extracted from users' own words, preserving their authenticity and ecological validity. To access the data, please fill out this form.
❗ We are planning to release a bigger and better version of EMO-KNOW (~3M tweets)! Please stay tuned!
🌟 If you find this dataset helpful, please give us a star 😊 We'll be really happy :)
@inproceedings{emo-know-huongnguyen-2023,
title = "{EMO}-{KNOW}: A Large Scale Dataset on Emotion-Cause",
author = "Nguyen, Mia Huong and
Samaradivakara, Yasith and
Sasikumar, Prasanth and
Gupta, Chitralekha and
Nanayakkara, Suranga",
booktitle = "Findings of the Association for Computational Linguistics: EMNLP 2023",
month = dec,
year = "2023",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2023.findings-emnlp.737",
doi = "10.18653/v1/2023.findings-emnlp.737",
}