EMO-KNOW: A Large Scale Dataset on Emotion and Emotion-cause

This is the data repo for the dataset described in EMO-KNOW: A Large Scale Dataset on Emotion and Emotion-cause

EMO-KNOW is a dataset of 700k tweets with labeled emotions and emotion causes. The emotion labels are extracted from users' own words, reserving the authenticity and ecological validity. To access the data, please fill out this form

NEWS:

❗ We are planning to release a bigger and better version of EMO-KNOW (~3M tweet!) ! Please stay tuned!

🌟 If you find this dataset helpful, please give us a star 😊 We'll be really happy :)

Emotion Distrition in EMO-KNOW

@inproceedings{emo-know-huongnguyen-2023,
    title = "{EMO}-{KNOW}: A Large Scale Dataset on Emotion-Cause",
    author = "Nguyen, Mia Huong  and
      Samaradivakara, Yasith  and
      Sasikumar, Prasanth  and
      Gupta, Chitralekha  and
      Nanayakkara, Suranga",
    booktitle = "Findings of the Association for Computational Linguistics: EMNLP 2023",
    month = dec,
    year = "2023",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2023.findings-emnlp.737",
    doi = "10.18653/v1/2023.findings-emnlp.737",
}