Meu Querido Diário or MQD (http://www.meuqueridodiario.com.br/) is a social network where users write texts related to their everyday experiences, like records in a personal diary that can contain thoughts, experiences or their feelings. This dataset contains 32,244 posts associated with happiness class and 26,922 with sadness class. We collected a total of 59,166 posts with emotions happiness and sadness associated, randomly selected from users in the age between 13 and 99, without gender distinction. Caution was taken to assure identifying information was stripped, such that it cannot be linked to the person who wrote it.
When using this dataset for academic purposes, please cite our article:
@INPROCEEDINGS{
AUTHOR="Flavio Carvalho and Gabriel dos Santos and Gustavo Paiva Guedes",
TITLE="AffectPT-br: an Affective Lexicon based on LIWC 2015",
BOOKTITLE="37th International Conference of the Chilean Computer Science Society (SCCC 2018)",
ADDRESS="University Andres Bello, Campus Antonio Varas, Santiago – Chile.",
DAYS="05-09",
MONTH="nov",
YEAR="2018",
ABSTRACT="Emotion detection is crucial in Human-Computer Interaction (HCI). Computers can never respond to individuals affective state if it cannot identify its emotion. Categorical approaches are commonly used for emotion detection in texts. It considers training models based on a set of affective states. Categorical approaches are often based on a dictionary that contains words associated with emotional categories. In this scenario, LIWC dictionaries have been widely used in psychology and linguistics. There are dictionaries in many languages, including Brazilian Portuguese (LIWC2007pt). This work focuses on the development of AffectPT-br, a new Brazilian Portuguese affective dictionary based on the LIWC 2015 English dictionary. We produced two text classification experiments with real datasets from social networks in order to compare AffectPT-br with LIWC2007pt. Results indicate AffectPT-br outperforms LIWC2007pt in the classification task with all classification algorithms we adopted.",
KEYWORDS="Natural Language Processing, Emotion Detection, Human Computer Interaction, LIWC",
}