This repository contains an analysis of a WhatsApp group chat using exploratory data analysis, sentiment analysis, and topic modeling.
The analysis was originally presented in the following article:
https://medium.com/@barklight/cracking-the-conversation-973839be5b88
The chat data used in this analysis is stored in whatsapp_chat.txt.
Please note that the messages have been encrypted for privacy reasons.
The analysis relies on the following lexicons:
- ID-Stopwords - Used to remove common words that do not contribute to the overall meaning of the messages.
- InSet (Indonesia Sentiment Lexicon) - Used to perform sentiment analysis on the messages.
- Kamus Alay (Colloquial Indonesian Lexicon) - Used to identify and clean up slang words in the messages.
All the lexicons used in this analysis are specifically designed for handling natural language processing (NLP) tasks in Bahasa Indonesia.
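
As a rough illustration of how the three lexicons can fit together, the sketch below normalizes slang, drops stopwords, and then sums per-word sentiment weights. It is a minimal sketch and not the code used in the article: the tiny dictionaries are made-up stand-ins for the real lexicon files, the weights are invented for the example, and the function names (`tokenize`, `normalize_slang`, `remove_stopwords`, `sentiment_score`, `analyze`) are hypothetical.

```python
import re

# Toy stand-ins for the real lexicons (assumptions for illustration only).
SLANG = {"gk": "tidak", "bgt": "banget", "yg": "yang"}   # Kamus Alay style: slang -> standard form
STOPWORDS = {"yang", "di", "dan", "ini", "itu"}          # ID-Stopwords style: words to drop
SENTIMENT = {"bagus": 3, "senang": 4, "buruk": -4}       # InSet style: word -> weight (invented values)


def tokenize(text):
    """Lowercase the message and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def normalize_slang(tokens):
    """Replace each slang token with its standard form when one is known."""
    return [SLANG.get(token, token) for token in tokens]


def remove_stopwords(tokens):
    """Drop tokens that appear in the stopword list."""
    return [token for token in tokens if token not in STOPWORDS]


def sentiment_score(tokens):
    """Sum the lexicon weights of all tokens; unknown words count as 0."""
    return sum(SENTIMENT.get(token, 0) for token in tokens)


def analyze(message):
    """Return the cleaned tokens and the overall sentiment score of a message."""
    tokens = remove_stopwords(normalize_slang(tokenize(message)))
    return tokens, sentiment_score(tokens)


if __name__ == "__main__":
    print(analyze("Filmnya bagus bgt yg ini"))
```

Slang is normalized before stopword removal so that a slang form such as "yg" is first mapped to "yang" and can then be matched against the stopword list.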