Exploratory data analysis for approximately 600 Simpsons episodes and scripts, topic modeling and text generation.
- Identify most on-screen characters
- Group the most common words for each character (WordClouds)
- Vectorizing
- Bag-Of-Words
- TF-IDF
- Word-Vector (Google's Word2Vec)
- Sentiment Analysis
- Topic Modeling
- Best & worst episode