/PySpark-Social-Media-sentiment-analysis

A project for performing sentiment analysis on social media data with PySpark. It uses the Sentiment140 dataset from Kaggle, which contains 1.6 million tweets annotated with sentiment polarity (positive, negative, or neutral). It explores data processing steps such as data cleaning, tokenization, stopword removal, and feature extraction.
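
The four processing steps named above can be sketched in a few lines. This is a minimal illustration in plain Python (standard library only), not the notebook's actual PySpark code: the function names, the tiny stopword list, the cleaning rules, and the 16-bucket feature size are all assumptions chosen for readability. In PySpark, the analogous building blocks are the `Tokenizer`, `StopWordsRemover`, and `HashingTF` transformers in `pyspark.ml.feature`.

```python
import re
import zlib

# Tiny illustrative stopword list (an assumption; real lists are much longer).
STOPWORDS = {"a", "an", "the", "is", "to", "and", "of", "in", "i", "my", "so"}


def clean(text):
    """Data cleaning: lowercase, then drop URLs, @mentions, and non-letters."""
    text = text.lower()
    text = re.sub(r"https?://\S+", " ", text)  # remove URLs
    text = re.sub(r"@\w+", " ", text)          # remove user mentions
    text = re.sub(r"[^a-z\s]", " ", text)      # keep letters and whitespace only
    return re.sub(r"\s+", " ", text).strip()   # collapse repeated whitespace


def tokenize(text):
    """Tokenization: split the cleaned text on whitespace."""
    return text.split()


def remove_stopwords(tokens):
    """Stopword removal: keep only tokens that are not in STOPWORDS."""
    return [t for t in tokens if t not in STOPWORDS]


def hashed_counts(tokens, num_features=16):
    """Feature extraction: hash each token into a fixed-size count vector."""
    vec = [0] * num_features
    for t in tokens:
        # crc32 is deterministic across runs, unlike Python's built-in hash().
        vec[zlib.crc32(t.encode("utf-8")) % num_features] += 1
    return vec


def featurize(tweet, num_features=16):
    """Run all four steps on a single tweet."""
    tokens = remove_stopwords(tokenize(clean(tweet)))
    return hashed_counts(tokens, num_features)
```

Calling `featurize("@bob I LOVE the new phone!! http://t.co/xyz")` cleans the tweet to `i love the new phone`, keeps the tokens `love`, `new`, and `phone`, and returns a 16-element count vector. Hashing keeps the vector size fixed regardless of vocabulary, at the cost of occasional collisions between unrelated tokens.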

Primary Language: Jupyter Notebook
