areegtarek/PySpark-Social-Media-sentiment-analysis
performing sentiment analysis on social media data. The project uses the sentiment140 dataset from Kaggle, which contains 1.6 million tweets annotated with positive, negative, or neutral polarity. The project explores various aspects of data processing, such as data cleaning, tokenization, stopword removal, and feature extraction.
Jupyter Notebook
Stargazers
No one’s star this repository yet.