This project contains all the tools developed for the purposes of my bachelor thesis on Characterizing the Popularity, Virality and Sentimentalism of Video Content Categories on YouTube and Twitter
- Collect YouTube & Twitter data
- Analyze the data
Analysis on how YouTube videos disseminate through social media
This project is divided into 2 main/core services:
- Collector
- Analyzer
TPS: Twitter Public Stream
- Finds new videos from TPS and stores their static data into db.
- Collects tweets that mention YouTube videos through TPS.
- Collects the dynamic information of YouTube videos once per day. Currently is cummulative
- Collects video comments.
- Removes incomplete Videos video comments.
- All data is stored into the following collections:
- Videos
- Tweets
- Comments
- YouTubeKeys
- TwitterApps
- configurations
- Commands are handled via Rest API - Authentication Token Needed
- Sentiment Analysis
- Statistical Analysis
- Videos Information
- Groups Information
- Classification
- Feature Importance Analysis