saulhazelius/Reddit-Demographics-Segmentation
A full-blown ML sentiment analysis app and statistics generator, fed by an automatic ETL pipeline made up of web-crawling spiders deployed over configurable subreddits on Reddit. Only comments are scraped. Tools used: Scrapy, NLTK, Django, SciPy, pandas, MongoDB, and PyTorch. The goal is to get some practice deploying ML models and a better grasp of ETL fundamentals and web scraping. May also eventually build a very rough front end for the generated analytics.
Python
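
As a rough illustration of the extract, transform, load flow described above, here is a minimal standard-library sketch. It is not the project's actual code: the sample comments, the tiny word lexicon, and every function name (`extract`, `transform`, `score`, `load`, `run_pipeline`) are hypothetical. The in-memory list stands in for the Scrapy spiders, the lexicon scorer stands in for the NLTK or PyTorch sentiment model, and the dictionary stands in for MongoDB.

```python
import re
from collections import defaultdict
from statistics import mean

# Hypothetical toy lexicon; a stand-in for the NLTK / PyTorch sentiment model.
POSITIVE = {"good", "great", "love", "nice"}
NEGATIVE = {"bad", "awful", "hate", "terrible"}


def extract():
    """Stand-in for the Scrapy spiders: returns (subreddit, comment) pairs."""
    return [
        ("python", "I love this library, great docs!"),
        ("python", "The install process was awful."),
        ("datascience", "Nice write-up, good examples."),
    ]


def transform(comment):
    """Lowercase a comment and split it into word tokens."""
    return re.findall(r"[a-z']+", comment.lower())


def score(tokens):
    """Return (positive hits - negative hits) / token count, within [-1, 1]."""
    if not tokens:
        return 0.0
    pos = sum(t in POSITIVE for t in tokens)
    neg = sum(t in NEGATIVE for t in tokens)
    return (pos - neg) / len(tokens)


def load(rows):
    """Stand-in for the MongoDB write: group scores by subreddit in memory."""
    store = defaultdict(list)
    for subreddit, value in rows:
        store[subreddit].append(value)
    return store


def run_pipeline():
    """Run extract -> transform -> score -> load, then average per subreddit."""
    rows = [(sub, score(transform(text))) for sub, text in extract()]
    store = load(rows)
    return {sub: mean(values) for sub, values in store.items()}


if __name__ == "__main__":
    for subreddit, avg in run_pipeline().items():
        print(f"r/{subreddit}: mean sentiment {avg:+.3f}")
```

Keeping each stage as a separate function means any stand-in can be swapped for the real tool (a spider, a trained model, a database write) without touching the other stages.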