- Project Question:
- Can computer data modelling predict which of two subreddits a post "is in"?
- Project Answer:
- Not better than people, yet.
For project 3, the goal was two-fold:
- Using PRAW, to collect posts from two subreddits of my choosing.
- then use NLP to train a classifier on which subreddit a given post came from.
- Gathered and prepared data from two subreddits using PRAW,
- First in the example notebook, later via an exported script
- Some data cleaning of pulled posts.
- Created and compared two models.
- Fed both to GridSearchCV
- exploratory data visualization