
What makes a post reach the top of Reddit? What is discussed in various Reddit communities? Data analysis and a machine learning algorithm are used to predict whether a post will make it to the top. Several related questions are explored using live data from Reddit's API.

Primary language: Jupyter Notebook

Reddit

A deeper look into Reddit, one of the most popular websites on the internet, and its various communities, known as subreddits. Which factors help a post reach the top links (those with the highest number of votes)?

Potential factors include the length of the title and body text, the presence of special words and/or characters in the title, and the time of submission, among others.
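As a rough illustration of how such factors could be turned into numeric features, here is a minimal sketch. The function name `extract_features`, the feature names, and the `SPECIAL_WORDS` list are hypothetical choices made for this example and are not taken from the notebook.

```python
import re
from datetime import datetime, timezone

# Hypothetical list of "special" words; the notebook may use a different set.
SPECIAL_WORDS = ("til", "ama", "eli5")


def extract_features(title, body, created_utc):
    """Build a small feature dict from a post's title, body text and UTC timestamp."""
    posted = datetime.fromtimestamp(created_utc, tz=timezone.utc)
    title_words = re.findall(r"\w+", title.lower())
    return {
        "title_chars": len(title),
        "title_words": len(title_words),
        "body_chars": len(body),
        # Any character that is neither a word character nor whitespace.
        "special_char_count": len(re.findall(r"[^\w\s]", title)),
        "has_question_mark": "?" in title,
        "has_special_word": any(word in SPECIAL_WORDS for word in title_words),
        "hour_utc": posted.hour,
        "weekday": posted.weekday(),  # Monday is 0, Sunday is 6
    }
```

Features like these can be collected into a pandas DataFrame, one row per post, and compared between top posts and the rest.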

To fetch the data, I used the praw package to download it from the Reddit API.
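A minimal sketch of what fetching top posts with praw might look like is shown below. The helper names `submission_to_row` and `fetch_top_posts`, the chosen fields, and the placeholder credentials are assumptions for this example; the notebook may collect different fields or use other listings. Running it requires your own Reddit API credentials.

```python
def submission_to_row(submission):
    """Copy the fields of interest from a praw Submission into a plain dict."""
    return {
        "id": submission.id,
        "title": submission.title,
        "selftext": submission.selftext,
        "score": submission.score,
        "num_comments": submission.num_comments,
        "created_utc": submission.created_utc,
    }


def fetch_top_posts(subreddit_name, limit=100, time_filter="week"):
    """Return the top posts of a subreddit as a list of dicts (needs network access)."""
    import praw  # imported here so submission_to_row works without praw installed

    # Placeholder credentials (assumption): replace with your own app's values.
    reddit = praw.Reddit(
        client_id="YOUR_CLIENT_ID",
        client_secret="YOUR_CLIENT_SECRET",
        user_agent="reddit-top-posts-analysis by u/your_username",
    )
    listing = reddit.subreddit(subreddit_name).top(time_filter=time_filter, limit=limit)
    return [submission_to_row(s) for s in listing]


# Example usage (not executed here):
#   import pandas as pd
#   df = pd.DataFrame(fetch_top_posts("askscience", limit=50))
```

Keeping the row conversion separate from the network call makes it possible to check the data shape without contacting the API.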

You can see the report here: Link

Requirements:

  • praw
  • pandas
  • numpy
  • matplotlib
  • datetime (Python standard library)
  • collections (Python standard library)