Scraping Reddit Data with pushshift

The Pushshift API is a project from r/datasets mod Pushshift https://github.com/pushshift/api

  • It allows easy access to the databse of reddit posts
  • allows powerful aggregation over queries

This repo has code for automating push shift requests to get the number of relevant posts for a keyword on a subreddit over granular time periods (e.g month by month).