reddit_mongodb_reconstructor
Takes Reddit comments and Reddit submissions from files.pushshift.io and creates a nested collection out of it.
Assumes the following
- You have downloaded the RC_ and RS_ reddit archives from http://files.pushshift.io
- You have imported the json into a mongo database with a collection for the submissions and a collection for the comments.
This ipython code with reconcile the two creating the comments as nested children of the parent submission.