Pinned Repositories
api
Pushshift API
google_bigquery
imdb_to_json
Fetch movie data from IMDb and output it in JSON format.
Parallel-NDJSON-Reader
Parallel NDJSON Reader for Python
Reddit-Bot-Detector
Script to extract highly probable bots for further analysis
reddit_sse_stream
A Server-Sent Events (SSE) stream that delivers Reddit comments and submissions to a client in near real time.
rinzler
A high performance indexing and search system for managing big data
telegram
Pushshift Telegram Ingest
tiktok
Module to access TikTok Private API
zreader
Read compressed NDJSON .zst files easily
pushshift's Repositories
pushshift/api
Pushshift API
pushshift/telegram
Pushshift Telegram Ingest
pushshift/tiktok
Module to access TikTok Private API
pushshift/reddit_sse_stream
A Server-Sent Events (SSE) stream that delivers Reddit comments and submissions to a client in near real time.
pushshift/zreader
Read compressed NDJSON .zst files easily
pushshift/Parallel-NDJSON-Reader
Parallel NDJSON Reader for Python
pushshift/imdb_to_json
Fetch movie data from IMDb and output it in JSON format.
pushshift/tiktok-scraper
TikTok Scraper. Download video posts, collect user/trend/hashtag/music feed metadata, sign URLs, etc.
pushshift/token_manager
Code to handle multiple Twitter user access tokens when making requests
pushshift/ndjson_processor
High-speed multiprocessing NDJSON processor
pushshift/extract_json_from_html
This script makes it much easier to extract a JSON object from HTML (e.g., getting TikTok data)
pushshift/gab_mastodon
Ingest scripts and Elasticsearch Mapping for Gab's new Mastodon Site
pushshift/US_Election_Data
Code to grab election data from CNN's election data API
pushshift/ap_story_fetcher
Associated Press Story Fetcher
pushshift/feed_seeker
Find RSS, Atom, XML, and RDF feeds on webpages
pushshift/officer_dot_com
Example code to start parsing data from the website officer.com
pushshift/ps_proxy_manager
Pushshift Proxy Manager
pushshift/binary_search
Example of a binary search implementation using real data (Reddit author info)
pushshift/browser_extension_parser
Parser module for Facebook observations returned from the browser extension
pushshift/meetup
Code for ingesting meetup.com streams (comments, photos, etc.)
pushshift/parse_wiki_tables
Simple example of parsing data from Wikipedia tables using selectolax
pushshift/scrape_subreddit_categories
reddit
pushshift/compress
Optimized Go Compression Packages
pushshift/JSON-Flatten
This code shows how to flatten nested keys, which can help convert a nested JSON object into CSV, etc.
pushshift/mediacloud
Media Cloud is an open source, open data platform that allows researchers to answer quantitative questions about the content of online media.
pushshift/optlib
A library for financial options pricing written in Python.
pushshift/selectolax
Python binding to Modest engine (fast HTML5 parser with CSS selectors).
pushshift/slate
Beautiful static documentation for your API
pushshift/socks_manager
pushshift/tiktok-signature
Generate a TikTok signature token using Node.js
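
The pushshift/JSON-Flatten entry above describes flattening nested keys so that a nested JSON object can be written out as CSV. The following is a minimal sketch of that general idea, not the repository's own code: the function name `flatten`, the dot separator, the index-based naming of list elements, and the sample record are all assumptions made for illustration.

```python
import csv
import io


def flatten(obj, parent_key="", sep="."):
    """Flatten nested dicts and lists into a single-level dict.

    Hypothetical helper: nested keys are joined with `sep`, and list
    elements are keyed by their position in the list.
    """
    items = {}
    if isinstance(obj, dict):
        for key, value in obj.items():
            new_key = f"{parent_key}{sep}{key}" if parent_key else str(key)
            items.update(flatten(value, new_key, sep))
    elif isinstance(obj, list):
        for index, value in enumerate(obj):
            new_key = f"{parent_key}{sep}{index}" if parent_key else str(index)
            items.update(flatten(value, new_key, sep))
    else:
        # Scalar leaf: store it under the accumulated key path.
        items[parent_key] = obj
    return items


# Made-up sample record, for illustration only.
record = {
    "id": 1,
    "author": {"name": "alice", "flair": {"text": "mod"}},
    "tags": ["a", "b"],
}
flat = flatten(record)

# Write the flattened record as a one-row CSV held in memory.
buffer = io.StringIO()
writer = csv.DictWriter(buffer, fieldnames=list(flat))
writer.writeheader()
writer.writerow(flat)
csv_text = buffer.getvalue()
```

With several records, the CSV header would need to be the union of all flattened keys, since different records may nest different fields.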