smh2019's Stars
swoodford/aws
A collection of bash shell scripts for automating various tasks with Amazon Web Services using the AWS CLI and jq.
apache/gobblin
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.
airbnb/knowledge-repo
A next-generation curated knowledge sharing platform for data scientists and other technical professions.
dmarx/RedditUtilities
A collection of useful tools for scraping Reddit.
dmarx/Topological-Anomaly-Detection
Topological Anomaly Detection (TAD) per Gartley and Basener 2009
shadowmoose/RedditDownloader
Scrapes Reddit to download media of your choice.
azouaoui-cv/SubredditsGraph
Aggregating a bunch of graph theory results on a live dataset based on Reddit's API. Explore subreddit connections through cross-posting.
ByGeorge-/RedditCorpusBuilder
Build a corpus for NLP, etc., from Reddit self posts and comments.
joshsungasong/Project-3-Reddit-Post-Data-Analysis
This project allowed me to hone my web scraping and natural language processing skills by scraping data from Reddit. I collected a range of Reddit post attributes, including the titles, number of upvotes, number of comments, and subreddits associated with each post. I used natural language processing to create features from the scraped data, then fed them into a Random Forest classifier and a Logistic Regression model to identify what goes into a popular Reddit post. The project involved a real-life scenario with the FiveThirtyEight team as the client.
ansin218/reddit-comments-data-analysis
Analysis of Reddit Comments for Mining Massive Datasets at the Technical University of Munich
devinrewis/SubredditRecommend
Recommends subreddits based on a Word2Vec neural net!
mbc1990/CNN-Image-Classifier
A simple deep neural network, built with Keras, for classifying images
mbc1990/political-bias
Analyzing political bias in subreddits
mbc1990/reddit-ingest
Ingest from the Reddit API
georgetown-analytics/XBUS-503-01.Data_Ingestion_and_Wrangling
Materials for Georgetown Data Science certificate. https://scs.georgetown.edu/programs/375/certificate-in-data-science/
crazyfrogspb/RedditScore
Package for performing Reddit-based text analysis
smh2019/daily
Solving problems from https://www.reddit.com/r/dailyprogrammer
eyee19/SpaceXCorrectionBot
A Reddit bot that looks for instances of "Space X" and replies with "SpaceX". Just a silly side project.
wyattshapiro/reddit_scraper
Scrape Reddit for comments
markditsworth/RedditCommentAnalysis
Network analysis of Reddit comments from Nov 2017, with emphasis on bot accounts.
voussoir/timesearch
The subreddit archiver
ScriptSmith/socialreaper
Social media scraping / data collection library for Facebook, Twitter, Reddit, YouTube, Pinterest, and Tumblr APIs
eriklindernoren/ML-From-Scratch
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.
srijyothsna/nlp
Retrieve PubMed abstracts for a list of PMIDs and cluster them into groups of different diseases
smh2019/PubMed-Data-Miner
PubMed Data mining algorithm
orionmelt/sherlock
Extract user info from their Reddit comments and activity.
mattrjacobs/FpInScalaExercises
Working through the examples in the Functional Programming in Scala book
junyanz/iGAN
Interactive Image Generation via Generative Adversarial Networks
JamesTheHacker/Neuron
Neuron - Electron, ES6, React, PouchDB, Sass, Webpack
gdb/kaggle
A collection of Kaggle solutions. Not very polished.