smh2019

United States

smh2019's Stars

facebookresearch/ParlAI
A framework for training and evaluating AI models on a variety of openly available dialogue datasets.
Language:Python10.5k2.1k
donnemartin/system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Language:Python270k45.6k
fivethirtyeight/russian-troll-tweets
768214
gyglim/dvn
Reference implementation for Structured Prediction with Deep Value Networks
Language:Jupyter Notebook5513
smh2019/probcomp-stack
MIT Probabilistic Computing Project software stack
Language:Shell1
Serene-Arc/bulk-downloader-for-reddit
Downloads and archives content from reddit
Language:Python2.3k211
iamtrask/Grokking-Deep-Learning
this repository accompanies the book "Grokking Deep Learning"
Language:Jupyter Notebook7.4k1.6k
danielecook/Awesome-Bioinformatics
A curated list of awesome Bioinformatics libraries and software.
3.1k598
stepthom/text_mining_resources
Resources for learning about Text Mining and Natural Language Processing
557199
ethen8181/machine-learning
:earth_americas: machine learning tutorials (mainly in Python3)
Language:HTML3.2k649
jayinai/data-science-question-answer
A repo for data science related questions and answers
Language:Jupyter Notebook2.4k658
donnemartin/interactive-coding-challenges
120+ interactive Python coding interview challenges (algorithms and data structures). Includes Anki flashcards.
Language:Python29.3k4.4k
datopian/bad-data
Examples of bad data, especially from government.
Language:HTML2210
csvsoundsystem/federal-treasury-api
The scraper, parser, and database creation scripts for Financial Management Service daily U.S. Treasury statements.
Language:Python10527
jwasham/coding-interview-university
A complete computer science study plan to become a software engineer.
304k76.4k
pushshift/api
Pushshift API
Language:Python1.3k107
pk026/cuba
There is a continuous stream of user activity events generated from multiple users as they use our mobile Cube app. Objective is to implement a server to ingest these events. The server will expose a http end-point to which the events would be posted. Also the server will contain an admin interface to specify business rules, that alert the operator (an engineer in the Cube Ops team) or trigger an action (like sending an alert sms to the end user), when certain criteria is met.
Language:Python1
nio-blocks/reddit
Polls the Reddit API for the specified subreddit
Language:Python1
kelseyhightower/kubernetes-the-hard-way
Bootstrap Kubernetes the hard way. No scripts.
40.6k13.9k
ks-avinash/aws-lambda-function
Simple code for extracting data from excel sheet and Ingest into AWS S3 bucket
Language:Python42
ytian22/Bike-Share-Demand-Prediction
Predicted Bay Area bike share demand with Spark MLlib and built a pipeline to bridge Amazon S3, MongoDB server, and Spark EC2 cluster for NoSQL data processing.
Language:Jupyter Notebook1
prakhar1989/docker-curriculum
:dolphin: A comprehensive tutorial on getting started with Docker!
Language:SCSS5.6k2.1k
donnemartin/data-science-ipython-notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Language:Python27.1k7.8k
amararyal/Co-Tags
Language:Python2
ekhtiar/swiss-transport-datapipeline
A data pipeline to daily pull public transport data from the opentransportdata.swiss portal. This pipeline has three tasks, pull the right data from opentransportdata.swiss, push the data to s3 for storage, and transform and load the transformed data to a database. Hopefully this repository helps people explain ETL / Batch data pipeline.
Language:Python3
royhobbstn/s3-db
A serverless data processing pipeline to store Census data in AWS S3.
Language:JavaScript21
associatedpress/national-caseload-data-ingest
Scripts to download the U.S. Department of Justice's National Caseload Data and load it into Amazon Athena for querying
Language:Python133
damienmarlier51/Kinesis_Lambda_DynamoDB
Data ingestion on AWS
Language:HCL1
aws-samples/amazon-elasticsearch-lambda-samples
Data ingestion for Amazon Elasticsearch Service from S3 and Amazon Kinesis, using AWS Lambda: Sample code
Language:JavaScript391180
BracketJohn/is-this-an-mlm
Website to tell visitors whether a Company is an MLM
Language:JavaScript71

smh2019

smh2019's Stars

facebookresearch/ParlAI

donnemartin/system-design-primer

fivethirtyeight/russian-troll-tweets

gyglim/dvn

smh2019/probcomp-stack

Serene-Arc/bulk-downloader-for-reddit

iamtrask/Grokking-Deep-Learning

danielecook/Awesome-Bioinformatics

stepthom/text_mining_resources

ethen8181/machine-learning

jayinai/data-science-question-answer

donnemartin/interactive-coding-challenges

datopian/bad-data

csvsoundsystem/federal-treasury-api

jwasham/coding-interview-university

pushshift/api

pk026/cuba

nio-blocks/reddit

kelseyhightower/kubernetes-the-hard-way

ks-avinash/aws-lambda-function

ytian22/Bike-Share-Demand-Prediction

prakhar1989/docker-curriculum

donnemartin/data-science-ipython-notebooks

amararyal/Co-Tags

ekhtiar/swiss-transport-datapipeline

royhobbstn/s3-db

associatedpress/national-caseload-data-ingest

damienmarlier51/Kinesis_Lambda_DynamoDB

aws-samples/amazon-elasticsearch-lambda-samples

BracketJohn/is-this-an-mlm