Pinned Repositories
applied_econometrics
Problem sets, assignments from grad school
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
baselines-A2C
[DEPRECATED] Advantage Actor Critic model in PyTorch inspired by OpenAI baselines TensorFlow implementation
bespoke
carlita
clearnet
Pytorch model explorer
codeForSync
Small proof-of-concept D3 dashboard for testing deployment
simple-A2C-PPO
Actor-critic trained w PPO on OpenAI's Procgen Benchmark (PyTorch). Built from scratch.
transformers-playground
Transformer XL from scratch trained to perfection on toy dataset. PyTorch.
rgilman33's Repositories
rgilman33/simple-A2C-PPO
Actor-critic trained w PPO on OpenAI's Procgen Benchmark (PyTorch). Built from scratch.
rgilman33/baselines-A2C
[DEPRECATED] Advantage Actor Critic model in PyTorch inspired by OpenAI baselines TensorFlow implementation
rgilman33/transformers-playground
Transformer XL from scratch trained to perfection on toy dataset. PyTorch.
rgilman33/applied_econometrics
Problem sets, assignments from grad school
rgilman33/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
rgilman33/bespoke
rgilman33/carlita
rgilman33/clearnet
Pytorch model explorer
rgilman33/codeForSync
Small proof-of-concept D3 dashboard for testing deployment
rgilman33/django_shell
starter shell for django app
rgilman33/darkspark-app
rgilman33/dendrific
rgilman33/epidemic
Smallpox simulator for grad school
rgilman33/flaskSharepointDemo
rgilman33/hubgit.github.com
Home Page
rgilman33/imdb_sync
Implementing ULMFiT using FastAI
rgilman33/lucent
Lucid library adapted for PyTorch
rgilman33/machineLearning
Random classifiers, documents. Nothing of interest.
rgilman33/nbdev_test
rgilman33/npm_flask_shell
Generic template for npm, flask app
rgilman33/obs-tower
Unity's Obstacle Tower RL Challenge
rgilman33/openpilot
rgilman33/pytorch-a2c-ppo-acktr
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR).
rgilman33/retro-contest-sonic
World Models applied to the Open AI Sonic Retro Contest
rgilman33/rgilman33.github.io
posts
rgilman33/stravachimp
Web app that pulls and analyzes Strava data in interactive dashboard
rgilman33/wm_sync
rgilman33/world-models
Experiments w World Models (Ha 2018)
rgilman33/WorldModelsExperiments
World Models Experiments
rgilman33/xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Flink and DataFlow