Pinned Repositories
chainerrl
ChainerRL is a deep reinforcement learning library built on top of Chainer.
async-rl
Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783)
caffe
Caffe: a fast framework for deep learning. For the most recent version checkout the dev branch. For the latest stable release checkout the master branch.
chainer-cocob
COCOB-Backprop (https://arxiv.org/abs/1705.07795) implementation for Chainer
deep-ensemble-uncertainty
An implementation of "Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles" (http://arxiv.org/abs/1612.01474)
deep-reinforcement-learning-papers
A list of papers and resources dedicated to deep reinforcement learning
dqn-in-the-caffe
An implementation of Deep Q-Network using Caffe
predictron
WIP implementation of "The Predictron: End-To-End Learning and Planning" (http://arxiv.org/abs/1612.08810) in Chainer
SexprParser
This is S-expression parser in C++11. It is aimed at being used in General Game Playing.
pfrl
PFRL: a PyTorch-based deep reinforcement learning library
muupan's Repositories
muupan/deep-reinforcement-learning-papers
A list of papers and resources dedicated to deep reinforcement learning
muupan/async-rl
Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783)
muupan/dqn-in-the-caffe
An implementation of Deep Q-Network using Caffe
muupan/deep-ensemble-uncertainty
An implementation of "Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles" (http://arxiv.org/abs/1612.01474)
muupan/predictron
WIP implementation of "The Predictron: End-To-End Learning and Planning" (http://arxiv.org/abs/1612.08810) in Chainer
muupan/chainer-cocob
COCOB-Backprop (https://arxiv.org/abs/1705.07795) implementation for Chainer
muupan/chainerrl
ChainerRL is a deep reinforcement learning library built on top of Chainer.
muupan/chainer-entropy-adam
Chainer-based implementation of Entropy-Adam https://arxiv.org/abs/1611.01838
muupan/chainer-eve
An Eve optimizer implementation in Chainer
muupan/chainer-oplu
Orthogonal Permuatation Linear Unit (OPLU) https://arxiv.org/abs/1604.02313v3
muupan/chainer-weight-normalization
Weight normalization https://arxiv.org/abs/1602.07868
muupan/gym_torcs
muupan/rl-teacher
Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for collecting human feedback
muupan/bullet3
Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.
muupan/chainer
A flexible framework of neural networks for deep learning
muupan/chainer-yogi
An unofficial implementation of Yogi optimizer in Chainer. See https://papers.nips.cc/paper/8186-adaptive-methods-for-nonconvex-optimization
muupan/cupy
NumPy-like API accelerated with CUDA
muupan/gvgai
This is the framework for the General Video Game Competition - http://www.gvgai.net/
muupan/LC_NGSIM
lane change trajectories extracted from NGSIM
muupan/mujoco-py
MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3.
muupan/muupan.github.io
muupan/NGSIM.jl
A Julia package for handling the Next Generation Simulation (NGSIM) traffic dataset
muupan/pfrl
PFRL: a PyTorch-based deep reinforcement learning library
muupan/pybrain
muupan/pytorch-optimizer
torch-optimizer -- collection of optimizers for Pytorch
muupan/resume
My resume
muupan/self-normalizing-networks
Chainer implementation of Self-Normalizing Networks (SNN)
muupan/slimevolleygym
A simple OpenAI Gym environment for single and multi-agent reinforcement learning
muupan/trl
Train transformer language models with reinforcement learning.
muupan/ViZDoom
Doom-based AI Research Platform for Reinforcement Learning from Raw Visual Information.