muupan

Engineer at @pfnet

@pfnet

Pinned Repositories

chainerrl
ChainerRL is a deep reinforcement learning library built on top of Chainer.
Language:Python1.2k 70 198226
async-rl
Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783)
Language:Python401 29 2883
caffe
Caffe: a fast framework for deep learning. For the most recent version checkout the dev branch. For the latest stable release checkout the master branch.
Language:C++8 8 09
chainer-cocob
COCOB-Backprop (https://arxiv.org/abs/1705.07795) implementation for Chainer
Language:Python6 4 01
deep-ensemble-uncertainty
An implementation of "Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles" (http://arxiv.org/abs/1612.01474)
Language:Jupyter Notebook34 3 17
deep-reinforcement-learning-papers
A list of papers and resources dedicated to deep reinforcement learning
835 112 1187
dqn-in-the-caffe
An implementation of Deep Q-Network using Caffe
Language:C++213 16 20118
predictron
WIP implementation of "The Predictron: End-To-End Learning and Planning" (http://arxiv.org/abs/1612.08810) in Chainer
Language:Python11 6 03
SexprParser
This is S-expression parser in C++11. It is aimed at being used in General Game Playing.
Language:C++3 2 00
pfrl
PFRL: a PyTorch-based deep reinforcement learning library
Language:Python1.2k 91 75157

muupan's Repositories

muupan/deep-reinforcement-learning-papers
A list of papers and resources dedicated to deep reinforcement learning
835 112 1187
muupan/async-rl
Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783)
Language:Python401 29 2883
muupan/dqn-in-the-caffe
An implementation of Deep Q-Network using Caffe
Language:C++213 16 20118
muupan/deep-ensemble-uncertainty
An implementation of "Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles" (http://arxiv.org/abs/1612.01474)
Language:Jupyter Notebook34 3 17
muupan/predictron
WIP implementation of "The Predictron: End-To-End Learning and Planning" (http://arxiv.org/abs/1612.08810) in Chainer
Language:Python11 6 03
muupan/chainer-cocob
COCOB-Backprop (https://arxiv.org/abs/1705.07795) implementation for Chainer
Language:Python6 4 01
muupan/chainerrl
ChainerRL is a deep reinforcement learning library built on top of Chainer.
Language:Python2 2 0
muupan/chainer-entropy-adam
Chainer-based implementation of Entropy-Adam https://arxiv.org/abs/1611.01838
Language:Python1 3 0
muupan/chainer-eve
An Eve optimizer implementation in Chainer
Language:Python1 3 0
muupan/chainer-oplu
Orthogonal Permuatation Linear Unit (OPLU) https://arxiv.org/abs/1604.02313v3
Language:Python1 4 01
muupan/chainer-weight-normalization
Weight normalization https://arxiv.org/abs/1602.07868
Language:Python1 2 1
muupan/gym_torcs
Language:C++1 3 02
muupan/rl-teacher
Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for collecting human feedback
Language:Python1 3 0
muupan/bullet3
Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.
Language:C++2 0
muupan/chainer
A flexible framework of neural networks for deep learning
Language:Python2 0
muupan/chainer-yogi
An unofficial implementation of Yogi optimizer in Chainer. See https://papers.nips.cc/paper/8186-adaptive-methods-for-nonconvex-optimization
Language:Python2 01
muupan/cupy
NumPy-like API accelerated with CUDA
Language:Python2 0
muupan/gvgai
This is the framework for the General Video Game Competition - http://www.gvgai.net/
Language:Java2 01
muupan/LC_NGSIM
lane change trajectories extracted from NGSIM
Language:Matlab2 0
muupan/mujoco-py
MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3.
Language:Python2 0
muupan/muupan.github.io
Language:HTML2 0
muupan/NGSIM.jl
A Julia package for handling the Next Generation Simulation (NGSIM) traffic dataset
Language:Jupyter Notebook2 0
muupan/pfrl
PFRL: a PyTorch-based deep reinforcement learning library
Language:Python2 0
muupan/pybrain
Language:Python2 0
muupan/pytorch-optimizer
torch-optimizer -- collection of optimizers for Pytorch
Language:Python1 0
muupan/resume
My resume
3 1
muupan/self-normalizing-networks
Chainer implementation of Self-Normalizing Networks (SNN)
Language:Python2 0
muupan/slimevolleygym
A simple OpenAI Gym environment for single and multi-agent reinforcement learning
Language:Python1 0
muupan/trl
Train transformer language models with reinforcement learning.
muupan/ViZDoom
Doom-based AI Research Platform for Reinforcement Learning from Raw Visual Information.
Language:C++3 0