Pinned Repositories
A3C_TensorFlow
Asynchronous Methods for Deep Reinforcement Learning
BGAIL
Bayesian Approach to Generative Adversarial Imitation Learning
ConditionalVariationalAutoencoder
CVAE
DistributedTensorFlowExample
asynchoronous learning example working inside localhost
maddpg-rllib
MADDPG in Ray/RLlib
mujoco-py-1.50.1.68
multiagent-gail
multiagent-gail working with multiagent-particle-env-v2 (which was modified by magail authors)
multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
multiagent-particle-envs-maac
multiagent-particle-envs used in MAAC repo
SVGD
TensorFlow Implementation of Stein Variational Gradient Descent (SVGD)
wsjeon's Repositories
wsjeon/maddpg-rllib
MADDPG in Ray/RLlib
wsjeon/multiagent-gail
multiagent-gail working with multiagent-particle-env-v2 (which was modified by magail authors)
wsjeon/BGAIL
Bayesian Approach to Generative Adversarial Imitation Learning
wsjeon/multiagent-particle-envs-maac
multiagent-particle-envs used in MAAC repo
wsjeon/mujoco-py-1.50.1.68
wsjeon/multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
wsjeon/multiagent-particle-envs-v2
Modified multiagent-particle-env used in multi-agent gail
wsjeon/d4rl
A benchmark for offline reinforcement learning.
wsjeon/dm_control
DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
wsjeon/flow
Computational framework for reinforcement learning in traffic control
wsjeon/google-research
Google Research
wsjeon/gym
A toolkit for developing and comparing reinforcement learning algorithms.
wsjeon/handful-of-trials
Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"
wsjeon/KFAC-Pytorch
Pytorch implementation of KFAC and E-KFAC
wsjeon/minimal-mistakes
:triangular_ruler: Jekyll theme for building a personal site, blog, project documentation, or portfolio.
wsjeon/models
Models and examples built with TensorFlow
wsjeon/numba
NumPy aware dynamic Python compiler using LLVM
wsjeon/papers
wsjeon/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
wsjeon/ray
A fast and simple framework for building and running distributed applications.
wsjeon/rllib-tf2
wsjeon/rlpyt
Reinforcement Learning in PyTorch
wsjeon/smac
SMAC: The StarCraft Multi-Agent Challenge
wsjeon/softlearning
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains.
wsjeon/sparsemax-pytorch
Implementation of Sparsemax activation in Pytorch
wsjeon/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
wsjeon/tensorflow-cmake
TensorFlow examples in C, C++, Go and Python without bazel but with cmake and FindTensorFlow.cmake
wsjeon/TensorFlow-Tutorials
텐서플로우를 기초부터 응용까지 단계별로 연습할 수 있는 소스 코드를 제공합니다
wsjeon/travis-ci
an example builder to build a container with Travis CI, and push to a Singularity Registry Server (or other endpoint)
wsjeon/wsjeon.github.io
Jekyll source for my personal blog.