wsjeon

Qualcomm AI Research

Pinned Repositories

A3C_TensorFlow
Asynchronous Methods for Deep Reinforcement Learning
Language:Python1 3 00
BGAIL
Bayesian Approach to Generative Adversarial Imitation Learning
Language:Python8 2 13
ConditionalVariationalAutoencoder
CVAE
Language:Python1 2 00
DistributedTensorFlowExample
Asynchronous learning example that runs on localhost
Language:Python4 2 01
maddpg-rllib
MADDPG in Ray/RLlib
Language:Python48 2 1015
mujoco-py-1.50.1.68
Language:Python2 2 00
multiagent-gail
multiagent-gail working with multiagent-particle-env-v2 (which was modified by magail authors)
Language:Python9 2 03
multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Language:Python2 2 01
multiagent-particle-envs-maac
multiagent-particle-envs used in MAAC repo
Language:Python4 3 01
SVGD
TensorFlow Implementation of Stein Variational Gradient Descent (SVGD)
Language:Python7 3 01

wsjeon's Repositories

wsjeon/maddpg-rllib
MADDPG in Ray/RLlib
Language:Python48 2 1015
wsjeon/multiagent-gail
multiagent-gail working with multiagent-particle-env-v2 (which was modified by magail authors)
Language:Python9 2 03
wsjeon/BGAIL
Bayesian Approach to Generative Adversarial Imitation Learning
Language:Python8 2 13
wsjeon/multiagent-particle-envs-maac
multiagent-particle-envs used in MAAC repo
Language:Python4 3 01
wsjeon/mujoco-py-1.50.1.68
Language:Python2 2 00
wsjeon/multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Language:Python2 2 01
wsjeon/multiagent-particle-envs-v2
Modified multiagent-particle-env used in multi-agent gail
Language:Python1 2 0
wsjeon/d4rl
A benchmark for offline reinforcement learning.
Language:Python1 0
wsjeon/dm_control
DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
Language:Python1 0
wsjeon/flow
Computational framework for reinforcement learning in traffic control
Language:Python1 0
wsjeon/google-research
Google Research
Language:Jupyter Notebook1 0
wsjeon/gym
A toolkit for developing and comparing reinforcement learning algorithms.
Language:Python3 0
wsjeon/handful-of-trials
Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"
Language:Python3 0
wsjeon/KFAC-Pytorch
PyTorch implementation of KFAC and E-KFAC
Language:Python2 0
wsjeon/minimal-mistakes
Jekyll theme for building a personal site, blog, project documentation, or portfolio.
Language:CSS1 0
wsjeon/models
Models and examples built with TensorFlow
Language:Python2 0
wsjeon/numba
NumPy aware dynamic Python compiler using LLVM
Language:Python0 0
wsjeon/papers
2 0
wsjeon/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Language:Python2 0
wsjeon/ray
A fast and simple framework for building and running distributed applications.
Language:Python2 01
wsjeon/rllib-tf2
Language:Python3 0
wsjeon/rlpyt
Reinforcement Learning in PyTorch
Language:Python2 0
wsjeon/smac
SMAC: The StarCraft Multi-Agent Challenge
Language:Python2 0
wsjeon/softlearning
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains.
Language:Python3 0
wsjeon/sparsemax-pytorch
Implementation of Sparsemax activation in PyTorch
Language:Python2 0
wsjeon/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Language:Python1 0
wsjeon/tensorflow-cmake
TensorFlow examples in C, C++, Go and Python without bazel but with cmake and FindTensorFlow.cmake
Language:C++2 0
wsjeon/TensorFlow-Tutorials
Provides source code for practicing TensorFlow step by step, from the basics to applications.
Language:Python2 0
wsjeon/travis-ci
an example builder to build a container with Travis CI, and push to a Singularity Registry Server (or other endpoint)
Language:Shell1 0
wsjeon/wsjeon.github.io
Jekyll source for my personal blog.
Language:SCSS1 0