dkkim93's Stars
shariqiqbal2810/MAAC
Code for "Actor-Attention-Critic for Multi-Agent Reinforcement Learning" ICML 2019
mxgmn/WaveFunctionCollapse
Bitmap & tilemap generation from a single example with the help of ideas from quantum mechanics
alexis-jacq/LOLA_DiCE
Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)
google-deepmind/bsuite
bsuite is a collection of carefully-designed experiments that investigate core capabilities of a reinforcement learning (RL) agent
chingyaoc/pytorch-REINFORCE
PyTorch Implementation of REINFORCE for both discrete & continuous control
jtorde/uav_trajectory_optimizer
Efficient C++ solver to generate trajectories for UAVs
cbschaff/nlimb
lanpa/tensorboardX
tensorboard for pytorch (and chainer, mxnet, numpy, ...)
mit-aera/Blackbird-Dataset
dgriff777/a3c_continuous
A continuous action space version of A3C LSTM in pytorch plus A3G design
openai/multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
sfujim/TD3
Author's PyTorch implementation of TD3 for OpenAI gym tasks
tristandeleu/pytorch-maml-rl
Reinforcement Learning with Model-Agnostic Meta-Learning in Pytorch
cbfinn/maml_rl
Code for RL experiments in "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks"
cbfinn/maml
Code for "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks"
shariqiqbal2810/maddpg-pytorch
PyTorch Implementation of MADDPG (Lowe et. al. 2017)
ShangtongZhang/DeepRL
Modularized Implementation of Deep RL Algorithms in PyTorch
geek-ai/MAgent
A Platform for Many-Agent Reinforcement Learning
keras-rl/keras-rl
Deep Reinforcement Learning for Keras.
tambetm/gym-minecraft
Minecraft environment for Open AI Gym, based on Microsoft's Malmo.
dgriff777/rl_a3c_pytorch
A3C LSTM Atari with Pytorch plus A3G design
asappresearch/emergent-comms-negotiation
Reproduce ICLR2018 submission "Emergent Communication through Negotiation"
facebookresearch/end-to-end-negotiator
Deal or No Deal? End-to-End Learning for Negotiation Dialogues
jannerm/spatial-reasoning
Code for the paper "Representation Learning for Grounded Spatial Reasoning"
mit-aera/OptiTrack-Motive-2-Client
ROS and LCM drivers for OptiTrack's Motive 2 software. Optimized for tracking aerial drones. Runs on Ubuntu Linux.
florensacc/snn4hrl
Stochastic Neural Networks for Hierarchical Reinforcement Learning
pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
facebookarchive/CommNet
Neural network model, suitable for multi-agent learning. https://arxiv.org/abs/1605.07736
mit-aera/FlightGoggles
A framework for photorealistic hardware-in-the-loop agile flight simulation using Unity3D and ROS. Developed by MIT AERA group.
wwxFromTju/deepmind_MAS_enviroment
some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Cooperative Multi-Agent Learning》