dkkim93

Staff Research Scientist @ Field AI

LG AI Research-Ann Arbor

dkkim93's Stars

shariqiqbal2810/MAAC
Code for "Actor-Attention-Critic for Multi-Agent Reinforcement Learning" ICML 2019
Language:Python666172
mxgmn/WaveFunctionCollapse
Bitmap & tilemap generation from a single example with the help of ideas from quantum mechanics
Language:C#23.2k1.2k
alexis-jacq/LOLA_DiCE
Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)
Language:Python9015
google-deepmind/bsuite
bsuite is a collection of carefully-designed experiments that investigate core capabilities of a reinforcement learning (RL) agent
Language:Python1.5k181
chingyaoc/pytorch-REINFORCE
PyTorch Implementation of REINFORCE for both discrete & continuous control
Language:Python26451
jtorde/uav_trajectory_optimizer
Efficient C++ solver to generate trajectories for UAVs
Language:C5118
cbschaff/nlimb
Language:Python205
lanpa/tensorboardX
tensorboard for pytorch (and chainer, mxnet, numpy, ...)
Language:Python7.9k865
mit-aera/Blackbird-Dataset
Language:Python12024
dgriff777/a3c_continuous
A continuous action space version of A3C LSTM in pytorch plus A3G design
Language:Python25859
openai/multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Language:Python2.3k785
sfujim/TD3
Author's PyTorch implementation of TD3 for OpenAI gym tasks
Language:Python1.7k434
tristandeleu/pytorch-maml-rl
Reinforcement Learning with Model-Agnostic Meta-Learning in Pytorch
Language:Python822157
cbfinn/maml_rl
Code for RL experiments in "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks"
Language:Python618180
cbfinn/maml
Code for "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks"
Language:Python2.5k605
shariqiqbal2810/maddpg-pytorch
PyTorch Implementation of MADDPG (Lowe et. al. 2017)
Language:Python560128
ShangtongZhang/DeepRL
Modularized Implementation of Deep RL Algorithms in PyTorch
Language:Python3.2k679
geek-ai/MAgent
A Platform for Many-Agent Reinforcement Learning
Language:Python1.7k332
keras-rl/keras-rl
Deep Reinforcement Learning for Keras.
Language:Python5.5k1.4k
tambetm/gym-minecraft
Minecraft environment for Open AI Gym, based on Microsoft's Malmo.
Language:Python27229
dgriff777/rl_a3c_pytorch
A3C LSTM Atari with Pytorch plus A3G design
Language:Python563119
asappresearch/emergent-comms-negotiation
Reproduce ICLR2018 submission "Emergent Communication through Negotiation"
Language:Python178
facebookresearch/end-to-end-negotiator
Deal or No Deal? End-to-End Learning for Negotiation Dialogues
Language:Python1.4k277
jannerm/spatial-reasoning
Code for the paper "Representation Learning for Grounded Spatial Reasoning"
Language:Python5214
mit-aera/OptiTrack-Motive-2-Client
ROS and LCM drivers for OptiTrack's Motive 2 software. Optimized for tracking aerial drones. Runs on Ubuntu Linux.
Language:C++1913
florensacc/snn4hrl
Stochastic Neural Networks for Hierarchical Reinforcement Learning
Language:Python9516
pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Language:Python82.6k22.2k
facebookarchive/CommNet
Neural network model, suitable for multi-agent learning. https://arxiv.org/abs/1605.07736
Language:Lua21359
mit-aera/FlightGoggles
A framework for photorealistic hardware-in-the-loop agile flight simulation using Unity3D and ROS. Developed by MIT AERA group.
Language:C++39799
wwxFromTju/deepmind_MAS_enviroment
some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Cooperative Multi-Agent Learning》
Language:Python12925

dkkim93

dkkim93's Stars

shariqiqbal2810/MAAC

mxgmn/WaveFunctionCollapse

alexis-jacq/LOLA_DiCE

google-deepmind/bsuite

chingyaoc/pytorch-REINFORCE

jtorde/uav_trajectory_optimizer

cbschaff/nlimb

lanpa/tensorboardX

mit-aera/Blackbird-Dataset

dgriff777/a3c_continuous

openai/multiagent-particle-envs

sfujim/TD3

tristandeleu/pytorch-maml-rl

cbfinn/maml_rl

cbfinn/maml

shariqiqbal2810/maddpg-pytorch

ShangtongZhang/DeepRL

geek-ai/MAgent

keras-rl/keras-rl

tambetm/gym-minecraft

dgriff777/rl_a3c_pytorch

asappresearch/emergent-comms-negotiation

facebookresearch/end-to-end-negotiator

jannerm/spatial-reasoning

mit-aera/OptiTrack-Motive-2-Client

florensacc/snn4hrl

pytorch/pytorch

facebookarchive/CommNet

mit-aera/FlightGoggles

wwxFromTju/deepmind_MAS_enviroment