Pinned Repositories
ai-arena
The AI Arena: A framework for distributed multi-agent reinforcement learning
ardupilot
ArduPlane, ArduCopter, ArduRover source
CS294-HW1-Pytorch
damarl
Codes for Paper "Delay-Aware Multi-Agent Reinforcement Learning".
Deep-Reinforcement-Learning-Algorithms-with-PyTorch
PyTorch implementations of deep reinforcement learning algorithms and environments
Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
DQLMRS
Code for "Decentralized Function Approximated Q-Learning in Multi-Robot Systems For Predator Avoidance""
LeMOL
Experimenting with meta-learning approaches to opponent modelling in MARL. Building upon previous public implementations of MADDPG and M3DDPG.
MCCG
route-plan
路径规划算法主要是Dijkstra,A*的算法。
wsg1873's Repositories
wsg1873/MCCG
wsg1873/LeMOL
Experimenting with meta-learning approaches to opponent modelling in MARL. Building upon previous public implementations of MADDPG and M3DDPG.
wsg1873/ai-arena
The AI Arena: A framework for distributed multi-agent reinforcement learning
wsg1873/CS294-HW1-Pytorch
wsg1873/damarl
Codes for Paper "Delay-Aware Multi-Agent Reinforcement Learning".
wsg1873/Deep-Reinforcement-Learning-Algorithms-with-PyTorch
PyTorch implementations of deep reinforcement learning algorithms and environments
wsg1873/Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
wsg1873/DQLMRS
Code for "Decentralized Function Approximated Q-Learning in Multi-Robot Systems For Predator Avoidance""
wsg1873/Emergent-Multiagent-Strategies
Emergence of complex strategies through multiagent competition
wsg1873/geom-gcn
wsg1873/IC3Net
Code for ICLR 2019 paper: Learning when to Communicate at Scale in Multiagent Cooperative and Competitive Tasks
wsg1873/liir
Learning Individual Intrinsic Reward in MARL
wsg1873/MaCA
wsg1873/maddpg
Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
wsg1873/marl_transfer
Code for paper 'Learning transferable cooperative behaviors in multi-agent teams' (ICML 2019)
wsg1873/MERL
wsg1873/Multi-Explore
Code for "Coordinated Exploration via Intrinsic Rewards for Multi-Agent Reinforcement Learning"
wsg1873/multiagent-confrontation
This is the source code of "Efficient training techniques for multi-agent reinforcement learning in combatant tasks".
wsg1873/neural-mmo
Code for the paper "Neural MMO: A Massively Multiagent Game Environment for Training and Evaluating Intelligent Agents"
wsg1873/on-policy
This is the official implementation of Multi-Agent PPO.
wsg1873/pymarl2
Rethinking the Importance of Implementation Tricks in Multi-Agent Reinforcement Learning
wsg1873/PyTorch-Tutorial
Build your neural network easy and fast
wsg1873/RelationalGraphLearning
[IROS20] Relational graph learning for crowd navigation
wsg1873/RGAT-ABSA
wsg1873/RL-CBF
wsg1873/robust_cbf
wsg1873/rtrl
wsg1873/sequential_social_dilemma_games
Repo for reproduction of sequential social dilemmas
wsg1873/social_empowerment
wsg1873/StarCraft
Implementations of QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II