Pinned Repositories
Distributional-Multi-Agent-Actor-Critic-Reinforcement-Learning-MADDPG-Tennis-Environment
The state-of-the-art in multi-agent Reinforcement Learning is the MADDPG algorithm which utilises DDPG actor-critic neural networks where each agent uses centralized critic training but decentralized actor execution, and is capable of learning either cooperative or competitive environments. This is demonstrated on the Unity Tennis Environment.
leela-chess-to-Chinese-Chess
《佳佳象棋 GGzero》 采用 alphazero 技术的**象棋引擎
machinelearning
My blogs and code for machine learning. http://cnblogs.com/pinard
MARL-Papers
Paper list of multi-agent reinforcement learning (MARL)
ModelRepo
reproduce some RL or Multi-Agent models
Multi-Commander
Multi & Single Agent Reinforcement Learning for Traffic Signal Control Problem
Python-100-Days
Python - 100天从新手到大师
swmmio
Python tools for interacting with, editing, and visualizing EPA SWMM5 models
Zhiwei-Xu
zhiweixutsinghua's Repositories
zhiweixutsinghua/Distributional-Multi-Agent-Actor-Critic-Reinforcement-Learning-MADDPG-Tennis-Environment
The state-of-the-art in multi-agent Reinforcement Learning is the MADDPG algorithm which utilises DDPG actor-critic neural networks where each agent uses centralized critic training but decentralized actor execution, and is capable of learning either cooperative or competitive environments. This is demonstrated on the Unity Tennis Environment.
zhiweixutsinghua/leela-chess-to-Chinese-Chess
《佳佳象棋 GGzero》 采用 alphazero 技术的**象棋引擎
zhiweixutsinghua/machinelearning
My blogs and code for machine learning. http://cnblogs.com/pinard
zhiweixutsinghua/MARL-Papers
Paper list of multi-agent reinforcement learning (MARL)
zhiweixutsinghua/ModelRepo
reproduce some RL or Multi-Agent models
zhiweixutsinghua/Multi-Commander
Multi & Single Agent Reinforcement Learning for Traffic Signal Control Problem
zhiweixutsinghua/Python-100-Days
Python - 100天从新手到大师
zhiweixutsinghua/swmmio
Python tools for interacting with, editing, and visualizing EPA SWMM5 models
zhiweixutsinghua/Zhiwei-Xu