starry-sky6688/MARL-Algorithms
Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II
Python
Issues
- 1
Hello, your code is very well written. Where can I find your published papers? I would like to read them, as that would be even more helpful.
#120 opened by lpplpf - 2
Hello, why can't VDN and QMIX perform multi-agent communication?
#118 opened by lpplpf - 1
Question about get_action_weights
#117 opened by honey-mxy - 2
Question about episode rewards dropping off when applying QMIX to another environment
#115 opened by ayton-zhang - 1
Question about the trained model
#116 opened by honey-mxy - 1
Question about the training basis for attention
#114 opened by honey-mxy - 0
About the win rate in evaluate
#113 opened by honey-mxy - 1
Question about GA-Common and GA-AC
#112 opened by issta24 - 2
Quick Start raises an error. What is the problem?
#111 opened by 1patrickBob - 0
Translate code comments to English
#110 opened by gabrielgcbs - 1
Question about QTRAN
#109 opened by Johnson221b - 2
A small question about _get_individual_q in qtran_base.py
#108 opened by Johnson221b - 2
How should eval_hidden and target_hidden in the policy function be understood?
#106 opened by Johnson221b - 0
None
#107 opened by Johnson221b - 1
Custom data training
#105 opened by shandongchong - 1
Can a custom environment be run with the algorithms here?
#104 opened by shswdmx1 - 3
About the reuse_network parameter
#100 opened by zanezhenli - 1
Can the algorithms here be run in other environments?
#103 opened by DurableFrank - 1
Question about hard_weights in G2ANet
#102 opened by Icbyone - 3
About the COMA critic network input
#101 opened by hjh0119 - 1
About the QMIX trick: eligibility traces
#99 opened by zanezhenli - 1
Some questions about the environment parameters
#98 opened by zanezhenli - 1
What is avail_action?
#97 opened by Shuang-ZS - 1
What does it mean that CommNet and G2ANet need an external training algorithm?
#96 opened by Shuang-ZS - 1
- 1
A bug when choosing actions
#94 opened by Chty-syq - 1
MAVEN has a CUDA problem...
#93 opened by harnvo - 4
- 1
Why the rename?
#89 opened by zichuan-liu - 1
QMIX target network update
#88 opened by L-Spike - 4
Why is the model not saved?
#87 opened by lbqdcxy - 1
- 2
Question about the REINFORCE algorithm
#85 opened by Duke-Allen - 1
A question about QMIX
#84 opened by zhouweiqing-star - 1
Is the StarCraft II version SC2 4.10? And how do I cite this project?
#83 opened by zichuan-liu - 7
- 1
Question about get action weights
#81 opened by LXXXXR - 3
Question about network updates
#80 opened by Duke-Allen - 1
def forward(self, q_values, states): # the shape of states is (episode_num, max_episode_len, state_shape) # the q_values passed in are three-dimensional, with shape (episode_num, max_episode_len, n_agents)
#79 opened by zhouweiqing-star - 1
Comments in English?
#78 opened by balamir53 - 4
Question about the policy gradient algorithm REINFORCE
#77 opened by Duke-Allen - 1
- 1
Question about the loss calculation
#75 opened by Duke-Allen - 2
Question about continuous actions
#74 opened by Kevinstone-199898 - 1
Question about the win-rate plot
#73 opened by Duke-Allen - 1
Should probably detach these values, right?
#72 opened by aravindvenu7 - 2
Question about restore
#71 opened by Kevinstone-199898 - 3
Question about the REINFORCE algorithm
#70 opened by Duke-Allen - 4
About setting the batch size
#69 opened by Duke-Allen - 0
Meaning of the numbers in the function get_obs_agent()
#68 opened by JiaoAlvin