Implementation tricks for different algorithms
waterhorse1 opened this issue · 2 comments
Really nice work on benchmarking multi-agent RL algorithms; I like it a lot. Going through the code, I noticed that epymarl implements only the basic version of each algorithm and omits many algorithm-specific implementation tricks, such as the value normalization trick for MAPPO mentioned in MAPPO, or the value clipping trick for IPPO mentioned in IPPO. Will future versions of epymarl support such tricks?
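For reference, the two tricks mentioned above are fairly small additions on top of a basic PPO critic update. Below is a minimal PyTorch sketch (not taken from epymarl or the MAPPO codebase; names like `RunningMeanStd` and `clipped_value_loss` are illustrative): a running mean/std estimator for value normalization, and a PPO-style clipped value loss.

```python
import torch


class RunningMeanStd:
    """Running estimate of mean/variance for value normalization (sketch).

    Targets are normalized before regression; the critic's raw outputs
    are denormalized when used to bootstrap returns.
    """

    def __init__(self):
        self.mean, self.var, self.count = 0.0, 1.0, 1e-4

    def update(self, x: torch.Tensor) -> None:
        # Merge batch statistics into the running estimate (Welford-style).
        batch_mean = x.mean().item()
        batch_var = x.var(unbiased=False).item()
        n = x.numel()
        delta = batch_mean - self.mean
        total = self.count + n
        self.mean += delta * n / total
        m_a = self.var * self.count
        m_b = batch_var * n
        self.var = (m_a + m_b + delta ** 2 * self.count * n / total) / total
        self.count = total

    def normalize(self, x: torch.Tensor) -> torch.Tensor:
        return (x - self.mean) / (self.var ** 0.5 + 1e-8)


def clipped_value_loss(values, old_values, returns, clip_eps=0.2):
    """PPO-style value clipping: keep the new value estimate within
    clip_eps of the rollout-time estimate, and take the elementwise
    max of the clipped and unclipped squared errors (pessimistic bound)."""
    values_clipped = old_values + (values - old_values).clamp(-clip_eps, clip_eps)
    loss_unclipped = (values - returns) ** 2
    loss_clipped = (values_clipped - returns) ** 2
    return torch.max(loss_unclipped, loss_clipped).mean()
```

Both pieces slot into an existing critic update without changing the rest of the training loop, which is presumably why they would fit the "similar implementations across algorithms" design mentioned below.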
Hello,
we aim to add more implementation details of the different algorithms in the future. The basic idea was for all algorithms to have very similar implementations, to enable fairer comparison of their achieved returns. We are also happy to accept pull requests for these implementation tricks.
Thank you for your answer.