Add new algorithms

Question

Add new algorithms

rahulptel opened this issue 5 years ago · 7 comments

rahulptel commented 5 years ago

It would be nice to add the following algorithms:

RAINBOW
A2C (multiprocessing)

I will submit a PR if I finish any of them.

seungeunrho commented 5 years ago

Awesome!

Answer 1 · 2019-07-16T16:12:15.000Z

Hi!
I think A2C (synchronous update version of A3C) is good.
What about implementing RAINBOW rather than Double, Dueling DQN?
I think the significance of the code to both Double and Dueling DQN is marginal because they are small variations of DQN in terms of implementation.
In contrast, a simple implementation of the RAINBOW might be helpful for many people.
(Actually, Dueling and Double DQN are 2 components of RAINBOW out of 6)
https://arxiv.org/abs/1710.02298

Answer 2 · 2019-07-16T16:27:35.000Z

Agreed. We can go with RAINBOW.

Answer 3 · 2020-06-11T10:51:23.000Z

MuZero would also be a cool algorithm, it is a bit more complicated with the MCTS but it works very well

Answer 4 · 2020-06-11T10:52:26.000Z

Also, thanks so much for sharing.
These are great simple implementations for learning and have been very useful.

If you want to try something else, you could also try to implement them in TensorFlow

Answer 5 · 2020-07-30T11:19:41.000Z

How about SAC?

Answer 6 · 2021-04-05T21:46:56.000Z

How about Phasic Policy Gradient (PPG) as it gives better results than PPO?
Also an example of using these algorithms for non gaming environment like ones with list, dict etc as observation instead of image frames. I guess that will be easy as we will have to use NN instead of CNN. Still a simple example, may be.