seungeunrho/minimalRL

Add new algorithms

rahulptel opened this issue · 7 comments

It would be nice to add the following algorithms:

  • RAINBOW
  • A2C (multiprocessing)

I will submit a PR if I finish any of them.

Hi!
I think A2C (the synchronous-update version of A3C) is a good idea.
What about implementing RAINBOW rather than Double and Dueling DQN?
I think separate code for Double and Dueling DQN would add little, because in terms of implementation they are only small variations of DQN.
In contrast, a simple implementation of RAINBOW might be helpful for many people.
(Actually, Double and Dueling DQN are 2 of the 6 components of RAINBOW.)
https://arxiv.org/abs/1710.02298
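
To illustrate how small those variations are, here is a minimal sketch in plain Python (no PyTorch). It is not code from this repo: the function names `dqn_target`, `double_dqn_target`, and `dueling_q`, and the choice of plain lists for Q-values, are assumptions made only for this example.

```python
# Hypothetical helpers for illustration only; not part of minimalRL.

def dqn_target(reward, next_q_target, gamma=0.99, done=False):
    """Vanilla DQN: the target network both selects and evaluates the next action."""
    if done:
        return reward
    return reward + gamma * max(next_q_target)


def double_dqn_target(reward, next_q_online, next_q_target, gamma=0.99, done=False):
    """Double DQN: the online network selects the action, the target network evaluates it."""
    if done:
        return reward
    best_action = max(range(len(next_q_online)), key=lambda a: next_q_online[a])
    return reward + gamma * next_q_target[best_action]


def dueling_q(value, advantages):
    """Dueling head: Q(s, a) = V(s) + A(s, a) - mean over actions of A(s, .)."""
    mean_advantage = sum(advantages) / len(advantages)
    return [value + a - mean_advantage for a in advantages]
```

Compared with vanilla DQN, Double DQN changes only how the bootstrap action is chosen, and Dueling DQN changes only how the network's last layer combines its two streams.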

Agreed. We can go with RAINBOW.

Awesome!

MuZero would also be a cool algorithm. It is a bit more complicated because of the MCTS, but it works very well.

Also, thanks so much for sharing.
These are great simple implementations for learning and have been very useful.

If you want to try something else, you could also try implementing them in TensorFlow.

How about SAC?

How about Phasic Policy Gradient (PPG), which is reported to give better results than PPO?
Also, an example of using these algorithms in a non-gaming environment would help, such as one whose observations are lists, dicts, etc. instead of image frames. I guess that will be easy, since we would just use a plain NN instead of a CNN. Still, a simple example would be nice, maybe.
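
As a rough sketch of what that could look like: a dict or list observation can be flattened into a fixed-order vector of floats and fed to a fully connected network. The helper name `flatten_observation` and the sample observation keys are hypothetical, and this assumes every leaf value is a plain number.

```python
# Hypothetical helper for illustration only; not part of minimalRL.

def flatten_observation(obs):
    """Recursively flatten nested dicts/lists/tuples of numbers into a flat list of floats."""
    if isinstance(obs, dict):
        flat = []
        for key in sorted(obs):  # sorted keys give a stable feature order
            flat.extend(flatten_observation(obs[key]))
        return flat
    if isinstance(obs, (list, tuple)):
        flat = []
        for item in obs:
            flat.extend(flatten_observation(item))
        return flat
    return [float(obs)]  # leaf: assumed to be a single number


# Made-up observation; the keys are sorted, so "fuel" comes before "position".
features = flatten_observation({"position": [1, 2], "fuel": 0.5})
```

The length of `features` would then be the input size of the network's first linear layer.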