CNN Policy

Question

Opened this issue 6 years ago · 1 comments

Can you please add an example of a CNN policy? All the code is oriented towards MLP policies.

Answer 1 · 2018-10-25T08:49:49.000Z

I found that the observation for Hopper-v1/v2 is an 11-dimensional vector, not an image, so an MLP policy is the suitable choice for this Gym environment. If you are working in a different domain where the observations are images, you can certainly try a CNN policy.
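As a rough illustration of that rule of thumb, here is a minimal, framework-free sketch that picks a policy type from the observation shape and works out how many features a small convolutional stack would hand to the policy head. The function names, the 84x84x4 image shape, and the layer configuration are all assumptions made for illustration; none of them come from this repository.

```python
from typing import Sequence, Tuple


def suggest_policy(obs_shape: Sequence[int]) -> str:
    """Return "mlp" for flat vector observations and "cnn" for image-like ones."""
    if len(obs_shape) == 1:
        return "mlp"  # e.g. Hopper's 11-dimensional state vector
    if len(obs_shape) == 3:
        return "cnn"  # e.g. (height, width, channels) pixel input
    raise ValueError(f"unsupported observation shape: {tuple(obs_shape)}")


def conv_out_size(size: int, kernel: int, stride: int) -> int:
    """Spatial size after one unpadded convolution along one axis."""
    return (size - kernel) // stride + 1


def cnn_feature_size(height: int, width: int,
                     layers: Sequence[Tuple[int, int, int]]) -> int:
    """Flattened feature count after a stack of unpadded conv layers.

    Each layer is (out_channels, kernel, stride).
    """
    if not layers:
        raise ValueError("at least one conv layer is required")
    channels = 0
    for out_channels, kernel, stride in layers:
        height = conv_out_size(height, kernel, stride)
        width = conv_out_size(width, kernel, stride)
        if height < 1 or width < 1:
            raise ValueError("input is too small for this layer stack")
        channels = out_channels
    return channels * height * width


# Hypothetical layer stack, chosen only for illustration.
EXAMPLE_LAYERS = [(32, 8, 4), (64, 4, 2), (64, 3, 1)]

print(suggest_policy((11,)))                     # mlp
print(suggest_policy((84, 84, 4)))               # cnn
print(cnn_feature_size(84, 84, EXAMPLE_LAYERS))  # 3136
```

The flattened feature size is what a fully connected policy head would take as input in place of the raw observation, so the rest of an MLP policy can usually stay the same once a convolutional front end is added.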