Does this repository only work with Gym environments?
XueminLiu111 opened this issue · 0 comments
XueminLiu111 commented
What if we have a custom environment whose action values are bounded? Do you use a function such as tanh
to squash the Gaussian samples into a valid action range?
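
To make the question concrete, here is a minimal sketch of the kind of squashing I mean. It is not taken from this repository: `squash_action`, `sample_bounded_action`, and the example bounds are hypothetical names chosen for illustration, and it assumes a scalar action with known lower and upper limits.

```python
import math
import random


def squash_action(raw, low, high):
    """Map an unbounded value to the interval [low, high] using tanh.

    Hypothetical helper for illustration only.
    """
    squashed = math.tanh(raw)  # tanh maps any real number into [-1, 1]
    # Rescale from [-1, 1] to [low, high].
    return low + 0.5 * (squashed + 1.0) * (high - low)


def sample_bounded_action(mean, std, low, high, rng=random):
    """Draw a Gaussian sample and squash it into the action bounds."""
    raw = rng.gauss(mean, std)  # unbounded Gaussian sample
    return squash_action(raw, low, high)


if __name__ == "__main__":
    # Example: an action bounded to [-2, 2] (assumed bounds).
    action = sample_bounded_action(mean=0.0, std=1.0, low=-2.0, high=2.0)
    print(action)
```

If the policy's log-probability is used in the loss, squashing like this would also need the usual change-of-variables correction to the Gaussian log-density, so simply clipping the sample is not equivalent.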