First of all, thanks for your work. I was reading the A2RL paper, and I wonder what exactly the value output V(s_t, θ_v) is. What is its formulation?
I'd suggest reading the original A3C paper.
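
For reference, here is a short sketch of how the value output is defined in the A3C paper (Mnih et al., 2016), as far as I understand it. It assumes A2RL uses the A3C formulation unchanged, which the thread does not confirm. The symbols γ (discount factor), r_t (reward at step t), and k (number of rollout steps before bootstrapping) follow the A3C paper's notation and do not appear in the question itself.

```latex
% V(s_t; \theta_v) is the critic's estimate of the expected discounted return from state s_t
V(s_t; \theta_v) \approx \mathbb{E}\left[ R_t \mid s_t \right],
\qquad R_t = \sum_{i=0}^{\infty} \gamma^{i} r_{t+i}

% k-step advantage estimate used to update the policy
A(s_t, a_t; \theta, \theta_v)
  = \sum_{i=0}^{k-1} \gamma^{i} r_{t+i}
  + \gamma^{k} V(s_{t+k}; \theta_v)
  - V(s_t; \theta_v)

% The value parameters are trained by regressing V toward the bootstrapped return R
L_v(\theta_v) = \left( R - V(s_t; \theta_v) \right)^{2}
```

In other words, V(s_t, θ_v) is a scalar output of the network (typically a separate head next to the policy output) that is trained to predict the discounted return, and it serves as the baseline in the advantage term.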