p-christ/Deep-Reinforcement-Learning-Algorithms-with-PyTorch

Question on SAC implementation

fokx opened this issue · 0 comments

fokx commented

In SAC.py, line 120:

_, z, action = self.produce_action_and_action_info(state)

However, the output of produce_action_and_action_info(state) is

So, even though the SAC algorithm works in practice, is this a mistake?
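To make the question concrete, here is a minimal sketch of why a mislabeled variable in a tuple unpacking can leave behaviour unchanged. The function below is a hypothetical stand-in, not the repository's `produce_action_and_action_info`; its name, arguments, and return order are assumptions chosen only for illustration.

```python
import math
import random


def produce_action_and_info(mean, std, rng):
    """Hypothetical stand-in, NOT the function in SAC.py.

    Assumed return order (for illustration only):
    (sampled squashed action, Gaussian log-density of the sample,
     deterministic squashed mean).
    """
    pre_squash = rng.gauss(mean, std)  # sample before tanh squashing
    sampled_action = math.tanh(pre_squash)
    # Log-density of the pre-squash sample (no tanh correction term here).
    log_density = (-0.5 * ((pre_squash - mean) / std) ** 2
                   - math.log(std * math.sqrt(2.0 * math.pi)))
    return sampled_action, log_density, math.tanh(mean)


rng = random.Random(0)

# Unpacking binds by position, not by name: `z` receives whatever the
# function returns second, and `action` receives whatever it returns third.
_, z, action = produce_action_and_info(0.5, 1.0, rng)

# If only `action` is used afterwards, the misleading name `z` has no effect.
print(action == math.tanh(0.5))
```

Under these assumptions, the name chosen for the middle value is cosmetic as long as it is never read; it would only become a bug if later code used `z` as though it were the pre-squash sample.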