/TD3-PyTorch

Implementation of Twin Delayed Deep Deterministic Policy Gradients (TD3) using PyTorch. Paper: Addressing Function Approximation Error in Actor-Critic Methods

Primary LanguagePython

Stargazers