- add PER [X] / ERE
- add n-step [X]
- add Munchausen RL
- add D2RL
- Make test runs for BulletEnvs
- Make comparison to SAC
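
As an illustration of the n-step item above, here is a minimal sketch of collapsing a window of consecutive transitions into a single n-step transition. It is not taken from this repository: the function name `n_step_transition` and the `(state, action, reward, next_state, done)` tuple layout are assumptions made for the example.

```python
def n_step_transition(window, gamma):
    """Collapse consecutive transitions into one n-step transition.

    `window` is a sequence of (state, action, reward, next_state, done)
    tuples in time order (hypothetical layout, assumed for this sketch).
    """
    state, action = window[0][0], window[0][1]
    n_step_return = 0.0
    # Default to the last transition in the window.
    next_state, done = window[-1][3], window[-1][4]
    for k, (_, _, reward, nxt, terminal) in enumerate(window):
        n_step_return += (gamma ** k) * reward
        if terminal:
            # Stop accumulating once the episode has ended.
            next_state, done = nxt, True
            break
    return state, action, n_step_return, next_state, done


window = [(0, 0, 1.0, 1, False), (1, 0, 1.0, 2, False), (2, 0, 1.0, 3, False)]
print(n_step_transition(window, gamma=0.5))
```

When such a transition is used for the critic update, the bootstrap term is typically discounted by `gamma ** n` rather than `gamma`, and dropped entirely when `done` is true.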
BY571/TD3-and-Extensions
PyTorch implementation of Twin Delayed Deep Deterministic Policy Gradient (TD3), including additional extensions to improve the algorithm's performance.
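
For orientation, the core TD3 critic target (clipped double-Q with target policy smoothing) can be sketched in plain Python for a single scalar action. This is a simplified illustration, not the repository's PyTorch code: `td3_target`, its parameter names, and the default values shown are assumptions, and `q1_next` / `q2_next` stand in for the two target critics evaluated at the next state.

```python
import random


def td3_target(reward, done, next_action, q1_next, q2_next,
               gamma=0.99, policy_noise=0.2, noise_clip=0.5,
               max_action=1.0, rng=random):
    """Compute a TD3-style target for one transition (scalar action).

    q1_next and q2_next are callables mapping an action to the target
    critics' Q-values at the next state (hypothetical stand-ins).
    """
    # Target policy smoothing: add clipped Gaussian noise to the target action.
    noise = rng.gauss(0.0, policy_noise)
    noise = max(-noise_clip, min(noise_clip, noise))
    smoothed = max(-max_action, min(max_action, next_action + noise))
    # Clipped double-Q: take the smaller of the two target critic estimates.
    q_min = min(q1_next(smoothed), q2_next(smoothed))
    # No bootstrapping past a terminal state.
    return reward + gamma * (1.0 - float(done)) * q_min


# Toy critics, assumed purely for demonstration.
print(td3_target(1.0, False, 0.5, lambda a: 2.0 + a, lambda a: 1.0 - a))
```

The third TD3 ingredient, delayed policy updates, is a matter of update scheduling (the actor and target networks are updated less often than the critics) and is not shown here.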
Python