/off-policy-RL-algorithms

PyTorch Implementation of off-policy reinforcement learning algorithms like Q-learning, DQN, DDPG and TD3.

Primary LanguageJupyter NotebookMIT LicenseMIT

Stargazers