/deep_rl

Single-file truly minimal implementation of state-of-the-art reinforcement learning algorithms.

Primary LanguagePythonMIT LicenseMIT

deep_rl

Code style: black

Single-file truly minimal implementation of state-of-the-art reinforcement learning algorithms.

Simplicity is the ultimate sophistication. Leonardo da Vinci

Algorithm
A2C
ARS
C51 ✔️
DDPG ✔️
Double DQN
DQN ✔️
Dueling DQN ✔️
FQF
IQN ✔️
RAINBOW
Recurrent PPO
REINFORCE ✔️
PER (1) ✔️
PPO ✔️
QR-DQN
SAC ✔️
TD3
TQC
TRPO

(1) with DQN

Credit

Most implementations are taken from vwxyzjn/cleanrl without being stricly equivalent. Other ressources: toshikwa/fqf-iqn-qrdqn.pytorch

Contributions

are welcome.