qgallouedec/deep_rl

Single-file truly minimal implementation of state-of-the-art reinforcement learning algorithms.

PythonMIT

deep_rl

Single-file truly minimal implementation of state-of-the-art reinforcement learning algorithms.

Simplicity is the ultimate sophistication. Leonardo da Vinci

Algorithm
A2C	❌
ARS	❌
C51	✔️
DDPG	✔️
Double DQN	❌
DQN	✔️
Dueling DQN	✔️
FQF	❌
IQN	✔️
RAINBOW	❌
Recurrent PPO	❌
REINFORCE	✔️
PER ⁽¹⁾	✔️
PPO	✔️
QR-DQN	❌
SAC	✔️
TD3	❌
TQC	❌
TRPO	❌

⁽¹⁾ with DQN

Credit

Most implementations are taken from vwxyzjn/cleanrl without being stricly equivalent. Other ressources: toshikwa/fqf-iqn-qrdqn.pytorch

Contributions

are welcome.