MarcoMeter/episodic-transformer-memory-ppo
Clean baseline implementation of PPO using an episodic TransformerXL memory
PythonMIT
Stargazers
- AI-Ahmed@Qme-ai
- amisukiKrafton
- AntoineThebUniversité de Sherbrooke
- AutoRecursive
- cwfparsonsonUniversity College London
- dclambert
- Dhawgupta
- dosssman
- Ehplodor
- fly51flyPRIS
- hai-h-nguyenNortheastern University
- hany606Daejeon, South Korea
- hotco87GIST
- ibagurUNICEF/iMMAP Inc.
- jacktheripper19
- jinPreludeFaikerz.Inc
- jonbaerBrooklyn, NY
- jw1401Germany
- knight9114
- Luca96Università di Bologna
- m8e
- mch5048ECE. Seoul National University
- mdiephuisGeneve, La Suisse
- neverparadiseLG Electronics
- ReinholdM中国.北京
- ShawonAshrafellamind GmbH
- sjYoondeltarSeoul
- slerman12Rochester, NY
- SystemclusterAnlatan
- tinyzqhNortheastern University
- tranhoangkhuongvn
- wyatty
- yashkumaratri@IIITD @lcs2-iiitd @hartvigsen-group
- Yonv1943SIAT(中科院深圳先进院)
- zerlinwangTsinghua University
- zhixuan-linUniversity of Montreal