UT-Austin-RPL/amago

a simple and scalable agent for training adaptive policies with sequence-based RL

Python · MIT

Issues

Intended usage of "accelerate launch"
#47 opened 2 months ago by gunshi
2
Can not find Q/K/V sigmaReparam Linear Block
#36 opened 4 months ago by daehwa00
2
Question regarding Mamba
#29 opened 8 months ago by CTP314
4
Inference on CPU
#20 opened 10 months ago by SeeYaaa
2
Symbolic Alchemy results?
#16 opened 10 months ago by carlosluis
2
Questions regarding Crafter experiment
#7 opened a year ago by symoon11
2