UT-Austin-RPL/amago
a simple and scalable agent for training adaptive policies with sequence-based RL
PythonMIT
Issues
- 2
Intended usage of "accelerate launch"
#47 opened by gunshi - 2
Can not find Q/K/V sigmaReparam Linear Block
#36 opened by daehwa00 - 4
Question regarding Mamba
#29 opened by CTP314 - 2
Inference on CPU
#20 opened by SeeYaaa - 2
Symbolic Alchemy results?
#16 opened by carlosluis - 2
Questions regarding Crafter experiment
#7 opened by symoon11