tinkoff-ai/lb-sac
Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Workshop
PythonApache-2.0
Stargazers
- 4SkyNet
- CherryPieSexy
- confiwentShanghai Jiao Tong University
- dokapokaMoscow
- DT6AETH Zurich
- elephantmiptTinkoff AI
- fly51flyPRIS
- gorynych00freelancer
- Howuhh
- kefirski@tinkoff-ai
- leekwoonSeoul, Korea
- SandalotsVolcanak
- Scitator@catalyst-team
- sjYoondeltarSeoul
- suessmann@jbr-ai-labs
- tokarev-i-v
- vkurenkov@tinkoff-ai
- WuTi0525
- yaroslavyaroslavBelgium