jakegrigsby/super_sac
A general model-free off-policy actor-critic implementation. Continuous and Discrete Soft Actor-Critic with multimodal observations, data augmentation, offline learning and behavioral cloning.
Python
Stargazers
- backhotion
- bAmpTBerlin
- BigRedDoge (SWE Intern at The Hartford)
- cycal1020
- DefinitlyEvilEvil
- eli-davis
- ergo-zyh (Zhejiang University)
- frankroeder (TUHH)
- frietz58 (Örebro University)
- hankerbit
- huchanwei123 (Texas A&M University)
- jcj0000
- joel99 (New York, New York)
- justwj
- jxmorris12 (New York, NY)
- lian700
- liuqi8827 (Harbin Institute of Technology)
- LostThinker (China)
- MHajkarim
- mohakbhardwaj (University of Washington)
- piepie1121
- right-chan (KAIST)
- Rowing0914 (LINE Corp)
- seawee1 (Karlsruhe Institute of Technology)
- ShaneFlandermeyer (Advanced Radar Research Center)
- shenmuxin (Happy Planet)
- sjYoondeltar (Seoul)
- SSKKai (University of Bristol)
- StaminaTang
- till2 (Hasso-Plattner-Institute)
- VittorioGiammarino (Lafayette, IN)
- walzter (Barcelona)
- YannickVogt
- zanghyu
- ZhuxinZhang
- zichunxx (China)