discrete soft Q learning(SQL) and soft Q imitation learning(SQIL) implementation in pytorch, simple!
Primary LanguagePython