/nstep-sil

Code for NeurIPS 2020 paper 'Self-imitation Learning via Generalized Lower bound Q-learning'

Primary LanguagePython

Watchers