zalkikar/limitation-learning
Generative adversarial imitation learning to produce a proxy for the reward function present in dialogue.
Jupyter NotebookApache-2.0
Generative adversarial imitation learning to produce a proxy for the reward function present in dialogue.
Jupyter NotebookApache-2.0