/DialogPolicy

Dialog policy optimization for task-oriented conversational agents in low resource setting using Reinforcement learning. The methodology is based on a novel probability based Self-play technique and a novel Reward based sampling technique that prioritizes failed dialogues over successful ones

Primary LanguageOpenEdge ABLOtherNOASSERTION

No issues in this repository yet.