iTharindu/DialogPolicy
Dialog policy optimization for task-oriented conversational agents in low resource setting using Reinforcement learning. The methodology is based on a novel probability based Self-play technique and a novel Reward based sampling technique that prioritizes failed dialogues over successful ones
OpenEdge ABLNOASSERTION
No issues in this repository yet.