cuayahuitl/SimpleDS

what is usage of data/english/dialogueN.txt

Closed this issue · 4 comments

Hi, I have a question about the usage of dialogues.

The dialogues in data/language file is just used to train a classifier?

Thanks Heriberto,

I got your meaning since i have read your paper SimpleDS. The dialog examples in data/english is used to compute the rewards, and to narrow action space depending on probability threshold? We know that deep Q-learining with experience replay is well suited for unsupervised learning, there is no need for preparing supervised data. These 6 dialogs are not the training examples in RL.

Hi, I have a question about the running procedure of runclient.js.

During the training, the policy and the output hadn't been saved in results. Is it right?
_20180927172202