bryanlimy/tf2-transformer-chatbot

Evaluation loses temporal information

Opened this issue · 1 comments

Hey guys,

Thanks for the repo, when I am evaluating the model, input_sentence_i does not seem to be influenced by input_sentence_i-1. Is it possible to run the model in inference mode but so that it retains memory of the previous sentences you entered?

Thanks

Hi, the vanilla Transformer (which this repository implements) does not incorporate information across data points so the closest thing you could do with this model is to combine multiple input sentences and treat it as a single input. Recent work by Kossen et al. attempts to apply self-attention across data points, you might wanna have a look!