DongjunLee/conversation-tensorflow

Is it possible to use customer service conversation as data set with this model ?

genz10 opened this issue · 7 comments

Hi,

Is it possible to use customer service conversation as data set with this model ? such as

Hello
May I help you ?
I want to ask something
Yes please
Does my booking refundable ?
May I get your booking ID.
1234567890
Yes, your booking is refundable.
I want to submit a refund for my booking.
...
...
...

Hi @genz10,
Yes it is.

If you make a chatbot that automates custom services, you need lots of customer service conversation data. And there will be uncertainty in the answer.

@DongjunLee

Thanks for your reply. FYI I've been successfully reproduce tiny model, after 20K epoch and final loss 0.0223939. I found that, when the epoch is not enough it will caused loss value to be unstable.

For now, I'm still experimenting your model to be applied into customer service data set. (Arround 8K question and 8K answer), should I report the result here ?

I think 8K data set is not enough, but maybe it can be working.
I wonder what your experiment results are. :)

How this model overcome such conversation like this

A : Hello
A : I want to ask something
A : Regarding my purchase yesterday
B : Yes please

There is multiple question before there is an answer.

My idea is to concatenate multiple sentences into 1 line. But will it works ?

I think that simply solution makes the input sentences are long and it means Increase data sparsity.
But you need to try it.

Or check this paper : Building End-To-End Dialogue Systems Using Generative Hierarchical Neural Network Model

I've checked that paper, and gives me some idea to use triplets model instead of single sentence for each line.

But for the training, after reach 20k epoch and 0,02XXXX loss value, I found that the model can't answer simple question, I'm still investigating what's wrong.

Maybe that is because of lack of data, it seems to be overfitting to the training data.
If you want to get the same results as training data, try doing the beam_width as 0. (using GreedyEmbeddingHelper)