ChatBots

Best parameters for the models

Seq2Seq Model on OpenSubtitles2018

$ python ShaLab/engine.py -d ~/data/OpenSubtitles2018/preprocessed.personachat/ --sort-batches -lr 0.5 -m 0.1 -emsz 300 -hsz 1024 -nlayers 1 -bs 128 --num-epochs 10 -dp 0.35 --glove ~/word_vectors/glove.6B.300d.txt --log-interval 10000 -o ~/models/OpenSubtitles2018/preprocessed.personachat/Seq2Seq/lr-0.5 --gpu 0
2018-05-12 18:49:08 INFO The best checkpoint /home/urikz/models/OpenSubtitles2018/preprocessed.personachat/Seq2Seq/lr-0.5/model.checkpoint.epoch-10.pth.tar. Picking up the model from there
2018-05-12 18:49:08 INFO Loaded dialog model from checkpoint in 0 seconds
----- Validation [valid.personachat.original] after epoch 10 (14592 samples, 11.99 avg source length, 13.11 avg target length) perplexity 65.396
----- Validation [valid.personachat.train.original] after epoch 10 (122496 samples, 11.73 avg source length, 12.89 avg target length) perplexity 77.164
----- Validation [valid.OpenSubtitles2009] after epoch 10 (167296 samples, 7.10 avg source length, 8.05 avg target length) perplexity 18.490
----- Validation [valid] after epoch 10 (13579648 samples, 7.17 avg source length, 8.13 avg target length) perplexity 19.830
----- Validation [valid.personachat.revised] after epoch 10 (14592 samples, 11.99 avg source length, 13.11 avg target length) perplexity 65.395
----- Validation [test.OpenSubtitles2009] after epoch 10 (152704 samples, 7.04 avg source length, 8.00 avg target length) perplexity 18.301
----- Validation [test.OpenSubtitles2018] after epoch 10 (13537536 samples, 7.18 avg source length, 8.14 avg target length) perplexity 19.867
----- Validation [test.personachat.revised] after epoch 10 (13952 samples, 11.78 avg source length, 12.94 avg target length) perplexity 69.681
----- Validation [test.personachat.original] after epoch 10 (13952 samples, 11.78 avg source length, 12.94 avg target length) perplexity 69.671
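
The validation lines in these logs follow a regular pattern, so comparing runs can be scripted. The following is a minimal sketch, assuming the log format shown in this README stays unchanged; `parse_validation_line`, `perplexities`, and the field names are hypothetical helpers for illustration and are not part of ShaLab.

```python
import re

# Matches lines such as:
# ----- Validation [valid] after epoch 10 (13579648 samples, 7.17 avg source length, 8.13 avg target length) perplexity 19.830
VALIDATION_RE = re.compile(
    r"-+ Validation \[(?P<dataset>[^\]]+)\] after epoch (?P<epoch>\d+) "
    r"\((?P<samples>\d+) samples, "
    r"(?P<src_len>[\d.]+) avg source length, "
    r"(?P<tgt_len>[\d.]+) avg target length\) "
    r"perplexity (?P<perplexity>[\d.]+)"
)


def parse_validation_line(line):
    """Return a dict of fields for a validation log line, or None if it does not match."""
    match = VALIDATION_RE.search(line)
    if match is None:
        return None
    return {
        "dataset": match.group("dataset"),
        "epoch": int(match.group("epoch")),
        "samples": int(match.group("samples")),
        "src_len": float(match.group("src_len")),
        "tgt_len": float(match.group("tgt_len")),
        "perplexity": float(match.group("perplexity")),
    }


def perplexities(log_text):
    """Map each evaluated dataset name to its reported perplexity."""
    results = {}
    for line in log_text.splitlines():
        parsed = parse_validation_line(line)
        if parsed is not None:
            results[parsed["dataset"]] = parsed["perplexity"]
    return results
```

Passing the captured output of one run to `perplexities` gives a dictionary keyed by dataset name, which makes it easier to line up the same evaluation set across different models.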

Seq2Seq Model on PersonaChat

$ python ShaLab/engine.py -d ~/data/personachat/preprocessed/vocab-full.train-original/ --sort-batches -lr 1.0 -m 0.1 -emsz 300 -hsz 1024 -nlayers 1 -bs 128 --num-epochs 10 -dp 0.35 --glove ~/word_vectors/glove.6B.300d.txt -o /tmp/model-0 --gpu 0

2018-04-19 12:17:06 INFO The best checkpoint /tmp/model-0/model.checkpoint.epoch-9.pth.tar. Picking up the model from there
2018-04-19 12:17:06 INFO Loaded dialog model from checkpoint in 0 seconds
----- Validation [valid.OpenSubtitles2009] after epoch 10 (167296 samples, 7.10 avg source length, 8.05 avg target length) perplexity 226.403
----- Validation [valid.revised] after epoch 10 (14592 samples, 11.99 avg source length, 13.11 avg target length) perplexity 33.646
----- Validation [valid] after epoch 10 (14592 samples, 11.99 avg source length, 13.11 avg target length) perplexity 33.646
----- Validation [test.OpenSubtitles2009] after epoch 10 (152704 samples, 7.04 avg source length, 8.00 avg target length) perplexity 227.319
----- Validation [test.original] after epoch 10 (13952 samples, 11.78 avg source length, 12.94 avg target length) perplexity 31.538
----- Validation [test.revised] after epoch 10 (13952 samples, 11.78 avg source length, 12.94 avg target length) perplexity 31.538

Seq2Seq Model on OpenSubtitles2009

$ python ShaLab/engine.py -d ~/data/OpenSubtitles/preprocessed.personachat/ --sort-batches -lr 1.0 -m 0.1 -emsz 300 -hsz 1024 -nlayers 1 -bs 128 --num-epochs 10 -dp 0.35 --glove ~/word_vectors/glove.6B.300d.txt --log-interval 1000 -o ~/models/OpenSubtitles/preprocessed.personachat/Seq2Seq/baseline --gpu 0
2018-05-09 16:06:22 INFO The best checkpoint /home/urikz/models/OpenSubtitles/preprocessed.personachat/Seq2Seq/baseline/model.checkpoint.epoch-10.pth.tar. Picking up the model from there
2018-05-09 16:06:22 INFO Loaded dialog model from checkpoint in 0 seconds
----- Validation [valid.personachat.original] after epoch 10 (14592 samples, 11.99 avg source length, 13.11 avg target length) perplexity 106.489
----- Validation [valid.personachat.train.original] after epoch 10 (122496 samples, 11.73 avg source length, 12.89 avg target length) perplexity 126.017
----- Validation [valid] after epoch 10 (167296 samples, 7.10 avg source length, 8.05 avg target length) perplexity 23.232
----- Validation [valid.personachat.revised] after epoch 10 (14592 samples, 11.99 avg source length, 13.11 avg target length) perplexity 106.494
----- Validation [test.OpenSubtitles2009] after epoch 10 (152704 samples, 7.04 avg source length, 8.00 avg target length) perplexity 23.606
----- Validation [test.personachat.revised] after epoch 10 (13952 samples, 11.78 avg source length, 12.94 avg target length) perplexity 114.291
----- Validation [test.personachat.original] after epoch 10 (13952 samples, 11.78 avg source length, 12.94 avg target length) perplexity 114.270

Seq2Seq Model on PersonaChat and OpenSubtitles2009

$ python ShaLab/engine.py -d ~/data/personachat_with_OS2009 --sort-batches -lr 1.0 -m 0.1 -emsz 300 -hsz 1024 -nlayers 1 -bs 128 --num-epochs 10 -dp 0.35 --glove ~/word_vectors/glove.6B.300d.txt --log-interval 1000 -o /tmp/model-5.seq2seq --gpu 0

2018-04-29 18:42:49 INFO The best checkpoint /tmp/model-5.seq2seq/model.checkpoint.epoch-10.pth.tar. Picking up the model from there
2018-04-29 18:42:50 INFO Loaded dialog model from checkpoint in 0 seconds
----- Validation [valid.OpenSubtitles2009] after epoch 20 (167296 samples, 7.10 avg source length, 8.05 avg target length) perplexity 23.105
----- Validation [valid.revised] after epoch 20 (14592 samples, 11.99 avg source length, 13.11 avg target length) perplexity 31.039
----- Validation [valid] after epoch 20 (14592 samples, 11.99 avg source length, 13.11 avg target length) perplexity 31.041
----- Validation [test.OpenSubtitles2009] after epoch 20 (152704 samples, 7.04 avg source length, 8.00 avg target length) perplexity 23.376
----- Validation [test.original] after epoch 20 (13952 samples, 11.78 avg source length, 12.94 avg target length) perplexity 30.022
----- Validation [test.revised] after epoch 20 (13952 samples, 11.78 avg source length, 12.94 avg target length) perplexity 30.023

ProfileMemoryModel (without default memory) on PersonaChat

$ python ShaLab/engine.py -d ~/data/personachat/preprocessed/vocab-full.train-original/ --sort-batches -lr 1.0 -m 0.2 -emsz 300 -hsz 1024 -nlayers 1 -bs 128 --num-epochs 20 -dp 0.5 --glove ~/word_vectors/glove.6B.300d.txt -o ~/models/personachat/ProfileMemoryModel/no_default_memory --gpu 3 --profile-memory-attention general

2018-05-01 21:41:44 INFO The best checkpoint /home/urikz/models/personachat/ProfileMemoryModel/no_default_memory/model.checkpoint.epoch-20.pth.tar. Picking up the model from there
2018-05-01 21:41:44 INFO Loaded dialog model from checkpoint in 0 seconds
----- Validation [valid.revised] after epoch 20 (14592 samples, 11.99 avg source length, 13.11 avg target length) perplexity 32.711
----- Validation [valid] after epoch 20 (14592 samples, 11.99 avg source length, 13.11 avg target length) perplexity 29.686
----- Validation [test.original] after epoch 20 (13952 samples, 11.78 avg source length, 12.94 avg target length) perplexity 28.406
----- Validation [test.revised] after epoch 20 (13952 samples, 11.78 avg source length, 12.94 avg target length) perplexity 30.911

ProfileMemoryModel (without default memory) on PersonaChat and OpenSubtitles2009

$ python ShaLab/engine.py -d ~/data/personachat_with_OS2009 --sort-batches -lr 0.5 -m 0.2 -emsz 300 -hsz 1024 -nlayers 1 -bs 128 --num-epochs 20 -dp 0.4 --glove ~/word_vectors/glove.6B.300d.txt --log-interval 1000 -o ~/models/personachat+OS2009/ProfileMemoryModel/no_default_memory --gpu 2 --profile-memory-attention general

2018-05-01 23:35:37 INFO The best checkpoint /home/urikz/models/personachat+OS2009/ProfileMemoryModel/no_default_memory/model.checkpoint.epoch-14.pth.tar. Picking up the model from there
2018-05-01 23:35:37 INFO Loaded dialog model from checkpoint in 0 seconds
----- Validation [valid.revised] after epoch 20 (14592 samples, 11.99 avg source length, 13.11 avg target length) perplexity 29.443
----- Validation [valid] after epoch 20 (14592 samples, 11.99 avg source length, 13.11 avg target length) perplexity 28.252
----- Validation [test.original] after epoch 20 (13952 samples, 11.78 avg source length, 12.94 avg target length) perplexity 27.373
----- Validation [test.revised] after epoch 20 (13952 samples, 11.78 avg source length, 12.94 avg target length) perplexity 28.450

ProfileMemoryModel (without default memory) on PersonaChat and DailyDialog

$ python ShaLab/engine.py -d ~/data/PersonaChat/withDailyDialog --sort-batches -lr 0.5 -m 0.2 -emsz 300 -hsz 1024 -nlayers 1 -bs 128 --num-epochs 20 -dp 0.4 --glove ~/word_vectors/glove.6B.300d.txt --log-interval 1000 -o ~/models/personachat+DailyDialog/ProfileMemoryModel/no_default_memory --gpu 0 --profile-memory-attention general --model-type profile-memory

2018-07-01 17:03:20 INFO The best checkpoint /home/urikz/models/personachat+DailyDialog/ProfileMemoryModel/no_default_memory/model.checkpoint.epoch-12.pth.tar. Picking up the model from there
----- Validation [valid.revised] after epoch 20 (14592 samples, 11.99 avg source length, 13.11 avg target length) perplexity 32.008
----- Validation [valid] after epoch 20 (14592 samples, 11.99 avg source length, 13.11 avg target length) perplexity 29.463
----- Validation [test.original] after epoch 20 (13952 samples, 11.78 avg source length, 12.94 avg target length) perplexity 28.270
----- Validation [test.revised] after epoch 20 (13952 samples, 11.78 avg source length, 12.94 avg target length) perplexity 30.394

ProfileMemoryModel (without default memory) on PersonaChat and CornellMovieDialogCorpus

$ python ShaLab/engine.py -d ~/data/PersonaChat/withCornellMovieDialogCorpus/ --sort-batches -lr 0.5 -m 0.2 -emsz 300 -hsz 1024 -nlayers 1 -bs 128 --num-epochs 20 -dp 0.4 --glove ~/word_vectors/glove.6B.300d.txt --log-interval 1000 -o ~/models/personachat+CornellMovieDialogCorpus/ProfileMemoryModel/no_default_memory --gpu 2 --profile-memory-attention general --model-type profile-memory

2018-07-01 17:10:40 INFO The best checkpoint /home/urikz/models/personachat+CornellMovieDialogCorpus/ProfileMemoryModel/no_default_memory/model.checkpoint.epoch-14.pth.tar. Picking up the model from there
----- Validation [valid.revised] after epoch 20 (14592 samples, 11.99 avg source length, 13.11 avg target length) perplexity 31.682
----- Validation [valid] after epoch 20 (14592 samples, 11.99 avg source length, 13.11 avg target length) perplexity 29.436
----- Validation [test.original] after epoch 20 (13952 samples, 11.78 avg source length, 12.94 avg target length) perplexity 28.423
----- Validation [test.revised] after epoch 20 (13952 samples, 11.78 avg source length, 12.94 avg target length) perplexity 30.330

ProfileMemoryModel on PersonaChat

$ python ShaLab/engine.py -d ~/data/personachat/preprocessed/vocab-full.train-original/ --sort-batches -lr 1.0 -m 0.2 -emsz 300 -hsz 1024 -nlayers 1 -bs 128 --num-epochs 20 -dp 0.5 --glove ~/word_vectors/glove.6B.300d.txt -o /tmp/model-3 --gpu 3 --profile-memory-attention general --use-default-memory

2018-04-19 12:27:55 INFO The best checkpoint /tmp/model-3/model.checkpoint.epoch-19.pth.tar. Picking up the model from there
2018-04-19 12:27:55 INFO Loaded dialog model from checkpoint in 0 seconds
----- Validation [valid.OpenSubtitles2009] after epoch 20 (167296 samples, 7.10 avg source length, 8.05 avg target length) perplexity 236.495
----- Validation [valid.revised] after epoch 20 (14592 samples, 11.99 avg source length, 13.11 avg target length) perplexity 32.722
----- Validation [valid] after epoch 20 (14592 samples, 11.99 avg source length, 13.11 avg target length) perplexity 29.483
----- Validation [test.OpenSubtitles2009] after epoch 20 (152704 samples, 7.04 avg source length, 8.00 avg target length) perplexity 236.940
----- Validation [test.original] after epoch 20 (13952 samples, 11.78 avg source length, 12.94 avg target length) perplexity 28.098
----- Validation [test.revised] after epoch 20 (13952 samples, 11.78 avg source length, 12.94 avg target length) perplexity 30.779

ProfileMemoryModel on PersonaChat and OpenSubtitles2009

$ python ShaLab/engine.py -d ~/data/personachat_with_OS2009 --sort-batches -lr 0.5 -m 0.2 -emsz 300 -hsz 1024 -nlayers 1 -bs 128 --num-epochs 20 -dp 0.4 --glove ~/word_vectors/glove.6B.300d.txt --log-interval 1000 -o /tmp/model-2 --gpu 2 --profile-memory-attention general --use-default-memory

2018-05-13 00:58:57 INFO The best checkpoint /tmp/model-0/model.checkpoint.epoch-15.pth.tar. Picking up the model from there
2018-05-13 00:58:57 INFO Loaded dialog model from checkpoint in 0 seconds
----- Validation [valid.OpenSubtitles2009] after epoch 20 (167296 samples, 7.10 avg source length, 8.05 avg target length) perplexity 23.053
----- Validation [valid.revised] after epoch 20 (14592 samples, 11.99 avg source length, 13.11 avg target length) perplexity 29.667
----- Validation [valid] after epoch 20 (14592 samples, 11.99 avg source length, 13.11 avg target length) perplexity 27.505
----- Validation [test.OpenSubtitles2009] after epoch 20 (152704 samples, 7.04 avg source length, 8.00 avg target length) perplexity 23.224
----- Validation [test.CornellMovieDialogCorpus] after epoch 20 (85760 samples, 8.29 avg source length, 9.21 avg target length) perplexity 29.818
----- Validation [test.original] after epoch 20 (13952 samples, 11.78 avg source length, 12.94 avg target length) perplexity 26.764
----- Validation [test.revised] after epoch 20 (13952 samples, 11.78 avg source length, 12.94 avg target length) perplexity 28.622

ProfileMemoryModel on PersonaChat and DailyDialog

$ python ShaLab/engine.py -d ~/data/PersonaChat/withDailyDialog --sort-batches -lr 0.5 -m 0.2 -emsz 300 -hsz 1024 -nlayers 1 -bs 128 --num-epochs 20 -dp 0.4 --glove ~/word_vectors/glove.6B.300d.txt --log-interval 1000 -o ~/models/personachat+DailyDialog/ProfileMemoryModel/baseline --gpu 0 --profile-memory-attention general --use-default-memory --model-type profile-memory

2018-07-01 16:21:36 INFO The best checkpoint /home/urikz/models/personachat+DailyDialog/ProfileMemoryModel/baseline/model.checkpoint.epoch-11.pth.tar. Picking up the model from there
----- Validation [valid.OpenSubtitles2009] after epoch 20 (167296 samples, 7.10 avg source length, 8.05 avg target length) perplexity 62.022
----- Validation [valid.revised] after epoch 20 (14592 samples, 11.99 avg source length, 13.11 avg target length) perplexity 31.694
----- Validation [valid] after epoch 20 (14592 samples, 11.99 avg source length, 13.11 avg target length) perplexity 29.119
----- Validation [test.OpenSubtitles2009] after epoch 20 (152704 samples, 7.04 avg source length, 8.00 avg target length) perplexity 60.853
----- Validation [test.CornellMovieDialogCorpus] after epoch 20 (85760 samples, 8.29 avg source length, 9.21 avg target length) perplexity 57.354
----- Validation [test.original] after epoch 20 (13952 samples, 11.78 avg source length, 12.94 avg target length) perplexity 28.033
----- Validation [test.revised] after epoch 20 (13952 samples, 11.78 avg source length, 12.94 avg target length) perplexity 30.220

ProfileMemoryModel on PersonaChat and CornellMovieDialogCorpus

$ python ShaLab/engine.py -d ~/data/PersonaChat/withCornellMovieDialogCorpus/ --sort-batches -lr 0.5 -m 0.2 -emsz 300 -hsz 1024 -nlayers 1 -bs 128 --num-epochs 20 -dp 0.4 --glove ~/word_vectors/glove.6B.300d.txt --log-interval 1000 -o ~/models/personachat+CornellMovieDialogCorpus/ProfileMemoryModel/baseline --gpu 2 --profile-memory-attention general --use-default-memory --model-type profile-memory

2018-07-01 16:32:57 INFO The best checkpoint /home/urikz/models/personachat+CornellMovieDialogCorpus/ProfileMemoryModel/baseline/model.checkpoint.epoch-14.pth.tar. Picking up the model from there
----- Validation [valid.OpenSubtitles2009] after epoch 20 (167296 samples, 7.10 avg source length, 8.05 avg target length) perplexity 38.776
----- Validation [valid.revised] after epoch 20 (14592 samples, 11.99 avg source length, 13.11 avg target length) perplexity 31.611
----- Validation [valid] after epoch 20 (14592 samples, 11.99 avg source length, 13.11 avg target length) perplexity 28.971
----- Validation [test.OpenSubtitles2009] after epoch 20 (152704 samples, 7.04 avg source length, 8.00 avg target length) perplexity 38.226
----- Validation [test.original] after epoch 20 (13952 samples, 11.78 avg source length, 12.94 avg target length) perplexity 27.980
----- Validation [test.revised] after epoch 20 (13952 samples, 11.78 avg source length, 12.94 avg target length) perplexity 30.235
----- Validation [test.DailyDialog] after epoch 20 (47744 samples, 10.29 avg source length, 11.19 avg target length) perplexity 42.194
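
Every result in this README is reported as perplexity. As a reminder of what that metric conventionally means, here is a minimal sketch, assuming perplexity is the exponential of the average per-token negative log-likelihood (natural logarithm). Whether ShaLab normalizes in exactly this way is not stated here, and `perplexity_from_nll` is a hypothetical helper, not a function from the repository.

```python
import math


def perplexity_from_nll(total_nll, num_tokens):
    """Perplexity = exp(total negative log-likelihood / number of target tokens).

    Assumes total_nll is summed over all target tokens and uses the natural log.
    """
    if num_tokens <= 0:
        raise ValueError("num_tokens must be positive")
    return math.exp(total_nll / num_tokens)
```

Under this definition, a model that spreads probability uniformly over 4 candidate tokens at every step has a per-token loss of ln 4 and therefore a perplexity of 4, so lower values mean the model is less "surprised" by the reference responses.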
