/structured_fusion_networks

Code for SIGDial 2019 Best Paper: Structured Fusion Networks for Dialog https://arxiv.org/abs/1907.10016

Primary LanguagePython

Structured Fusion Networks

Structured Fusion Networks for Dialog https://arxiv.org/abs/1907.10016

Code written by Shikib Mehri and Tejas Srinivasan.

Pretrained Models

The main models trained in the paper have been provided and can be used as follows:

Sequence-to-Sequence with Attention:

python3 test.py --model_name models/attn --use_attn --test_only

Structured Fusion Networks:

python3 test.py --model_name models/sfn --structured_fusion --test_only

Structured Fusion Networks with Reinforcement Learning:

python3 test.py --model_name models/sfn_rl --structured_fusion --test_only

Training

Structured Fusion Networks can be trained with the following command:

python3 train.py --model_name 'models/sfn' --structured_fusion --use_cuda True --tune_params True

SFNs can be fine-tuned with RL as follows (you may need to edit the code a bit):

python3 rl_train.py --structured_fusion --model_name models/rl_sfn --lr 0.00001

Questions

Please feel free to send any questions/concerns to amehri@andrew.cmu.edu.

Citing

The following paper should be cited if you use this code.

@article{mehri2019structured,
  title={Structured Fusion Networks for Dialog},
  author={Mehri, Shikib and Srinivasan, Tejas and Eskenazi, Maxine},
  journal={arXiv preprint arXiv:1907.10016},
  year={2019}
}