QANet

A Pytorch implementation of QANet
The code is mostly based on the two repositories: hengruo/QANet-pytorch NLPLearn/QANet

Performance

Training epochs / Steps	BatchSize	HiddenSize	Attention Heads	EM	F1
12.8 / 35,000	32	96	1	69.0	78.6
22 / 60,000	32	96	1	69.7	79.2
12.8 / 93,200	12	128	8	70.3	79.7
22 / 160,160	12	128	8	70.7	80.0

*The results of hidden size 128 with 8 heads were run with 12 batches.

Download and preprocess the data

# download SQuAD and Glove
$ sh download.sh
# preprocess
$ python3.6 main.py --mode data

Train the model

# model/model.pt will be generated every epoch
$ python3.6 main.py --mode train

# Run tensorboard for visualisation
$ tensorboard --logdir ./log/