Reasoning about Entailment with Neural Attention

This is a TensorFlow [3] implementation of the model described in Rocktäschel et al., "Reasoning about Entailment with Neural Attention" (arXiv:1509.06664) [1].

Data

The Stanford Natural Language Inference (SNLI) Corpus

The SNLI dataset by Samuel R. Bowman et al. [4]:

http://nlp.stanford.edu/projects/snli/snli_1.0.zip
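
To fetch and unpack the corpus from Python instead of a browser, a minimal sketch is shown below (the data/ directory is an assumption; place the files wherever main.py expects them):

import os
import urllib.request
import zipfile

# Assumed target directory -- adjust to wherever main.py looks for the corpus.
data_dir = "data"
os.makedirs(data_dir, exist_ok=True)

url = "http://nlp.stanford.edu/projects/snli/snli_1.0.zip"
archive_path = os.path.join(data_dir, "snli_1.0.zip")

# Download the archive and extract the snli_1.0_{train,dev,test} files next to it.
urllib.request.urlretrieve(url, archive_path)
with zipfile.ZipFile(archive_path) as archive:
    archive.extractall(data_dir)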

Word2Vec

The pretrained Word2Vec word and phrase vectors by Mikolov et al. [2]:

https://docs.google.com/uc?id=0B7XkCwpI5KDYNlNUTTlSS21pQmM&export=download
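
These vectors are 300-dimensional, which matches the embedding_dim default listed under Instructions. As a quick sanity check, they can be loaded with gensim (gensim is not necessarily a dependency of this repository; this is only an illustration):

from gensim.models import KeyedVectors

# The link above downloads the GoogleNews-vectors-negative300.bin.gz archive.
vectors = KeyedVectors.load_word2vec_format(
    "GoogleNews-vectors-negative300.bin.gz", binary=True)

print(vectors["king"].shape)  # (300,)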

Instructions

The main script comes with several options, which can be listed with the --help flag.

python main.py --help

To run the training:

python main.py --train

By default, the script runs on GPU 0 with the following parameter values (an example of overriding them follows the list):

learning_rate = 0.001
weight_decay = 0.
batch_size_train = 24
num_epochs = 45
sequence_length = 20
embedding_dim = 300
num_units = 100
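
Assuming these parameters are exposed as command-line flags with the same names (the exact flags are listed by python main.py --help), a run with a custom configuration would look something like:

python main.py --train --learning_rate 0.001 --batch_size_train 32 --num_epochs 45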

Results

(to be updated)

Notes

(to be updated)

References

[1] Tim Rocktäschel, Edward Grefenstette, Karl Moritz Hermann, Tomáš Kočiský, Phil Blunsom, Reasoning about Entailment with Neural Attention, 2015 (arXiv:1509.06664).

[2] Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg Corrado, Jeffrey Dean, Distributed Representations of Words and Phrases and their Compositionality, 2013.

[3] Google, TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems, 2015.

[4] Samuel R. Bowman, Gabor Angeli, Christopher Potts, Christopher D. Manning, A large annotated corpus for learning natural language inference, 2015.