bentrevett/pytorch-seq2seq

Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.

Jupyter NotebookMIT

Issues

Incorrect German Translation
#206 opened 3 months ago by timothy-geiger
0
Tutorial 3: failed to run in google colab
#159 opened 5 months ago by YiyiAm
6
RuntimeError: Expected hidden[0] size (2, 1, 512), got [2, 128, 512] - Seq2Seq Model with PreTrained BERT Model
#162 opened 5 months ago by Ninja16180
0
how to use BPTTIterator for Language Modeling
#163 opened 5 months ago by StephennFernandes
0
Adding layers to the encoder and decoder of seq2seq
#154 opened 5 months ago by dopu2k16
1
Blank Outputs
#155 opened 5 months ago by abdullahkhilji
0
Tutorial 6: PositionWiseFeedforwardLayer - fc_2 activation function
#157 opened 5 months ago by dryng
1
Tutorial 4: Decoder - the calculation of prediction
#158 opened 5 months ago by actforjason
2
Why the batch size is misplaced in the tensor?
#153 opened 5 months ago by abdullahkhilji
2
How to change seq2seq to graph2seq
#203 opened 5 months ago by nhjclxc
0
Seq2seq: Input not matching Output (and big thanks)
#202 opened 5 months ago by s2458588
0
no module named 'torchtext.legacy'
#198 opened 5 months ago by Hzzhang-nlp
2
import
#199 opened 5 months ago by bouchalhakim
0
possible opposite explanation of hidden compared to output in notebook #3
#200 opened 5 months ago by Hadar933
0
Why using tanh function
#194 opened 5 months ago by karimmahalian
3
How do you make this work on android?
#196 opened 5 months ago by Lambdacreator
0
Notebook 1 <eos> problem.
#197 opened 5 months ago by yusufsali61
2
Tutorial 6: [Attention is All You need] Different output at different batch size during Inference
#189 opened 5 months ago by rajeevbaalwan
0
Question about changing params init from xavier to kaiming
#190 opened 5 months ago by yzhang-github-pub
0
Using pretrained BERT embedding
#192 opened 5 months ago by dhurba-baral
0
Question
#184 opened 5 months ago by Letcise
0
torchtext recent version (0.12.0) doesn't support Field, BucketIterator
#185 opened 5 months ago by manik2304
4
Question about how to resolve the out of vocabulary problem during encoding and decoding in tutorial 1
#186 opened 5 months ago by liaomuquan
0
Possible Inaccuracies in training script
#187 opened 5 months ago by rrmina
0
Thank you!
#181 opened 5 months ago by asigalov61
1
Multi-gpu might fail with Attention is all you need
#180 opened 5 months ago by Aaron-Zhao123
3
Custom Text Dataset
#183 opened 5 months ago by moodhiaj
6
using CTC loss instead of Cross Entropy loss:
#177 opened 5 months ago by kerolos
1
spacy load not loading
#179 opened 5 months ago by neqkir
1
Custom dataset: using Tabular dataset
#172 opened 5 months ago by CrispenGari
0
Question about tutorial 1 and 2 Decoder
#168 opened 5 months ago by djaekim
1
Tutorial 1: Differences between Encoder/Decoder in Seq2Seq Model
#169 opened 5 months ago by michael-camilleri
3
Tutorial 6: Attention is all you need
#170 opened 5 months ago by Hannibal046
3
Suggest ask the users (whoever is using this tutorial) to update their spaCy to 3.0 as 2.0 is suddenly very slow today
#165 opened 5 months ago by cestwc
3
Transformer ScaledDotProductAttention energy value on 16-bit Precision.
#191 opened 2 years ago by ankitvad
3
why this
#178 opened 2 years ago by zhiqiangohuo
0
[Bug] Tranformer Seq2Seq Have Wrong Inputs!
#182 opened 2 years ago by bot66
2
Tut 6: TypeError with calculate_bleu_alt
#175 opened 3 years ago by yuvaraj91
1
Tutorial 6: Regarding attention plots
#176 opened 3 years ago by yuvaraj91
0
Tutorial 4 : AssertionError during the trainning
#173 opened 3 years ago by WendkuuniArzouma
7
How easy can those models be converted into production TorchScript (PyTorch model inference pipeline)
#174 opened 3 years ago by kerolos
0
Tutorial 4: Why not set the padding_idx of nn.Embedding's to src_pad_token_idx or tgt_pad_token_idx respectively?
#167 opened 3 years ago by yipliu
1
Tutorial 3: The input of Attention
#149 opened 3 years ago by yipliu
3
Tutorial 6: Question to CrossEntropyLoss
#166 opened 3 years ago by bpaulwitz
1
Lesson 1, decoder to linear batch position
#164 opened 3 years ago by marcpaga
4
Seq2Seq Model with PreTrained BERT Model is Throwing Error During Training
#161 opened 3 years ago by Ninja16180
2
Conversational Model Created Using Pre-Trained BERT Model is Throwing Error During Training
#160 opened 3 years ago by Ninja16180
4
Tutorial 4: RuntimeError: 'lengths' argument should be a 1D CPU int64 tensor, but got 1D cuda:0 Long tensor
#156 opened 3 years ago by STRZGR
3
Tutorial 1: understanding LSTM
#151 opened 3 years ago by harshraj22
3
Tutorial 6: Multihead Attention
#152 opened 3 years ago by hartrials
0