hkproj/pytorch-transformer

Implementation of "Attention Is All You Need"

Issues

Error: DatasetGenerationError: An error occurred while generating the dataset
#25 opened 20 days ago by hissain, 0 comments
only spaces get predicted
#18 opened a month ago by aadityapattabhiraman, 3 comments
W_o matrix in multiheadattention
#23 opened 2 months ago by PING-An32, 0 comments
Clarification regarding decoder_input and label
#22 opened 3 months ago by BrLlan, 1 comment
Clarification regarding dropout in the multihead attention block
#21 opened 3 months ago by anupsingh15, 0 comments
Not an issue, but a re-implementation
#20 opened 4 months ago by chettiargautam, 0 comments
translate.py not consistent
#19 opened 4 months ago by dzjxzyd, 2 comments
System cannot find the path specified - tokenizer path.
#17 opened 5 months ago by shuklaji28, 1 comment
Getting Error : module transformers has no attribute 'PreTrainedtokenizerBase'
#16 opened 5 months ago by shuklaji28, 0 comments
Quality after 20 epoch training
#15 opened 5 months ago by thanhnew2001, 1 comment
The implementation of ResidualConnection is not correct?
#14 opened 6 months ago by imxtx, 2 comments
Fourier Positional Embeddings
#13 opened 6 months ago by mourad-ghafiri, 0 comments
Truncation for Tokenizer
#12 opened 7 months ago by ardaaras99, 1 comment
Position Encoding
#2 opened a year ago by Sheiphan, 2 comments
forward method in transformer class
#10 opened 9 months ago by akkasi, 1 comment
dataset
#9 opened 9 months ago by debby-2020, 3 comments
projection_layer should output logits value directly instead of log_softmax
#7 opened 10 months ago by liyang85105, 3 comments
Question: train.py
#5 opened 10 months ago by mohamedelbahnasawi, 4 comments
multihead attention biases
#4 opened 10 months ago by stanchiang, 1 comment