Issues
- 1
- 2
RuntimeError: shape '[32, 1, 1000, 64]' is invalid for input of size 2273280
#16 opened by Tylersuard - 0
Instructions for generating tokens from LM?
#18 opened by Tylersuard - 0
Script hangs forever on validation step
#17 opened by Tylersuard - 0
New Pathfinder-X2 Dataset!
#15 opened by Tylersuard - 0
Maximum context length?
#13 opened by Tylersuard - 0
Any plans to merge with `fairseq`?
#12 opened by mahnerak - 2
How to get test accuracy
#9 opened by aleksandar-terzic - 3
Attention masking in MovingAverageGatedAttention
#11 opened by jambo6 - 1
Tokenization for downstream tasks
#10 opened by danigoju - 4
Non-commercial license
#7 opened by mnaylor5 - 1
ONNX support
#6 opened by leonid-pishchulin - 1
Regarding the damping factor δ
#4 opened by ciaua - 3
Fail for transformer_lra_pf32
#3 opened by nachewigkeit - 1
The lack of warmup-updates
#2 opened by nachewigkeit - 0
Lack of src-bin/dict.txt
#1 opened by nachewigkeit