Issues
- 0
TorchScript EncoderDecoderTransformer
#147 opened by msaroufim - 1
Can I use WordPiece tokenizer when I pretrain RobertaModel following the guideline?
#149 opened by qwer4107 - 0
Cannot build model
#148 opened by at3e - 0
Problems in comparing parameters
#146 opened by leonleeldc - 4
[question] How to know which arch parameters to use?
#145 opened by HyejinWon - 2
Inference/finetuning Blender in fairseq
#144 opened by hckh - 1
- 2
Data preparation
#130 opened by zwx8981 - 1
Instructions for fine-tuning pre-trained models
#139 opened by y3nk0 - 1
Pretrained word embeddings for machine translation?
#141 opened by darsh10 - 0
What is positional score per token?
#140 opened by aj7tesh - 3
- 0
Error when loading libbleu
#138 opened by amelieyu - 2
RuntimeError during training
#136 opened by zhao1iang - 2
How does the 1D convolution work?
#135 opened by robrechtme - 1
Typo in reference README
#134 opened by robrechtme - 2
Unable to Download Pretrained models
#133 opened by nithya4 - 0
Encountered CUDA error: out of memory when training multilingual translation task
#132 opened by moonscar - 2
generate.py RuntimeError: cublas runtime error : library not initialized at torch/lib/THC/THCGeneral.c
#131 opened by shirakad - 0
Segmentation fault during training
#129 opened by mattiadg - 1
Is the Writing Prompts dataset available in this repo?
#127 opened by GBLin5566 - 3
Cannot reverse order of translation
#128 opened by andrewb-ms - 0
sh: 1: ~/mosesdecoder/symal: Permission denied
#126 opened by travel-go - 0
- 2
Averaging model parameters
#124 opened by patrik-lambert - 2
Pointers to distributed training
#117 opened by posenhuang - 11
create new class
#99 opened by linhanxiao - 2
Training error after using BPE
#102 opened by happygirl123456 - 1
Generate from raw text
#105 opened by nguyenlab - 1
Does fairseq support running as a service?
#123 opened by huqitu - 6
Generation with alignment dictionary
#121 opened by patrik-lambert - 1
Any method to reduce GPU memory usage?
#122 opened by frankang - 1
The problem of Chinese translation into English
#120 opened by travel-go - 2
Characters as input instead of words
#119 opened by riteshpanjwani - 1
Reproduce the result on WMT14 en-de
#112 opened by Zrachel - 7
I want to skip some epochs
#118 opened by travel-go - 2
the result of WMT en-de
#116 opened by travel-go - 3
- 3
attention visualization of different decoder layers
#114 opened by SuperWu090 - 10
Assertion in topk() failed during generation
#113 opened by travel-go - 4
How convolution exactly works in this network?
#111 opened by aburkov - 0
cannot reproduce result on wmt en-de
#110 opened by Zrachel - 0
regularization
#109 opened by linhanxiao - 9
loss regularization
#107 opened by linhanxiao - 2
out of memory at the end of epoch
#108 opened by posenhuang - 6
FairSeq model as a Server
#104 opened by mdasadul - 2
pytorch version
#106 opened by linhanxiao - 3
- 2
train the example model error: Segmentation fault
#101 opened by Lingogo - 3
question about network training
#103 opened by linhanxiao