jxzhanggg opened this issue 5 years ago · 1 comments
Part of code in both fine-tune and pre-train is the same, for example, the beam search code, decoder, basic_layers etc. I think the code can be reorganized to be more compact.
too lazy to update this.