SKTBrain/KoBERT

What subword tokenizer do you use?

conan1024hao opened this issue · 2 comments

Hi, I am just curious about which subword tokenizer you used when training the model, BPE or WordPiece?

We trained this model using the SentencePiece Unigram tokenizer.
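
For illustration, here is a minimal sketch of training and using a SentencePiece Unigram tokenizer with the `sentencepiece` Python package. This is not KoBERT's actual training script; the corpus path, model prefix, and vocabulary size are assumptions chosen only for the example.

```python
import sentencepiece as spm

# Train a Unigram model. model_type="unigram" selects the Unigram algorithm
# (it is also SentencePiece's default). Paths and vocab_size are hypothetical.
spm.SentencePieceTrainer.train(
    input="korean_corpus.txt",      # hypothetical raw-text training corpus
    model_prefix="unigram_demo",    # hypothetical output prefix (.model/.vocab)
    vocab_size=8000,                # assumed size, not KoBERT's exact vocabulary
    model_type="unigram",
)

# Load the trained model and tokenize a sentence into subword pieces.
sp = spm.SentencePieceProcessor(model_file="unigram_demo.model")
print(sp.encode("한국어 문장을 토크나이즈합니다.", out_type=str))
```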

@bage79 Thank you!