tokenizer.model_max_length is incorrect

Question

mh-northlander opened this issue 2 years ago · 1 comments

The parameter model_max_length of chitra tokenizer seems too large.

> sudachitra.sudachitra.BertSudachipyTokenizer.from_pretrained("chiTra-1.0").model_max_length
1000000000000000019884624838656

Answer 1 · 2023-03-06T08:13:59.000Z

Adding "model_max_length": 512 to the tokenizer_config.json will solve this problem.