abhishekkrthakur/tez

Text classification examples - Tokenizer is defined twice

obesp opened this issue · 1 comment

obesp commented

The tokenizer is defined both in the model and the dataset in the BERT text classification examples.

multi_class.py, line 50:
self.tokenizer = transformers.BertTokenizer.from_pretrained("bert-base-uncased", do_lower_case=True)

Indeed it is. It's not needed in the model; looks like a copy-paste error. ;) I will fix it.
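A minimal sketch of the fix being discussed (hypothetical names, not the actual tez example code): construct the tokenizer once, outside both classes, and pass it into the dataset so neither the model nor the dataset builds its own copy. The `BERTDataset` class and its fields here are illustrative assumptions.

```python
class BERTDataset:
    """Dataset that receives an already-constructed tokenizer,
    so the tokenizer is defined exactly once (illustrative sketch,
    not the actual tez code)."""

    def __init__(self, texts, targets, tokenizer, max_len=64):
        self.texts = texts
        self.targets = targets
        self.tokenizer = tokenizer  # injected, not built here
        self.max_len = max_len

    def __len__(self):
        return len(self.texts)

    def __getitem__(self, idx):
        # Tokenize lazily, one example at a time
        enc = self.tokenizer(
            self.texts[idx],
            max_length=self.max_len,
            padding="max_length",
            truncation=True,
        )
        return {"ids": enc["input_ids"], "target": self.targets[idx]}


# In real use the tokenizer would come from transformers, e.g.:
#   tokenizer = transformers.BertTokenizer.from_pretrained(
#       "bert-base-uncased", do_lower_case=True
#   )
#   dataset = BERTDataset(texts, targets, tokenizer)
# and the model class would simply not define self.tokenizer at all.
```

Passing the tokenizer in (rather than hard-coding `from_pretrained` in two places) also makes it easier to swap the base model later, since only one line changes.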