switch base model from xlm-roberta-base

Question

switch base model from xlm-roberta-base

Closed this issue 8 months ago · 2 comments

Hi, I'm working with historical swedish text, and want to train a sentence-segmenter for that (historical swedish text is very inconsistent with punctuation, capital letters etc). I've finetuned a swedish BERT-model on a historical text corpora, and now I want to use this model instead of xlm-roberta-base, when I train the sentence segmenter, I've tried changing the base model, so that it loads the BERT model instead, but I get a mismatch in parameters. Can you give me some tips on what I have to do to change the base_model from xlm-roberta-base to my finetuned historical BERT-model.

Best regards, and thanks for a great repo!

Answer 1 · 2022-05-22T19:59:31.000Z

It seems like the xlm-roberta-model is hardcoded into the codebase.
Have a look at spacy for individual models: https://explosion.ai/blog/ud-benchmarks-v3-2

Answer 2 · 2023-11-07T18:52:58.000Z

Hi @Sneriko,
Thanks for letting us know.
To support another base model, one way is to clone our repo and install it from source. Once the installation is done, you may change the encoder specified in the two following files:

Base model file: https://github.com/nlp-uoregon/trankit/blob/master/trankit/models/base_models.py
Config file: https://github.com/nlp-uoregon/trankit/blob/master/trankit/config.py