roberta-pos-finetuning

This repo contains a notebook that shows in detail how to fine-tune a RoBERTa model for the part-of-speech tagging task, using the Polish model sdadas/polish-roberta-base-v2 as the example. It uses modern APIs from the Hugging Face ecosystem and is heavily based on https://huggingface.co/docs/transformers/tasks/token_classification.
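
A key step in the token classification guide linked above is aligning word-level tags with the subword tokens produced by the tokenizer. The snippet below is a minimal sketch of that idea, not the notebook's actual code. It assumes the word indices come from a fast tokenizer's `word_ids()` method; the function name `align_labels_with_tokens` and the sample tag ids are hypothetical and chosen only for illustration.

```python
from typing import List, Optional

# Label value that PyTorch's cross-entropy loss ignores by default.
IGNORE_INDEX = -100


def align_labels_with_tokens(
    word_ids: List[Optional[int]], word_labels: List[int]
) -> List[int]:
    """Map word-level tags onto subword tokens.

    word_ids holds, for each token, the index of the word it came from,
    or None for special tokens (the format returned by word_ids() on a
    fast tokenizer's output). Only the first subword of each word keeps
    the tag; special tokens and later subwords get IGNORE_INDEX.
    """
    aligned = []
    previous_word_id = None
    for word_id in word_ids:
        if word_id is None:
            # Special token such as <s> or </s>.
            aligned.append(IGNORE_INDEX)
        elif word_id != previous_word_id:
            # First subword of a new word: keep the word's tag.
            aligned.append(word_labels[word_id])
        else:
            # Continuation subword of the same word: mask it out.
            aligned.append(IGNORE_INDEX)
        previous_word_id = word_id
    return aligned


# Hypothetical input: three words, the second split into two subwords.
word_ids = [None, 0, 1, 1, 2, None]
word_labels = [4, 7, 2]  # made-up tag ids, one per word
aligned = align_labels_with_tokens(word_ids, word_labels)
print(aligned)  # [-100, 4, 7, -100, 2, -100]
```

Labelling only the first subword keeps one prediction per word, which matches how part-of-speech tags are annotated. Labelling every subword with the word's tag is a possible alternative.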

Dataset used in training: ipipan/nkjp1m

WikKam/roberta-pos-finetuning