/transfusion-code

Primary language: Python. License: MIT.

Anonymized Supplementary Material

TransFusion Training Code

bash train_masakha_ner_mdeberta.sh

TransFusion Data

TransFusion training and inference data for MasakhaNER are available in the (anonymized) Google Drive folder.

EasyProject Data Generation

The translation data are available in the Google Drive folder. Run the following script to project labels from the translation data in 'conll_nllb_3B_ft.pkl':

python decode_marker_conll.py
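
For readers new to marker-based label projection, the sketch below illustrates the general idea behind a decoding step of this kind: entity spans are wrapped in markers before translation, and the markers are then read back out of the translated text to recover token-level labels. This is not the code in decode_marker_conll.py. The marker format (`<TAG> ... </TAG>`), the function name `decode_markers`, and the example sentence are all assumptions made for illustration, and the structure of the pickle file is not shown here.

```python
import re

# Hypothetical marker format: <TAG> ... </TAG>. The real script may use a different scheme.
MARKER = re.compile(r"<(/?)([A-Z]+)>")


def decode_markers(marked_sentence):
    """Convert a marker-annotated translation into (tokens, BIO labels)."""
    # Pad markers with spaces so they split off as separate pieces.
    spaced = MARKER.sub(lambda m: f" {m.group(0)} ", marked_sentence)
    tokens, labels = [], []
    current = None   # entity type of the currently open span, if any
    start = False    # True when the next token begins a span
    for piece in spaced.split():
        m = MARKER.fullmatch(piece)
        if m:
            closing, tag = m.groups()
            if closing:
                current = None               # closing marker ends the span
            else:
                current, start = tag, True   # opening marker starts a span
            continue
        if current is None:
            labels.append("O")
        else:
            labels.append(("B-" if start else "I-") + current)
            start = False
        tokens.append(piece)
    return tokens, labels


# Made-up example sentence, for illustration only.
tokens, labels = decode_markers("<PER>Amina Diallo</PER> anaishi <LOC>Nairobi</LOC> .")
for token, label in zip(tokens, labels):
    print(token, label)
```

The output is one token per line followed by its label, which mirrors the CoNLL column layout commonly used for NER data.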