- pytorch 2.x
- transformers [[link] https://github.com/huggingface/transformers]
- flair nlp [[link] https://github.com/flairNLP/flair]]
- allennlp [[link] https://allennlp.org/]]
- full data attached to this task can be found in the official repo of the task in the following link [[link] https://github.com/adobe-research/deft_corpus]
- This task aims at classifying each sentence whether it contains a defintion or not
- This is a sequence labeling task aims at classifiying each token to definition class
Model | F1- macro Score |
---|---|
semi-crf (baseline) | 15% |
----- | ----- |
BERT | 39% |
----- | ----- |
BiLSTM CRF + GLOVE Embeddings | 32.2% |
----- | ----- |
BiLSTM CRF + Character Embeddings | 37.1% |
----- | ----- |
BILSTM CRF char+GLOVE Embeddings | 38.2% |
----- | ----- |
BILSTM CRF + ELMO Embeddings | 42.1% |
----- | ----- |
BILSTM CRF + BERT Embeddigns | 43.8% |
----- | ----- |
BILSTM CRF + RoBERTa Embeddings | 43.2% |
----- | ----- |
#BILSTM CRF + flairs Embeddings | 47% |
----- | ----- |
Task1: 29/55 Task2: 33/55