sted97
Ph.D. Student @SapienzaUniversity & NLP Researcher @Babelscape. My research interests range from multilingual Information Extraction to Generative AI.
PhD Student at @SapienzaNLP group | NLP Researcher at @Babelscape.Italy
Pinned Repositories
ALERT
Official repository for the paper "ALERT: A Comprehensive Benchmark for Assessing Large Language Models’ Safety through Red Teaming"
ID10M
Data and code for the paper "ID10M: Idiom Identification in 10 Languages" (NAACL 2022).
multinerd
Repository for the paper "MultiNERD: A Multilingual, Multi-Genre and Fine-Grained Dataset for Named Entity Recognition (and Disambiguation)" (NAACL 2022).
ner4el
Repository for the paper "Named Entity Recognition for Entity Linking: What Works and What's Next" (EMNLP 2021).
ner4id
Data and code for the paper "NER4ID at SemEval-2022 Task 2: Named Entity Recognition for Idiomaticity Detection".
wikineural
Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2021).
ALERT_repo
Official repository for the paper "ALERT: A Comprehensive Benchmark for Assessing Large Language Models’ Safety through Red Teaming"
entity-recognition-datasets
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
mteb
MTEB: Massive Text Embedding Benchmark
NER-datasets
Datasets to train supervised classifiers for Named-Entity Recognition in different languages (Portuguese, German, Dutch, French, English)
sted97's Repositories
sted97/ALERT_repo
Official repository for the paper "ALERT: A Comprehensive Benchmark for Assessing Large Language Models’ Safety through Red Teaming"
sted97/entity-recognition-datasets
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
sted97/mteb
MTEB: Massive Text Embedding Benchmark
sted97/NER-datasets
Datasets to train supervised classifiers for Named-Entity Recognition in different languages (Portuguese, German, Dutch, French, English)
sted97/nlp-notebooks
A collection of notebooks for Natural Language Processing from NLP Town
sted97/sted97.github.io
sted97/wsd-hard-benchmark
Data and code for "Nibbling at the Hard Core of Word Sense Disambiguation" (ACL 2022).