Pinned Repositories
afriberta
AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages
africanlp-resources
List of all the resources I developed in collaboration with LSV and Masakhane during my doctoral studies and beyond
BERT-NER
Use Google's BERT for named entity recognition (CoNLL-2003 as the dataset).
How-to-distill-your-BERT
Code for the paper: How to Distill your BERT: An Empirical Study on the Impact of Weight Initialisation and Distillation Objectives (ACL 2023)
hstm
Code and data for "Heterogeneous Supervised Topic Models"
menyo-20k_MT
nepali-ner
Nepali NER dataset and code
NLP_DL_Intro
Deep learning for NLP
sib-200
SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects
yoruba-text
Yorùbá language training text for NLP, ASR and TTS tasks
dadelani's Repositories
dadelani/sib-200
SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects
dadelani/africanlp-resources
List of all the resources I developed in collaboration with LSV and Masakhane during my doctoral studies and beyond
dadelani/nepali-ner
Nepali NER dataset and code
dadelani/AfriHG
News headline generation for African languages
dadelani/How-to-distill-your-BERT
Code for the paper: How to Distill your BERT: An Empirical Study on the Impact of Weight Initialisation and Distillation Objectives (ACL 2023)
dadelani/hstm
Code and data for "Heterogeneous Supervised Topic Models"
dadelani/NLP_DL_Intro
Deep learning for NLP
dadelani/aclpub2
dadelani/ANEC-An-Amharic-Named-Entity-Corpus-
A Dataset for Amharic Named Entity Recognition
dadelani/composable-sft
dadelani/dadelani.github.io
David Adelani website
dadelani/dadelani2.github.io
dadelani/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
dadelani/DeBERTa
The implementation of DeBERTa
dadelani/finetune-hf-vits
Finetune VITS and MMS using HuggingFace's tools
dadelani/flores
Facebook Low Resource (FLoRes) MT Benchmark
dadelani/ftac
FTAC text
dadelani/HornMT
Machine translation (MT) benchmark dataset for languages in the Horn of Africa.
dadelani/lm-evaluation-harness
A framework for few-shot evaluation of language models.
dadelani/Medical-Dialogue
dadelani/muliwai
Text pre-processing for NLP datasets
dadelani/MultilingualSIFT
MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning
dadelani/NLLB-inference
dadelani/open-bible-scripts
scipts for working with open.bible data
dadelani/portuguese-bert
Portuguese pre-trained BERT models
dadelani/scholarly-metadata
dadelani/setfit
Efficient few-shot learning with Sentence Transformers
dadelani/ViDeBERTa
ViDeBERTa: A powerful pre-trained language model for Vietnamese
dadelani/wtpsplit
Code for Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation
dadelani/zindi_masakhane_pos
Code for Lacuna Masakhane Parts of Speech Classification Challenge