dadelani

Assistant Professor @McGill-NLP

Pinned Repositories

afriberta
AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages
Language:Python1 1 00
africanlp-resources
List of all the resources I developed in collaboration with LSV and Masakhane during my doctoral studies and beyond
10 1 04
BERT-NER
Use Google's BERT for named entity recognition （CoNLL-2003 as the dataset）.
Language:Python1 1 00
How-to-distill-your-BERT
Code for the paper: How to Distill your BERT: An Empirical Study on the Impact of Weight Initialisation and Distillation Objectives (ACL 2023)
Language:Python2 0 01
hstm
Code and data for "Heterogeneous Supervised Topic Models"
Language:Python1 0 00
menyo-20k_MT
11 2 04
nepali-ner
Nepali NER dataset and code
Language:Jupyter Notebook6 2 03
NLP_DL_Intro
Deep learning for NLP
Language:Python1 1 06
sib-200
SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects
Language:Python16 1 01
yoruba-text
Yorùbá language training text for NLP, ASR and TTS tasks
Language:Python2 1 00

dadelani's Repositories

dadelani/sib-200
SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects
Language:Python16 1 01
dadelani/africanlp-resources
List of all the resources I developed in collaboration with LSV and Masakhane during my doctoral studies and beyond
10 1 04
dadelani/nepali-ner
Nepali NER dataset and code
Language:Jupyter Notebook6 2 03
dadelani/AfriHG
News headline generation for African languages
2
dadelani/How-to-distill-your-BERT
Code for the paper: How to Distill your BERT: An Empirical Study on the Impact of Weight Initialisation and Distillation Objectives (ACL 2023)
Language:Python2 0 01
dadelani/hstm
Code and data for "Heterogeneous Supervised Topic Models"
Language:Python1 0 00
dadelani/NLP_DL_Intro
Deep learning for NLP
Language:Python1 1 06
dadelani/aclpub2
Language:TeX0 0 00
dadelani/ANEC-An-Amharic-Named-Entity-Corpus-
A Dataset for Amharic Named Entity Recognition
0 01
dadelani/composable-sft
Language:Python1 0
dadelani/dadelani.github.io
David Adelani website
Language:JavaScript0 01
dadelani/dadelani2.github.io
Language:HTML1 01
dadelani/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Language:Python0 0
dadelani/DeBERTa
The implementation of DeBERTa
Language:Python0 0
dadelani/finetune-hf-vits
Finetune VITS and MMS using HuggingFace's tools
Language:Python0 0
dadelani/flores
Facebook Low Resource (FLoRes) MT Benchmark
Language:Python0 0
dadelani/ftac
FTAC text
1 0
dadelani/HornMT
Machine translation (MT) benchmark dataset for languages in the Horn of Africa.
0 0
dadelani/lm-evaluation-harness
A framework for few-shot evaluation of language models.
Language:Python2
dadelani/Medical-Dialogue
Language:JavaScript0 0
dadelani/muliwai
Text pre-processing for NLP datasets
Language:Python1 0
dadelani/MultilingualSIFT
MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning
Language:Python0 0
dadelani/NLLB-inference
Language:Perl0 0
dadelani/open-bible-scripts
scipts for working with open.bible data
Language:Shell1 0
dadelani/portuguese-bert
Portuguese pre-trained BERT models
Language:Python0 0
dadelani/scholarly-metadata
Language:Python0 0
dadelani/setfit
Efficient few-shot learning with Sentence Transformers
Language:Python0 0
dadelani/ViDeBERTa
ViDeBERTa: A powerful pre-trained language model for Vietnamese
Language:Jupyter Notebook0 0
dadelani/wtpsplit
Code for Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation
Language:Python0 0
dadelani/zindi_masakhane_pos
Code for Lacuna Masakhane Parts of Speech Classification Challenge
Language:Jupyter Notebook0 0