ivulic's Stars
facebookresearch/MUSE
A library for Multilingual Unsupervised or Supervised word Embeddings
state-spaces/s4
Structured state space sequence models
AkariAsai/self-rag
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.
JustGlowing/minisom
:red_circle: MiniSom is a minimalistic implementation of the Self Organizing Maps
PolyAI-LDN/conversational-datasets
Large datasets for conversational AI
babylonhealth/fastText_multilingual
Multilingual word vectors in 78 languages
princeton-nlp/LM-BFF
[ACL 2021] LM-BFF: Better Few-shot Fine-tuning of Language Models https://arxiv.org/abs/2012.15723
globalwordnet/english-wordnet
The Open English WordNet
PolyAI-LDN/pheme
PolyAI-LDN/task-specific-datasets
A collection of task-specific NLU datasets
ZhangXInFD/soundstorm-speechtokenizer
Implementation of SoundStorm built upon SpeechTokenizer.
cambridgeltl/xcopa
XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning
cambridgeltl/composable-sft
A library for parameter-efficient and composable transfer learning for NLP with sparse fine-tunings.
nmrksic/attract-repel
The Attract-Repel algorithm presented in (Mrkšić et al., TACL 2017), with accompanying resources.
nmrksic/LEAR
Specialising Word Vectors for Lexical Entailment
ndaheim/faithful-dialogue
ducdauge/sft-llm
Scaling Sparse Fine-Tuning to Large Language Models
codogogo/towerparse
Tower Parse: Low-Resource Dependency Parsing via Hierarchical Source Selection
cambridgeltl/multi3woz
The official repository for Multi3WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for Training and Evaluating Culturally Adapted Task-Oriented Dialog Systems (TACL 2023)
cambridgeltl/e2e_tod_toolkit
A codebase for e2e ToD toolkit.
ivulic/panlex-bli
Bilingual lexicon induction (BLI) training and test sets extracted from PanLex - used in the work of Vulić et al. (EMNLP 2019)
codogogo/instamap
Instance-Based Mapping for Induction of Cross-Lingual Word Embedding Spaces
cambridgeltl/COD
ivulic/hyperlex
HyperLex: a gold standard resource for measuring and evaluating how well semantic models capture graded or soft lexical entailment