davidstap's Stars
google/styleguide
Style guides for Google-originated open-source projects
stanfordnlp/dspy
DSPy: The framework for programming—not prompting—language models
google/yapf
A formatter for Python files
ctgk/PRML
PRML algorithms implemented in Python
NVIDIA/apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
facebookresearch/lingua
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
dvanoni/notero
A Zotero plugin for syncing items and notes into Notion
rsennrich/subword-nmt
Unsupervised Word Segmentation for Neural Machine Translation and Text Generation
embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark
NX-AI/xlstm
Official repository of the xLSTM.
allenai/natural-instructions
Expanding natural instructions
translate/translate
Useful localization tools with Python API for building localization & translation systems
stas00/the-art-of-debugging
The Art of Debugging
joeynmt/joeynmt
Minimalist NMT for educational purposes
hplt-project/sacremoses
Python port of Moses tokenizer, truecaser and normalizer
allenai/python-package-template
A template repo for Python packages
rspeer/langcodes
A Python library for working with and comparing language codes.
neulab/knn-transformers
PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an implementation of kNN-LM and kNN-MT
jsbaan/transformer-from-scratch
Well documented, unit tested, type checked and formatted implementation of a vanilla transformer - for educational purposes.
prajdabre/yanmtt
Yet Another Neural Machine Translation Toolkit
responsibleproblemsolving/energy-usage
Provides a function to measure the energy usage of another function.
bzhangGo/zero
Zero -- A neural machine translation system
thammegowda/mtdata
A tool that locates, downloads, and extracts machine translation corpora
NJUNLP/knn-box
an easy-to-use knn-mt toolkit
reycn/notion-zotero
Create a Notion collection, synced with Zotero.
ZurichNLP/mbr
Minimum Bayes Risk Decoding for Hugging Face Transformers
masakhane-io/lafand-mt
MAFAND-MT
FadedCosine/kNN-KD
Code for paper "Nearest Neighbor Knowledge Distillation for Neural Machine Translation" by Zhixian Yang, Renliang Sun, and Xiaojun Wan. This paper is accepted by NAACL 2022 Main Conference.
alvations/gachalign
Gale-Church sentence aligner with options for variable parameters
ASoleimaniB/NLQuAD
NLQuAD: A Non-Factoid Long Question Answering Data Set. To be published at EACL2021