dannigt's Stars
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
BlinkDL/RWKV-LM
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RNN and transformer - great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.
huggingface/text-generation-inference
Large Language Model Text Generation Inference
state-spaces/s4
Structured state space sequence models
facebook/Ax
Adaptive Experimentation Platform
facebookincubator/submitit
Python 3.8+ toolbox for submitting jobs to Slurm
uber-research/PPLM
Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.
dropreg/R-Drop
facebookresearch/fairseq2
FAIR Sequence Modeling Toolkit 2
facebookresearch/SONAR
SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.
karpathy/deep-vector-quantization
VQVAEs, GumbelSoftmaxes and friends
Shark-NLP/OpenICL
OpenICL is an open-source framework to facilitate research, development, and prototyping of in-context learning.
jxhe/unify-parameter-efficient-tuning
Implementation of paper "Towards a Unified View of Parameter-Efficient Transfer Learning" (ICLR 2022)
fe1ixxu/ALMA
State-of-the-art LLM-based translation models.
allenai/acl2022-zerofewshot-tutorial
google-research-datasets/cvss
CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus
wxjiao/ParroT
The ParroT framework to enhance and regulate the Translation Abilities during Chat based on open-sourced LLMs (e.g., LLaMA-7b, Bloomz-7b1-mt) and human written translation and evaluation data.
diyiy/ACL2022_Limited_Data_Learning_Tutorial
quanpn90/NMTGMinor
A Neural Machine Translation toolkit for research purpose
bigscience-workshop/multilingual-modeling
BLOOM+1: Adapting BLOOM model to support a new unseen language
bitextor/bicleaner-ai
Bicleaner fork that uses neural networks
turkic-interlingua/til-mt
Machine Translation for Turkic Languages
SimengSun/alpaca_farm_lora
mzboito/IWSLT2022_Tamasheq_data
Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IWSLT2022.
AmericasNLP/americasnlp2022
ZNLP/zero-shot-st
Llamacha/quechua_resources
vladsolovyev/fairseq_summarization
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.