jon-tow's Stars
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.
togethercomputer/RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
getzep/zep
Zep | The Memory Foundation For Your AI Stack
jquesnelle/yarn
YaRN: Efficient Context Window Extension of Large Language Models
hyperonym/basaran
Basaran is an open-source alternative to the OpenAI text completion API. It provides a compatible streaming API for your Hugging Face Transformers-based text generation models.
facebookincubator/submitit
Python 3.8+ toolbox for submitting jobs to Slurm
huggingface/nanotron
Minimalistic large language model 3D-parallelism training
Cerebras/modelzoo
NVIDIA/nccl-tests
NCCL Tests
conceptofmind/PaLM
An open-source implementation of Google's PaLM models
ChenghaoMou/text-dedup
All-in-one text de-duplication
alasdairforsythe/tokenmonster
Ungreedy subword tokenizer and vocabulary trainer for Python, Go & Javascript
Azure/MS-AMP
Microsoft Automatic Mixed Precision Library
Guitaricet/relora
Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates
epfml/landmark-attention
Landmark Attention: Random-Access Infinite Context Length for Transformers
taylorai/galactic
data cleaning and curation for unstructured text
huggingface/hf_transfer
EleutherAI/concept-erasure
Erasing concepts from neural representations with provable guarantees
yifanzhang-pro/AutoMathText
Official implementation of DPFM @ ICLR 2024 paper "Autonomous Data Selection with Language Models for Mathematical Texts" (Huggingface Daily Papers: https://huggingface.co/papers/2402.07625)
shayne-longpre/a-pretrainers-guide
coreweave/kubernetes-cloud
Getting Started with the CoreWeave Kubernetes GPU Cloud
sunyt32/torchscale
Transformers at any scale
iwiwi/epochraft
Checkpointable dataset utilities for foundation model training
Gnurro/FinetuneReFormatter
Tools with GUI for GPT finetune data preparation
OpenMachine-ai/transformer-tricks
A collection of tricks to speed up LLMs
samikama/CPCargo
A simple package to upload DL checkpoints to remote storage
JetBrains-Research/code-summarization-dataset
coreweave/gutenberg-epub
haileyschoelkopf/megablocks