Pinned Repositories
pytorch-llama
LLaMA 2 implemented from scratch in PyTorch
optimum
🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization tools
Olive
Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs.
first-my-project
nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
llama.onnx
LLaMa/RWKV onnx models, quantization and testcase