purejomo

Pinned Repositories

pytorch-llama
LLaMA 2 implemented from scratch in PyTorch
Language: Python 307 7 1357
optimum
🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization tools
Language: Python 2.8k 54 798512
Olive
Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs.
Language: Python 1.8k 33 239195
first-my-project
0 1 00
nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Language: Python 24 1 9019
llama.onnx
LLaMa/RWKV onnx models, quantization and testcase
Language: Python 359 13 2031