Pinned Repositories
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
lm-evaluation-harness
A framework for few-shot evaluation of language models.
AutoGPTQ
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
swift
ms-swift: Use PEFT or full-parameter training to fine-tune 250+ LLMs or 35+ MLLMs (Qwen2, GLM4, Internlm2, Yi, Llama3, Llava, Deepseek, Baichuan2...).
fsdp_qlora
Training LLMs with QLoRA + FSDP
functionary
Chat language model that can use tools and interpret the results
OmniQuant
[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.
hqq
Official implementation of Half-Quadratic Quantization (HQQ)
QuaRot
Code for QuaRot, an end-to-end 4-bit inference scheme for large language models.
EETQ
Easy and Efficient Quantization for Transformers
mxjmtxrm's Repositories
mxjmtxrm doesn’t have any repositories yet.