mxjmtxrm

Pinned Repositories

fsdp_qlora
Training LLMs with QLoRA + FSDP
Language:Jupyter Notebook1.5k 23 39190
AutoGPTQ
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
Language:Python4.8k 30 481511
lm-evaluation-harness
A framework for few-shot evaluation of language models.
Language:Python8.3k 39 1.3k2.2k
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python141k 1.1k 16.9k28.3k
functionary
Chat language model that can use tools and interpret the results
Language:Python1.5k 21 131117
ms-swift
Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL2, Phi3.5-Vision, GOT-OCR2, ...).
Language:Python6.4k 33 1.9k542
Auto-Round
Language:Python10
transformers
Language:Python00
Megatron-LM
Ongoing research training transformer models at scale
Language:Python11.8k 167 9062.7k
TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
Language:Python2.3k 35 420382

mxjmtxrm's Repositories

mxjmtxrm/Auto-Round
Language:Python10
mxjmtxrm/transformers
Language:Python00
mxjmtxrm/pic