longxudou
LLM Researcher @sail-sg. Maintainer of ⚓️ Sailor | 🔱 Sailor2 | 🚢 SailCraft | 🧭 SailCompass
Research Scientist @ Sea AI Lab · Harbin
longxudou's Stars
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
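For orientation, a minimal offline-inference sketch with vLLM's Python API; the model name is an arbitrary small example, not a recommendation:

```python
from vllm import LLM, SamplingParams

# Load any HuggingFace-compatible model; "facebook/opt-125m" is just a small example.
llm = LLM(model="facebook/opt-125m")
params = SamplingParams(temperature=0.8, max_tokens=64)

outputs = llm.generate(["The capital of France is"], params)
for out in outputs:
    print(out.outputs[0].text)
```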
triton-lang/triton
Development repository for the Triton language and compiler
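The flavor of the language, via the canonical vector-add tutorial kernel (block size chosen arbitrarily here):

```python
import torch
import triton
import triton.language as tl

@triton.jit
def add_kernel(x_ptr, y_ptr, out_ptr, n_elements, BLOCK_SIZE: tl.constexpr):
    # Each program instance handles one BLOCK_SIZE-wide slice of the vectors.
    pid = tl.program_id(axis=0)
    offsets = pid * BLOCK_SIZE + tl.arange(0, BLOCK_SIZE)
    mask = offsets < n_elements
    x = tl.load(x_ptr + offsets, mask=mask)
    y = tl.load(y_ptr + offsets, mask=mask)
    tl.store(out_ptr + offsets, x + y, mask=mask)

x = torch.rand(4096, device="cuda")
y = torch.rand(4096, device="cuda")
out = torch.empty_like(x)
grid = (triton.cdiv(x.numel(), 1024),)
add_kernel[grid](x, y, out, x.numel(), BLOCK_SIZE=1024)
```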
MathFoundationRL/Book-Mathematical-Foundation-of-Reinforcement-Learning
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
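The book builds up to algorithms such as value iteration; a self-contained toy sketch of the Bellman optimality update V ← max_a (R + γ P V), on a made-up two-state MDP:

```python
import numpy as np

# Toy 2-state, 2-action MDP: P[s, a, s'] transition probabilities, R[s, a] rewards.
# The numbers are invented purely to illustrate the update.
P = np.array([[[0.9, 0.1], [0.2, 0.8]],
              [[0.0, 1.0], [0.5, 0.5]]])
R = np.array([[1.0, 0.0],
              [0.0, 2.0]])
gamma = 0.9

V = np.zeros(2)
for _ in range(1000):
    # Q[s, a] = R[s, a] + gamma * sum_s' P[s, a, s'] * V[s']
    Q = R + gamma * (P @ V)
    V_new = Q.max(axis=1)
    if np.max(np.abs(V_new - V)) < 1e-8:
        break
    V = V_new
```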
arcee-ai/mergekit
Tools for merging pretrained large language models.
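A sketch of a linear merge, assuming mergekit's documented config schema (merge_method / models / parameters.weight) and its `mergekit-yaml` CLI entry point; the model names are hypothetical placeholders:

```python
import subprocess
import yaml

# Equal-weight linear merge of two hypothetical models.
config = {
    "merge_method": "linear",
    "dtype": "float16",
    "models": [
        {"model": "org/model-a", "parameters": {"weight": 0.5}},
        {"model": "org/model-b", "parameters": {"weight": 0.5}},
    ],
}
with open("merge.yml", "w") as f:
    yaml.safe_dump(config, f)

# Run the merge from the config file.
subprocess.run(["mergekit-yaml", "merge.yml", "./merged-model"], check=True)
```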
kvcache-ai/Mooncake
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
togethercomputer/MoA
Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models
NVIDIA/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
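A minimal FP8 sketch following TransformerEngine's quickstart pattern (requires a Hopper/Ada GPU; layer sizes are illustrative):

```python
import torch
import transformer_engine.pytorch as te
from transformer_engine.common import recipe

# Drop-in te.Linear layer; fp8_autocast switches its matmuls to FP8.
fp8_recipe = recipe.DelayedScaling(margin=0, fp8_format=recipe.Format.E4M3)
layer = te.Linear(768, 768, bias=True).cuda()

x = torch.randn(16, 768, device="cuda")
with te.fp8_autocast(enabled=True, fp8_recipe=fp8_recipe):
    y = layer(x)
```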
mit-han-lab/once-for-all
[ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment
huggingface/lighteval
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
RLHFlow/RLHF-Reward-Modeling
Recipes to train reward models for RLHF.
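The standard pairwise Bradley-Terry objective that such recipes build on (a generic sketch, not this repo's exact code); the scores would come from a scalar-head LM scoring chosen/rejected responses:

```python
import torch
import torch.nn.functional as F

def bradley_terry_loss(r_chosen: torch.Tensor, r_rejected: torch.Tensor) -> torch.Tensor:
    # Maximize log sigmoid(r_chosen - r_rejected) over preference pairs.
    return -F.logsigmoid(r_chosen - r_rejected).mean()

loss = bradley_terry_loss(torch.randn(8), torch.randn(8))
```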
horseee/LLM-Pruner
[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.
NVIDIA/NeMo-Curator
Scalable data pre-processing and curation toolkit for LLMs
locuslab/wanda
A simple and effective LLM pruning approach.
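The core of the approach fits in a few lines: score each weight by |W_ij| · ‖X_j‖₂ (weight magnitude times the input activation norm from calibration data) and drop the lowest-scoring weights per output row. A minimal sketch with made-up sizes:

```python
import torch

def wanda_prune(weight: torch.Tensor, act_norm: torch.Tensor, sparsity: float) -> torch.Tensor:
    # Wanda score: |W_ij| * ||X_j||_2, compared within each output row.
    score = weight.abs() * act_norm.unsqueeze(0)            # (out, in)
    k = int(weight.shape[1] * sparsity)
    # Zero out the k lowest-scoring weights in each row.
    idx = torch.topk(score, k, dim=1, largest=False).indices
    pruned = weight.clone()
    pruned.scatter_(1, idx, 0.0)
    return pruned

W = torch.randn(512, 1024)
act_norm = torch.rand(1024)   # per-input-feature activation L2 norms from calibration data
W_sparse = wanda_prune(W, act_norm, sparsity=0.5)
```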
allenai/OLMoE
OLMoE: Open Mixture-of-Experts Language Models
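The building block behind MoE models like this: a top-k routed expert layer. A generic sketch (sizes and routing details are illustrative, not OLMoE's actual configuration):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TopKMoE(nn.Module):
    def __init__(self, dim=256, n_experts=8, k=2):
        super().__init__()
        self.k = k
        self.router = nn.Linear(dim, n_experts, bias=False)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(dim, 4 * dim), nn.GELU(), nn.Linear(4 * dim, dim))
            for _ in range(n_experts)
        )

    def forward(self, x):                                   # x: (tokens, dim)
        gates = F.softmax(self.router(x), dim=-1)
        top_w, top_i = gates.topk(self.k, dim=-1)
        top_w = top_w / top_w.sum(dim=-1, keepdim=True)     # renormalize over chosen experts
        out = torch.zeros_like(x)
        # Dispatch each token only to its k selected experts.
        for slot in range(self.k):
            for e, expert in enumerate(self.experts):
                mask = top_i[:, slot] == e
                if mask.any():
                    out[mask] += top_w[mask, slot, None] * expert(x[mask])
        return out

y = TopKMoE()(torch.randn(10, 256))
```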
princeton-nlp/LLM-Shearing
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
TencentARC/LLaMA-Pro
[ACL 2024] Progressive LLaMA with Block Expansion.
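The trick behind block expansion: duplicate a transformer block and zero-initialize its output projections, so with residual connections the new block starts as an identity map and can be trained without disturbing the original model. A sketch assuming Llama-style module names (`o_proj`, `down_proj`):

```python
import copy
import torch.nn as nn

def expand_block(block: nn.Module, out_proj_names=("o_proj", "down_proj")) -> nn.Module:
    # Copy the block and zero its output projections so its residual branch
    # initially contributes nothing. `out_proj_names` assumes Llama-style
    # naming; adjust for other architectures.
    new_block = copy.deepcopy(block)
    for name, module in new_block.named_modules():
        if name.split(".")[-1] in out_proj_names and isinstance(module, nn.Linear):
            nn.init.zeros_(module.weight)
            if module.bias is not None:
                nn.init.zeros_(module.bias)
    return new_block
```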
NVIDIA/NeMo-Framework-Launcher
Provides end-to-end model development pipelines for LLMs and Multimodal models that can be launched on-prem or cloud-native.
NVlabs/Minitron
A family of compressed models obtained via pruning and knowledge distillation
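The distillation half of a prune-then-distill pipeline typically uses standard logit distillation; a generic sketch (not Minitron's exact recipe):

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    # KL(teacher || student) on temperature-softened distributions,
    # scaled by T^2 to keep gradient magnitudes comparable.
    s = F.log_softmax(student_logits / temperature, dim=-1)
    t = F.softmax(teacher_logits / temperature, dim=-1)
    return F.kl_div(s, t, reduction="batchmean") * temperature ** 2

loss = distillation_loss(torch.randn(4, 32000), torch.randn(4, 32000))
```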
arcee-ai/EvolKit
EvolKit is a framework for automatically increasing the complexity of instructions used to fine-tune large language models (LLMs).
Outsider565/LoRA-GA
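No description is provided; LoRA-GA (Low-Rank Adaptation with Gradient Approximation) studies better initialization for LoRA adapters. For reference, a minimal vanilla LoRA linear layer (standard zero-init baseline, not LoRA-GA's gradient-aligned initialization):

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    # Vanilla LoRA: y = Wx + (alpha / r) * B(Ax), with the base W frozen.
    def __init__(self, base: nn.Linear, r=8, alpha=16):
        super().__init__()
        self.base = base
        self.base.weight.requires_grad_(False)
        self.A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(base.out_features, r))  # zero-init: no change at start
        self.scale = alpha / r

    def forward(self, x):
        return self.base(x) + self.scale * (x @ self.A.T @ self.B.T)

layer = LoRALinear(nn.Linear(64, 64))
y = layer(torch.randn(2, 64))
```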
sail-sg/regmix
[ICLR 2025] 🧬 RegMix: Data Mixture as Regression for Language Model Pre-training (Spotlight)
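A toy illustration of the RegMix idea: fit a regressor from data-mixture weights (observed on small proxy runs) to validation loss, then search for the mixture the regressor predicts to be best. The paper's actual pipeline differs in regressor choice and scale; the numbers below are random stand-ins:

```python
import numpy as np
from sklearn.linear_model import LinearRegression

rng = np.random.default_rng(0)
# 64 proxy runs over 4 data domains, with a synthetic loss surface.
mixtures = rng.dirichlet(np.ones(4), size=64)
losses = mixtures @ np.array([2.0, 1.5, 3.0, 2.5]) + rng.normal(0, 0.01, 64)

reg = LinearRegression().fit(mixtures, losses)

# Simulate many candidate mixtures and pick the predicted-best one.
candidates = rng.dirichlet(np.ones(4), size=100_000)
best = candidates[reg.predict(candidates).argmin()]
print("predicted-best mixture:", best.round(3))
```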
pprp/Pruner-Zero
Pruner-Zero: Evolving Symbolic Pruning Metric from scratch for LLMs
sail-sg/scaling-with-vocab
[NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623
Qichuzyy/POA
Official implementation of ECCV24 paper: POA
zirui-HIT/DAC
zhxlia/FLEXTAF
zirui-HIT/Fused
zirui-HIT/Encore
OpenDFM/EST