Pinned Repositories
LLM-Pruner
[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Supports Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.
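The idea behind structural pruning is to remove whole units (rows, heads, channels) rather than individual weights, so the pruned model is genuinely smaller and faster. A toy sketch of the general technique, assuming a simple L2-norm importance score over output rows of a weight matrix (this is an illustration only, not LLM-Pruner's actual dependency-aware algorithm):

```python
# Toy structural pruning: drop whole output rows of a weight matrix
# whose L2 norm is smallest, shrinking the layer's output width.
# Hypothetical example, not LLM-Pruner's real importance criterion.

def prune_rows(weight, keep_ratio):
    """Keep the keep_ratio fraction of rows with the largest L2 norm."""
    norms = [sum(x * x for x in row) ** 0.5 for row in weight]
    n_keep = max(1, int(len(weight) * keep_ratio))
    # Indices of the surviving rows, restored to their original order.
    keep = sorted(sorted(range(len(weight)), key=lambda i: -norms[i])[:n_keep])
    return [weight[i] for i in keep]

W = [[0.1, 0.0], [2.0, 1.0], [0.0, 0.2], [1.5, -1.0]]
pruned = prune_rows(W, 0.5)  # keeps the 2 highest-norm rows
```

Because entire rows are removed, the downstream layer's input dimension must shrink to match; handling those cross-layer dependencies is the hard part that LLM-Pruner addresses.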
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
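DeepSpeed is driven by a JSON configuration file passed at initialization. A minimal sketch, assuming the commonly documented keys (batch size, FP16, and ZeRO stage); the exact values here are placeholders, not recommendations:

```json
{
  "train_batch_size": 32,
  "fp16": { "enabled": true },
  "zero_optimization": { "stage": 2 },
  "optimizer": {
    "type": "Adam",
    "params": { "lr": 1e-4 }
  }
}
```

This file is typically supplied via the `--deepspeed_config` flag or the `config` argument to `deepspeed.initialize`.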
MAP-NEO
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
MINT-1T
MINT-1T: a one-trillion-token multimodal interleaved dataset.
L-hongbin's Repositories
L-hongbin/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
L-hongbin/MAP-NEO