Pinned Repositories
FlexGen
Running large language models on a single GPU for throughput-oriented scenarios.
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Megatron-LM
Ongoing research training transformer models at scale
calculon
yutian-mt's Repositories
yutian-mt/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
yutian-mt/FlexGen
Running large language models on a single GPU for throughput-oriented scenarios.