Pinned Repositories
BELLE
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
BELLE
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
dice_loss_for_NLP
The repo contains the code of the ACL2020 paper `Dice Loss for Data-imbalanced NLP Tasks`
gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
vision
Datasets, Transforms and Models specific to Computer Vision
Megatron-LM
Ongoing research training transformer models at scale
Life-0-1's Repositories
Life-0-1/BELLE
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
Life-0-1/dice_loss_for_NLP
The repo contains the code of the ACL2020 paper `Dice Loss for Data-imbalanced NLP Tasks`
Life-0-1/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
Life-0-1/vision
Datasets, Transforms and Models specific to Computer Vision