Pinned Repositories
LLM-Drop
The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
PiPPy
Pipeline Parallelism for PyTorch
Lightgbm_analysis
mgtwr
sunkun1997
Config files for my GitHub profile.
sunkun1997's Repositories
sunkun1997/mgtwr
sunkun1997/Lightgbm_analysis
sunkun1997/sunkun1997
Config files for my GitHub profile.