Pinned Repositories
operator
Kubernetes operator for Bagua distributed training jobs.
flash-attention
Fast and memory-efficient exact attention
api
The canonical location of the Kubernetes API definition.
apimachinery
AresOperator
AutoCode
common
Common APIs and libraries shared by other Kubeflow operator repositories.
kubernetes
Production-Grade Container Scheduling and Management
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
volcano
A Cloud Native Batch System (Project under CNCF)
eggiter's Repositories
eggiter/api
The canonical location of the Kubernetes API definition.
eggiter/apimachinery
eggiter/AresOperator
eggiter/AutoCode
eggiter/CDNMF-Dynamic
eggiter/coursera-deep-learning-specialization
Notes, programming assignments and quizzes from all courses within the Coursera Deep Learning specialization offered by deeplearning.ai: (i) Neural Networks and Deep Learning; (ii) Improving Deep Neural Networks: Hyperparameter tuning, Regularization and Optimization; (iii) Structuring Machine Learning Projects; (iv) Convolutional Neural Networks; (v) Sequence Models
eggiter/bril
An educational compiler intermediate representation
eggiter/Deep-Learning-Specialization
eggiter/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
eggiter/FlagAI
FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale models.
eggiter/flash-attention
Fast and memory-efficient exact attention
eggiter/kubernetes
Production-Grade Container Scheduling and Management
eggiter/Megatron-LM
Ongoing research training transformer models at scale
eggiter/Module-0
Module 0 - Fundamentals
eggiter/operator
Kubernetes operator for Bagua distributed training jobs.
eggiter/vimrc
eggiter/volcano
A Kubernetes Native Batch System (Project under CNCF)
eggiter/website
Koordinator documentation and website.