Pinned Repositories
Pai-Megatron-Patch
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
aliyun_project
incubator-dolphinscheduler
Dolphin Scheduler is a distributed and easy-to-expand visual DAG workflow scheduling system, dedicated to solving the complex dependencies in data processing, making the scheduling system out of the box for data processing.(分布式易扩展的可视化工作流任务调度)
Megatron-LM
Ongoing research training transformer models at scale
kubernetes
Production-Grade Container Scheduling and Management
k8s-rdma-shared-dev-plugin
Megatron-LM
Ongoing research training transformer models at scale
Eisenhower's Repositories
Eisenhower/aliyun_project
Eisenhower/incubator-dolphinscheduler
Dolphin Scheduler is a distributed and easy-to-expand visual DAG workflow scheduling system, dedicated to solving the complex dependencies in data processing, making the scheduling system out of the box for data processing.(分布式易扩展的可视化工作流任务调度)
Eisenhower/Megatron-LM
Ongoing research training transformer models at scale