Pinned Repositories
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
alpa
Training and serving large-scale neural networks with auto parallelization.
cruise
Cruise: A Distributed Machine Learning Framework with Automatic System Configuration
FastFlow
FastFlow is a system that automatically detects CPU bottlenecks in deep learning training pipelines and resolves them by offloading the data pipeline to remote resources.
harmony
Harmony: A new scheduling framework that executes multiple Parameter-Server (PS) Machine Learning (ML) training jobs efficiently to improve cluster resource utilization.
incubator-nemo
Apache Nemo (Incubating) - Data Processing System for Flexible Employment With Different Deployment Characteristics
wooyeonlee0's Repositories
wooyeonlee0/alpa
Training and serving large-scale neural networks with auto parallelization.
wooyeonlee0/cruise
Cruise: A Distributed Machine Learning Framework with Automatic System Configuration
wooyeonlee0/FastFlow
FastFlow is a system that automatically detects CPU bottlenecks in deep learning training pipelines and resolves them by offloading the data pipeline to remote resources.
wooyeonlee0/harmony
Harmony: A new scheduling framework that executes multiple Parameter-Server (PS) Machine Learning (ML) training jobs efficiently to improve cluster resource utilization.
wooyeonlee0/incubator-nemo
Apache Nemo (Incubating) - Data Processing System for Flexible Employment With Different Deployment Characteristics
wooyeonlee0/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs