Pinned Repositories
AgentBaker
Agent Baker is aiming to provide a centralized, portable k8s agent node provisioning lib as well as rich support on different OS image with optimized k8s binaries.
aibrix
Cost-efficient and pluggable Infrastructure components for GenAI inference
aks-gpu
Setup and configure nodes to support GPUs on k8s, with a focus on AKS nodes. This repo contains steps to build a container image with the Nvidia driver, and dependencies for integration.
aks-rdma-infiniband
kaito
Kubernetes AI Toolchain Operator
cluster-api
Home for Cluster API, a subproject of sig-cluster-lifecycle
gateway-api-inference-extension
Gateway API Inference Extension
kubernetes
Production-Grade Container Scheduling and Management
test-infra
Test infrastructure for the Kubernetes project.
gatekeeper
🐊 Policy Controller for Kubernetes
chewong's Repositories
chewong/kubernetes
Production-Grade Container Scheduling and Management
chewong/AgentBaker
Agent Baker is aiming to provide a centralized, portable k8s agent node provisioning lib as well as rich support on different OS image with optimized k8s binaries.
chewong/aibrix
Cost-efficient and pluggable Infrastructure components for GenAI inference
chewong/aks-gpu
Setup and configure nodes to support GPUs on k8s, with a focus on AKS nodes. This repo contains steps to build a container image with the Nvidia driver, and dependencies for integration.
chewong/aks-rdma-infiniband
chewong/cert-controller
chewong/gatekeeper
Gatekeeper - Policy Controller for Kubernetes
chewong/gitdm
📜Fork for tracking CNCF projects
chewong/kaito
Kubernetes AI Toolchain Operator
chewong/kuberay
A toolkit to run Ray applications on Kubernetes
chewong/container-upstream
This project captures work in progress, and completed work for the Azure Core Container Upstream team
chewong/foundation
☁️♮🏛 This repo contains several documents related to the operation of the CNCF. File non-technical issues related to CNCF here.
chewong/gateway-api-inference-extension
Gateway API Inference Extension
chewong/guidellm
Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs
chewong/jobset
JobSet: a k8s native API for distributed ML training and HPC workloads
chewong/kaito-cookbook
Examples and guides for using the Kaito API
chewong/llm-d-deployer
Helm charts for llm-d
chewong/llm-d-modelservice
helm charts for deploying models with llm-d
chewong/lws
LeaderWorkerSet: An API for deploying a group of pods as a unit of replication
chewong/test-infra
Test infrastructure for the Kubernetes project.
chewong/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs