Pinned Repositories
ai-on-gke
apimachinery
application
Application metadata descriptor CRD
common
Common APIs and libraries shared by other Kubeflow operator repositories.
container-engine-accelerators
Collection of tools and examples for managing Accelerated workloads in Kubernetes Engine
cos-gpu-installer
Scripts to build and use a container to install GPU drivers on Container-Optimized OS images
evaluation
maxtext
A simple, performant and scalable Jax LLM!
ray-on-gke
ray-on-gke-old
richardsliu's Repositories
richardsliu/ray-on-gke
richardsliu/ray-on-gke-old
richardsliu/maxtext
A simple, performant and scalable Jax LLM!
richardsliu/ai-on-gke
richardsliu/application
Application metadata descriptor CRD
richardsliu/common
Common APIs and libraries shared by other Kubeflow operator repositories.
richardsliu/container-engine-accelerators
Collection of tools and examples for managing Accelerated workloads in Kubernetes Engine
richardsliu/evaluation
richardsliu/examples
A repository to host extended examples and tutorials
richardsliu/gatekeeper
Gatekeeper - Policy Controller for Kubernetes
richardsliu/JetStream
JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).
richardsliu/jetstream-pytorch
PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference"
richardsliu/katib
Repository for hyperparameter tuning
richardsliu/kfctl
kfctl is a CLI for deploying and managing Kubeflow
richardsliu/kubeflow
Machine Learning Toolkit for Kubernetes
richardsliu/kubeflow-distribution
Blueprints for Deploying Kubeflow on Google Cloud Platform and Anthos
richardsliu/kuberay
A toolkit to run Ray applications on Kubernetes
richardsliu/manifests
A repository for Kustomize manifests
richardsliu/ml-auto-solutions
A simplified and automated orchestration workflow to perform ML end-to-end (E2E) model tests and benchmarking on Cloud VMs across different frameworks.
richardsliu/nccl-tests
NCCL Tests
richardsliu/optimum-tpu
Google TPU optimizations for transformers models
richardsliu/pipelines
Machine Learning Pipelines for Kubeflow
richardsliu/pytorch-operator
PyTorch on Kubernetes
richardsliu/ray
An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
richardsliu/ray-tpu
richardsliu/test-infra
Test infrastructure for the Kubernetes project.
richardsliu/testing
Test infrastructure and tooling for Kubeflow.
richardsliu/tf-operator
Tools for ML/Tensorflow on Kubernetes.
richardsliu/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
richardsliu/website
Kubeflow's public website