Pinned Repositories
codeflare
Simplifying the definition and execution, scaling and deployment of pipelines on the cloud.
codeflare-cli
codeflare-operator
Operator for installation and lifecycle management of CodeFlare distributed workload stack
codeflare-sdk
An intuitive, easy-to-use python interface for batch resource requesting, access, job submission, and observation. Simplifying the developer's life while enabling access to high-performance compute resources, either in the cloud or on-prem.
codeflare-transfer-learning
instascale
On-demand Kubernetes/OpenShift cluster scaling and aggregated resource provisioning
instaslice
InstaSlice facilitates the use of Dynamic Resource Allocation (DRA) on Kubernetes clusters for GPU sharing
multi-cluster-app-dispatcher
Holistic job manager on Kubernetes
rayvens
Rayvens makes it possible for data scientists to access hundreds of data services within Ray with little effort.
zero-copy-model-loading
In-depth code associated with my Medium blog post, "How to Load PyTorch Models 340 Times Faster with Ray"
project-codeflare's Repositories
project-codeflare/codeflare
Simplifying the definition and execution, scaling and deployment of pipelines on the cloud.
project-codeflare/multi-cluster-app-dispatcher
Holistic job manager on Kubernetes
project-codeflare/rayvens
Rayvens makes it possible for data scientists to access hundreds of data services within Ray with little effort.
project-codeflare/zero-copy-model-loading
In-depth code associated with my Medium blog post, "How to Load PyTorch Models 340 Times Faster with Ray"
project-codeflare/codeflare-transfer-learning
project-codeflare/codeflare-sdk
An intuitive, easy-to-use python interface for batch resource requesting, access, job submission, and observation. Simplifying the developer's life while enabling access to high-performance compute resources, either in the cloud or on-prem.
project-codeflare/instaslice
InstaSlice facilitates the use of Dynamic Resource Allocation (DRA) on Kubernetes clusters for GPU sharing
project-codeflare/codeflare-cli
project-codeflare/instascale
On-demand Kubernetes/OpenShift cluster scaling and aggregated resource provisioning
project-codeflare/mcad
MCAD v2
project-codeflare/codeflare-operator
Operator for installation and lifecycle management of CodeFlare distributed workload stack
project-codeflare/appwrapper
AppWrapper controller for Kueue
project-codeflare/data-integration
Object Storage data processing for Ray framework
project-codeflare/ibm-vpc-ray-connector
Enables Ray to use IBM Gen2 backend
project-codeflare/mcad-dashboard
Dashboard for MCAD
project-codeflare/project-codeflare-landing
project-codeflare/mlbatch
Queuing and quota management for AI/ML batch jobs on Kubernetes
project-codeflare/serverless-distributed-dl-training
project-codeflare/adr
Home for CodeFlare Architecture Design Records (ADR)
project-codeflare/.github
project-codeflare/codeflare-cli-1
project-codeflare/codeflare-common
Common packages for use with CodeFlare Distributed Workload stack.
project-codeflare/community-operators-prod
community-operators metadata backing OpenShift OperatorHub
project-codeflare/demo-images
project-codeflare/ibm-ray-config
project-codeflare/notebooks
Notebook images for ODH
project-codeflare/pypi-cache
Simple implementation of PyPI cache server for offline use
project-codeflare/Ray-SLURM-autoscaler
project-codeflare/ray_lightning
Pytorch Lightning Distributed Accelerators using Ray
project-codeflare/torchx
TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and support for E2E production ML pipelines when you're ready.