OpenShift AI

Introduction

Red Hat OpenShift AI is a platform for data scientists and developers of artificial intelligence (AI) applications. It provides a fully supported environment that lets you rapidly develop, train, test, and deploy machine learning models on-premises and/or in the public cloud.

OpenShift AI integrates the following components and services:

At the service layer:

OpenShift AI dashboard: A customer-facing dashboard that shows available and installed applications for the OpenShift AI environment as well as learning resources such as tutorials, quick starts, and documentation. Administrative users can access functionality to manage users, clusters, notebook images, accelerator profiles, and model-serving runtimes. Data scientists can use the dashboard to create projects to organize their data science work.
Model serving: Data scientists can deploy trained machine-learning models to serve intelligent applications in production. After deployment, applications can send requests to the model using its deployed API endpoint.
Data science pipelines: Data scientists can build portable machine learning (ML) workflows with data science pipelines, using Docker containers. This enables your data scientists to automate workflows as they develop their data science models.
Jupyter (self-managed): A self-managed application that allows data scientists to configure their own notebook server environment and develop machine learning models in JupyterLab.
Distributed workloads: Data scientists can use multiple nodes in parallel to train machine-learning models or process data more quickly. This approach significantly reduces the task completion time, and enables the use of larger datasets and more complex models.

At the management layer:

The Red Hat OpenShift AI Operator: A meta-operator that deploys and maintains all components and sub-operators that are part of OpenShift AI.
Monitoring services: Prometheus gathers metrics from OpenShift AI for monitoring purposes.

The OpenShift AI Operator creates the following namespaces:

redhat-ods-operator contains the Red Hat OpenShift AI Operator.
redhat-ods-applications installs the dashboard and other required components of OpenShift AI.
redhat-ods-monitoring contains services for monitoring.
rhods-notebooks is where notebook environments are deployed by default.
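
As a quick sanity check, the namespaces listed above can be probed with `oc`. This is a minimal sketch, assuming `oc` is logged in with permission to read namespaces; the function name `check_rhoai_namespaces` is made up for illustration and is not part of this repository.

```shell
# Report which of the OpenShift AI namespaces exist in the current cluster.
# Assumption: `oc` is already logged in and may read namespaces.
check_rhoai_namespaces() {
  for ns in redhat-ods-operator redhat-ods-applications redhat-ods-monitoring rhods-notebooks; do
    if oc get namespace "$ns" >/dev/null 2>&1; then
      echo "found:   $ns"
    else
      echo "missing: $ns"
    fi
  done
}

# Usage: check_rhoai_namespaces
```

A namespace reported as missing right after installation may simply not have been created yet, so it can be worth re-running the check a few minutes later.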

Requirements

OCP 4.14+
Identity Provider, Storage and Internet Access
Red Hat OpenShift Pipelines Operator
KServe Dependencies
- Red Hat OpenShift Serverless
- Red Hat OpenShift Service Mesh
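
Some of these requirements can be checked from the command line before installing. The sketch below prints the cluster version and any OLM subscriptions for the prerequisite operators. The package names in the `grep` pattern (`openshift-pipelines-operator-rh`, `serverless-operator`, `servicemeshoperator`) are assumptions about how the operators were subscribed and may differ in your catalog; `show_rhoai_prereqs` is a hypothetical helper name.

```shell
# Print the OCP version and the subscriptions of the prerequisite operators.
# Assumption: the operators were installed through OLM under the package
# names matched below; adjust the pattern if your cluster uses other names.
show_rhoai_prereqs() {
  echo "OCP version: $(oc get clusterversion version -o jsonpath='{.status.desired.version}')"
  oc get subscriptions.operators.coreos.com --all-namespaces --no-headers \
    | grep -E 'openshift-pipelines-operator-rh|serverless-operator|servicemeshoperator' \
    || echo "no matching operator subscriptions found"
}

# Usage: show_rhoai_prereqs
```

The identity provider, storage, and internet access requirements are not covered by this check and still need to be verified separately.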

Setting Up OpenShift AI

The following steps deploy OpenShift AI on an OpenShift cluster from scratch:

Create Users and define OpenShift AI Admins

sh scripts/setup_lab_multi.sh

# This script creates user01, user02, user03 and user04, and adds user01 and user02 to the rhods-admins group
NOTE: Authentication is based on htpasswd, and each user's password is the same as the username
NOTE: By default, the Data Science user group is system:authenticated (all authenticated users).
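
To confirm that the script produced the expected admin group, its members can be listed with `oc`. This is a small sketch, assuming the group is named `rhods-admins` as described above; `list_rhoai_admins` is a hypothetical helper name.

```shell
# Print the members of the rhods-admins group, one user per line.
# Assumption: the setup script has already created the group.
list_rhoai_admins() {
  oc get group rhods-admins -o jsonpath='{range .users[*]}{@}{"\n"}{end}'
}

# Usage: list_rhoai_admins
```

With the setup described above, the output should list user01 and user02.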

Install Operators

oc apply -f files/operator-openshiftai.yaml
oc apply -f files/operator-kserver-prerequisites.yaml

Install the DataScienceCluster

oc apply -f files/datasciencecluster.yaml
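
The components take a few minutes to roll out after the DataScienceCluster is applied. One way to block until it is ready is `oc wait` with a JSONPath condition. This sketch assumes the resource reports readiness as `status.phase: Ready`, which may vary between operator versions; `wait_for_dsc` is a hypothetical helper name.

```shell
# Wait until every DataScienceCluster reports phase Ready.
# Assumption: readiness is exposed as .status.phase == "Ready".
# The optional first argument overrides the default timeout.
wait_for_dsc() {
  timeout="${1:-600s}"
  oc wait datasciencecluster --all \
    --for=jsonpath='{.status.phase}'=Ready \
    --timeout="$timeout"
}

# Usage: wait_for_dsc 900s
```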

Check Installation via script

sh scripts/check.sh

Check the installation via the dashboard with user01 (admin) and user03 (Data Science user)

oc get routes rhods-dashboard -n redhat-ods-applications -o jsonpath='{.spec.host}'

OpenShift AI & Operator Objects

DataScienceCluster: Installs the Red Hat OpenShift AI components.
- Dashboard -> OpenShift AI Dashboard + RBAC (The dashboard can be customized to meet different requirements; see the official documentation.)
OdhDashboardConfig: Configures the global settings of the OpenShift AI dashboard.
OdhApplication: Makes installed applications accessible to OpenShift AI users.
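
These objects can be inspected directly with `oc`. The sketch below assumes the dashboard objects live in the `redhat-ods-applications` namespace, which matches the namespace description earlier in this document but is not guaranteed for every installation; `show_rhoai_objects` is a hypothetical helper name.

```shell
# List the operator-level objects described above.
# Assumption: OdhDashboardConfig and OdhApplication objects are stored in
# the redhat-ods-applications namespace.
show_rhoai_objects() {
  oc get datasciencecluster
  oc get odhdashboardconfig -n redhat-ods-applications
  oc get odhapplication -n redhat-ods-applications
}

# Usage: show_rhoai_objects
```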

Data Science Projects

Organize your work in projects and workbenches, create and collaborate on notebooks, train and deploy models, configure model servers, and implement pipelines.

Notebooks

It is possible to create a blank notebook or import a notebook from a number of different sources.

Others

Data Science Projects (projects.project.openshift.io) -> A Data Science Project is synonymous with an OpenShift Project or a Namespace. See the users section for more information on how to create and manage Data Science Projects.
Workbenches (notebooks.kubeflow.org) -> A workbench is a development environment running in an OpenShift pod that uses the Kubeflow Notebook Controller. Depending on the workbench image, workbenches can run a number of web-based editors, including JupyterLab, Visual Studio Code, and RStudio.
Models (inferenceservices.serving.kserve.io) -> A model requires a data connection and the location of the model file in the S3 bucket.
Model Servers (servingruntimes.serving.kserve.io) -> Each model is associated with a specific model server, which hosts the model and is used to create endpoints. A single model server can serve multiple models from a single instance.
Data Connections (secret) -> A data connection is an OpenShift secret that stores the values required to connect to an S3 bucket.
Data Science Pipeline Applications (DSPA) (datasciencepipelinesapplications.datasciencepipelinesapplications.opendatahub.io) -> A DSPA creates an instance of a Data Science Pipeline and requires a data connection and an S3 bucket to create the instance. A DSPA is namespace-scoped to prevent leaking data across multiple projects.
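
Because each of these dashboard concepts maps to a Kubernetes resource, a project can be inspected from the command line using the resource names given above. This is a minimal sketch; `list_project_objects` is a hypothetical helper name, and data connections are left out because they are ordinary secrets that would be mixed with every other secret in the namespace.

```shell
# List workbenches, models, model servers and pipeline applications
# in one Data Science Project (namespace).
# $1: name of the project to inspect.
list_project_objects() {
  project="$1"
  oc get notebooks.kubeflow.org,inferenceservices.serving.kserve.io,servingruntimes.serving.kserve.io,datasciencepipelinesapplications.datasciencepipelinesapplications.opendatahub.io \
    -n "$project"
}

# Usage: list_project_objects my-project
```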

Integration Components

KServe provides a Kubernetes Custom Resource Definition for serving predictive and generative machine learning (ML) models. It aims to solve production model serving use cases by providing high-level abstraction interfaces for TensorFlow, XGBoost, scikit-learn, PyTorch, and Hugging Face Transformer/LLM models using standardized data plane protocols.
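
To make the custom resource concrete, the sketch below renders a bare-bones KServe `InferenceService` manifest from three arguments. It is illustrative only: the helper name `render_inferenceservice` and the example values are hypothetical, and models deployed through the OpenShift AI dashboard typically carry additional fields (such as a serving runtime and a data connection reference) that are omitted here.

```shell
# Render a minimal KServe InferenceService manifest on stdout.
# $1: model name, $2: model format (for example sklearn), $3: storage URI.
# Assumption: storage credentials and the serving runtime are configured
# separately; this manifest does not set them.
render_inferenceservice() {
  name="$1"
  format="$2"
  storage_uri="$3"
  cat <<EOF
apiVersion: serving.kserve.io/v1beta1
kind: InferenceService
metadata:
  name: $name
spec:
  predictor:
    model:
      modelFormat:
        name: $format
      storageUri: $storage_uri
EOF
}

# Usage (hypothetical names):
#   render_inferenceservice demo-model sklearn s3://my-bucket/models/demo \
#     | oc apply -n my-project -f -
```

Rendering to stdout first makes it easy to review the manifest before piping it to `oc apply`.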

Author

Asier Cidon @Red Hat

acidonper/ocp-openshiftAI