/ocp4-upi-powervm

OCP4 on PowerVM

Primary LanguageHCLApache License 2.0Apache-2.0

Table of Contents

Introduction

This repo contains Terraform templates to help deployment of OpenShift Container Platform (OCP) 4.x on PowerVM LPARs. This assumes PowerVC is used as the IaaS layer for managing PowerVM LPARs.

If you are using standalone PowerVM please take a look at the following ansible playbook to setup helper node (bastion) for OCP deployment.

This project also leverages the same ansible playbook internally for OCP deployment.

Run this code from either Mac or Linux (Intel) system.

This automation is intended for test/development purposes only and there is no formal support. For issues please open a GitHub issue

Pre-requisites

  • Git: Please refer to the following link for instructions on installing git for Linux and Mac.
  • Terraform: You'll need to use version 0.12.20. Please refer to the following link for instructions on installing terraform for Linux and Mac.

Image and LPAR requirements

You'll need to create RedHat CoreOS (RHCOS) and RHEL 8.1 image in PowerVC. For RHCOS image creation, follow the steps mentioned here. Following are the recommended LPAR configs for OpenShift nodes

  • Bootstrap, Master - 2 vCPUs, 16GB RAM, 120 GB Disk.

    PowerVM LPARs by default uses SMT=8. So with 2vCPUs, the number of logical CPUs as seen by the Operating System will be 16 (2 vCPUs x 8 SMT)

    This config is suitable for majority of the scenarios

  • Worker - 2 vCPUs, 16GB RAM, 120 GB Disk

    Increase worker vCPUs, RAM and Disk based on application requirements

OCP Install

Follow these steps to kickstart OCP installation on PowerVM

Setup automation

On your Terraform client machine:

  1. git clone https://github.com/ocp-power-automation/ocp4-upi-powervm.git
  2. cd ocp4_upi_powervm

Setup required Terraform Variables

Update the var.tfvars file with values specific to your environment. Following is a brief description of the variables.

  • auth_url : Endpoint URL used to connect to PowerVC.
  • user_name : PowerVC login username.
  • password : PowerVC login password.
  • tenant_name : The Name of the Tenant (Identity v2) or Project (Identity v3) to login with.
  • domain_name : The Name of the Domain to scope to.
  • openstack_availability_zone : The availability zone in which to create the servers.
  • network_name : Name of the network to use for deploying all the hosts.
  • network_type (Optional) : Type of the network adapter for cluster hosts. You can set the value as "SRIOV", any other value will use "SEA". More info here.
  • scg_id (Optional) : ID of the PowerVC Storage Connectivity Group (SCG) to use for all nodes. Empty value will use the default SCG. Deployments might fail if you don't provide this value when having more than one default SCG configured on PowerVC.
  • rhel_username : The user that we should use for the connection to the bastion host.
  • keypair_name : Optional value for keypair used. Default is <cluster_id>-keypair.
  • public_key_file : A pregenerated OpenSSH-formatted public key file. Default path is 'data/id_rsa.pub'.
  • private_key_file : Corresponding private key file. Default path is 'data/id_rsa'.
  • private_key : The contents of an SSH key to use for the connection. Ignored if public_key_file is provided.
  • public_key : The contents of corresponding key to use for the connection. Ignored if public_key_file is provided.
  • rhel_subscription_username : The username required for RHEL subcription on bastion host.
  • rhel_subscription_password : The password required for RHEL subcription on bastion host.
  • bastion : Map of below parameters for bastion host.
    • instance_type : The name of the desired flavor.
    • image_id : The image ID of the RHEL 8.1 image.
  • bootstrap : Map of below parameters for bootstrap host.
    • instance_type : The name of the desired flavor.
    • image_id : The image ID of the RHCOS image.
    • count : Always set the value to 1 before starting the deployment. When the deployment is completed successfully set to 0 to delete the bootstrap node.
  • master : Map of below parameters for master hosts.
    • instance_type : The name of the desired flavor.
    • image_id : The image ID of the desired RHCOS image.
    • count : Number of master nodes.
  • worker : Map of below parameters for worker hosts. (Atleaset 2 Workers are required for running router pods in HA mode)
    • instance_type : The name of the desired flavor.
    • image_id : The image ID of the desired RHCOS image.
    • count : Number of worker nodes.
  • openshift_install_tarball : HTTP URL for OpenShift install tarball.
  • openshift_client_tarball : HTTP URL for OpenShift client (oc) tarball.
  • release_image_override : This is set to OPENSHIFT_INSTALL_RELEASE_IMAGE_OVERRIDE while creating ignition files.
  • installer_log_level : enable log level for OpenShift install (e.g. "debug | info | warn | error") (default "info")
  • ansible_extra_options : Ansible options to append to the ansible-playbook commands. Default is set to "-v".
  • helpernode_tag : ocp4-helpernode ansible playbook version to checkout.
  • install_playbook_tag : ocp4-playbooks ansible playbooks version to checkout.
  • pull_secret_file : Location of the OCP pull-secret file to be used.
  • cluster_domain : Cluster domain name. <cluster_id>.<cluster_domain> forms the fully qualified domain name.
  • cluster_id_prefix : Cluster identifier. Should not be more than 8 characters. Nodes are pre-fixed with this value, please keep it unique.
  • dns_forwarders : External DNS servers to forward DNS queries that cannot resolve locally. Eg: "8.8.8.8; 9.9.9.9".
  • storage_type : Storage provisioner to configure. Supported values: nfs (For now only nfs provisioner is supported, any other value won't setup a storageclass)
  • volume_size : If storage_type is nfs, a volume will be created with given size in GB and attached to bastion node. Eg: 1000 for 1TB disk.
  • volume_storage_template : Storage template name or ID for creating the volume. Empty value will use default template.

Setup required data files

You need to have following files in data/ directory before running the Terraform templates.

$ ls data/
id_rsa  id_rsa.pub  pull-secret.txt
  • id_rsa & id_rsa.pub : The key pair used for accessing the hosts. These files are not required if you provide public_key_file and private_key_file.
  • pull-secret.txt : File containing keys required to pull images on the cluster. You can download it from RH portal after login https://cloud.redhat.com/openshift/install/pull-secret.

Kickstart Install

Run the following from within the ocp4-upi-powervmdirectory:

  1. terraform init
  2. terraform apply -var-file var.tfvars

Now wait for the installation to complete. It may take around 40 mins to complete provisioning.

IMPORTANT: When using NFS storage, the OpenShift image registry will be using NFS PV claim. Otherwise the image registry uses ephemeral PV.

IMPORTANT: Once the deployment is completed successfully, you can safely delete the bootstrap node.

Post Install Steps

Delete Bootstrap node

  1. Change the count value to 0 in bootstrap map variable and re-run the apply command. Eg: bootstrap = {instance_type = "medium", image_id = "468863e6-4b33-4e8b-b2c5-c9ef9e6eedf4", "count" = 0}
  2. Run command terraform apply -var-file var.tfvars

Create API and Ingress DNS Records

You will also need to add the following records to your DNS server:

api.<cluster name>.<cluster domain>.  IN  A  <Bastion IP>
*.apps.<cluster name>.<cluster domain>.  IN  A  <Bastion IP>

If you're unable to create and publish these DNS records, you can add them to your /etc/hosts file.

<Bastion IP> api.<cluster name>.<cluster domain>
<Bastion IP> console-openshift-console.apps.<cluster name>.<cluster domain>
<Bastion IP> integrated-oauth-server-openshift-authentication.apps.<cluster name>.<cluster domain>
<Bastion IP> oauth-openshift.apps.<cluster name>.<cluster domain>
<Bastion IP> prometheus-k8s-openshift-monitoring.apps.<cluster name>.<cluster domain>
<Bastion IP> grafana-openshift-monitoring.apps.<cluster name>.<cluster domain>
<Bastion IP> <app name>.apps.<cluster name>.<cluster domain>

Note: For convenience, entries specific to your cluster will be printed at the end of a successful run. Just copy and paste value of output variable etc_hosts_entries to your hosts file.

OCP Login Credentials

The OCP login credentials are in bastion host. In order to retrieve the same follow these steps:

  1. ssh -i data/id_rsa <rhel_username>@<bastion_ip>
  2. cd ~/openstack-upi/auth
  3. kubeconfig can be used for CLI (oc or kubectl)
  4. kubeadmin user and content of kubeadmin-pasword as password for GUI

Clean up

Run terraform destroy -var-file var.tfvars to make sure that all resources are properly cleaned up. Do not manually clean up your environment unless both of the following are true:

  1. You know what you are doing
  2. Something went wrong with an automated deletion.