kubernetes-sigs/gcp-compute-persistent-disk-csi-driver

PVs are provisioned for StorageClasses with unavailable zones

jsafrane opened this issue · 11 comments

There is a slight behavioral difference between the in-tree GCP PD volume plugin and the migrated (CSI) one:

  • The in-tree volume plugin refuses to provision a volume in a GCP zone that does not contain a node instance.
  • The CSI driver provisions a new volume in this case. It does not check (cannot check?) whether the requested zone has a node in it, so the provisioned PV is unusable in the cluster (see the sketch below).
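
For reference, the same behavior can be reproduced directly against the CSI driver (without migration) by pinning the zone in a StorageClass. The driver name pd.csi.storage.gke.io and the topology key topology.gke.io/zone below are the standard ones for this driver, but take the snippet as an illustrative sketch of the report above, not a tested manifest:

    apiVersion: storage.k8s.io/v1
    kind: StorageClass
    metadata:
      name: sc-pinned-zone
    provisioner: pd.csi.storage.gke.io
    parameters:
      type: pd-standard
      replication-type: none
    volumeBindingMode: Immediate
    allowedTopologies:
    - matchLabelExpressions:
      - key: topology.gke.io/zone
        values:
        - europe-west1-d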

Steps to reproduce:

  1. The StorageClass explicitly requests europe-west1-d:

    allowVolumeExpansion: true
    apiVersion: storage.k8s.io/v1
    kind: StorageClass
    metadata:
      annotations:
        storageclass.kubernetes.io/is-default-class: "false"
      name: sc-4fap4
    parameters:
      replication-type: none
      type: pd-standard
      zone: europe-west1-d
    provisioner: kubernetes.io/gce-pd
    reclaimPolicy: Delete
    volumeBindingMode: Immediate
    
  2. All nodes are in us-east1-{b-d}:

    $ kubectl get node -o yaml | grep topology.kubernetes.io/zone
          topology.kubernetes.io/zone: us-east1-b
          topology.kubernetes.io/zone: us-east1-c
          topology.kubernetes.io/zone: us-east1-d
          topology.kubernetes.io/zone: us-east1-b
          topology.kubernetes.io/zone: us-east1-c
          topology.kubernetes.io/zone: us-east1-d
    
  3. The user creates a PVC requesting the StorageClass from step 1:

    apiVersion: v1
    kind: PersistentVolumeClaim
    metadata:
      name: mypvc
    spec:
      accessModes:
      - ReadWriteOnce
      resources:
        requests:
          storage: 1Gi
      storageClassName: sc-4fap4
      volumeMode: Filesystem
    

With the in-tree volume plugin, I get an error event:

Warning ProvisioningFailed Failed to provision volume with StorageClass "sc-4fap4": kubernetes does not have a node in zone "europe-west1-d"

With CSI migration enabled, a PV is provisioned in europe-west1-d, and that PV cannot be attached to any node in the cluster.
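
For illustration only (not output captured from this cluster), the provisioned PV carries node affinity for the requested zone, which is why no node in us-east1 can satisfy it. The relevant part of the PV spec would look roughly like this, assuming the driver's usual topology.gke.io/zone key:

    nodeAffinity:
      required:
        nodeSelectorTerms:
        - matchExpressions:
          - key: topology.gke.io/zone
            operator: In
            values:
            - europe-west1-d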

It would probably be hard to fix this correctly, since the CSI driver does not know which nodes are in the cluster. IMO it would be enough to document this behavior.
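
Until it is documented, a StorageClass-level workaround seems worth mentioning (my suggestion, not something the driver enforces): use volumeBindingMode: WaitForFirstConsumer so the zone is taken from the node the pod actually lands on, and/or list only zones that really contain nodes in allowedTopologies instead of pinning an arbitrary zone. A minimal sketch, assuming the cluster from step 2:

    apiVersion: storage.k8s.io/v1
    kind: StorageClass
    metadata:
      name: sc-existing-zones
    provisioner: kubernetes.io/gce-pd
    parameters:
      type: pd-standard
      replication-type: none
    reclaimPolicy: Delete
    volumeBindingMode: WaitForFirstConsumer
    allowedTopologies:
    - matchLabelExpressions:
      - key: topology.kubernetes.io/zone
        values:
        - us-east1-b
        - us-east1-c
        - us-east1-d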

#1005 (comment)

Yeah, that sounds reasonable.

Looks like our readme needs some love generally.

/assign @mattcary

The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue or PR as fresh with /remove-lifecycle stale
  • Mark this issue or PR as rotten with /lifecycle rotten
  • Close this issue or PR with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue or PR as fresh with /remove-lifecycle rotten
  • Close this issue or PR with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle rotten

/remove-lifecycle rotten


/lifecycle stale

/remove-lifecycle stale

The Kubernetes project currently lacks enough contributors to adequately respond to all issues.

This bot triages un-triaged issues according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue as fresh with /remove-lifecycle stale
  • Close this issue with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues.

This bot triages un-triaged issues according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue as fresh with /remove-lifecycle rotten
  • Close this issue with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle rotten

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Reopen this issue with /reopen
  • Mark this issue as fresh with /remove-lifecycle rotten
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/close not-planned

@k8s-triage-robot: Closing this issue, marking it as "Not Planned".

