kubernetes/examples

fsGroup securityContext does not apply to nfs mount

kmarokas opened this issue · 65 comments

The example https://github.com/kubernetes/examples/tree/master/staging/volumes/nfs works fine if the container using the NFS mount runs as root. If I use a securityContext to run as a non-root user, I have no write access to the mounted volume.

How to reproduce:
Here is the nfs-busybox-rc.yaml with a securityContext added:

# This mounts the nfs volume claim into /mnt and continuously
# overwrites /mnt/index.html with the time and hostname of the pod.

apiVersion: v1
kind: ReplicationController
metadata:
  name: nfs-busybox
spec:
  replicas: 2
  selector:
    name: nfs-busybox
  template:
    metadata:
      labels:
        name: nfs-busybox
    spec:
      securityContext:
        runAsUser: 10000
        fsGroup: 10000
      containers:
      - image: busybox
        command:
          - sh
          - -c
          - 'while true; do date > /mnt/index.html; hostname >> /mnt/index.html; sleep $(($RANDOM % 5 + 5)); done'
        imagePullPolicy: IfNotPresent
        name: busybox
        securityContext:
          runAsUser: 10000
        volumeMounts:
          # name must match the volume name below
          - name: nfs
            mountPath: "/mnt"
      volumes:
      - name: nfs
        persistentVolumeClaim:
          claimName: nfs

Actual result:

kubectl exec nfs-busybox-2w9bp -t -- id
uid=10000 gid=0(root) groups=10000

kubectl exec nfs-busybox-2w9bp -t -- ls -l /
total 48
<..>
drwxr-xr-x    3 root     root          4096 Aug  2 12:27 mnt

Expected result:
the group ownership of the /mnt folder should be group 10000 (matching the fsGroup)

Mount options other than rw are not allowed on the NFS PV:

apiVersion: v1
kind: PersistentVolume
metadata:
  name: nfs
spec:
  capacity:
    storage: 5Gi
  accessModes:
    - ReadWriteMany
  nfs:
    # FIXME: use the right IP
    server: 10.23.137.115
    path: "/"
  mountOptions:
#    - rw              # allowed
#    - root_squash     # error during pod scheduling: mount.nfs: an incorrect mount option was specified
#    - all_squash      # error during pod scheduling: mount.nfs: an incorrect mount option was specified
#    - anonuid=10000   # error during pod scheduling: mount.nfs: an incorrect mount option was specified
#    - anongid=10000   # error during pod scheduling: mount.nfs: an incorrect mount option was specified
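
Note that root_squash, all_squash, anonuid and anongid are server-side export options, not client-side mount options, which is why mount.nfs rejects them. They belong in the export definition on the NFS server instead; a sketch of /etc/exports, assuming the server exports / as in the PV above (IDs are illustrative):

/  *(rw,all_squash,anonuid=10000,anongid=10000)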
kubectl version
Client Version: version.Info{Major:"1", Minor:"10", GitVersion:"v1.10.3", GitCommit:"2bba0127d85d5a46ab4b778548be28623b32d0b0", GitTreeState:"clean", BuildDate:"2018-05-21T09:17:39Z", GoVersion:"go1.9.3", Compiler:"gc", Platform:"windows/amd64"}
Server Version: version.Info{Major:"1", Minor:"10+", GitVersion:"v1.10.3-rancher1", GitCommit:"f6320ca7027d8244abb6216fbdb73a2b3eb2f4f9", GitTreeState:"clean", BuildDate:"2018-05-29T22:28:56Z", GoVersion:"go1.9.3", Compiler:"gc", Platform:"linux/amd64"}

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle rotten

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/close

@fejta-bot: Closing this issue.

In response to this:

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Why did this get closed with no resolution? I have this same issue. If there is a better solution than an init container, please someone fill me in.

Yeah... I'm having the same issue with NFS too. securityContext.fsGroup seems to have no effect on NFS volume mounts, so you kinda have to use the initContainer approach :(

I'm having the same problem.

Same issue: able to write but not able to read from the NFS-mounted volume. Kubernetes reports the mount as successful, but no luck.

/reopen

@varun-da: You can't reopen an issue/PR unless you authored it or you are a collaborator.

In response to this:

/reopen

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

/reopen

@kmarokas: Reopened this issue.

In response to this:

/reopen

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

thanks @kmarokas!

/remove-lifecycle rotten

Would love for this to be addressed! In the meantime, here's how we're dealing with it...

In this example there are two pods that mount an AWS EFS volume via NFS. To enable a non-root user to write, we make the mount point accessible via an initContainer.

---
apiVersion: v1
kind: Pod
metadata:
  name: alpine-efs-1
  labels:
    name: alpine
spec:
  volumes:
  - name: nfs-test
    nfs:
      server: fs-xxxxxxxx.efs.us-east-1.amazonaws.com
      path: /
  securityContext:
    fsGroup: 100
    runAsGroup: 100
    runAsUser: 405
  initContainers:
    - name: nfs-fixer
      image: alpine
      securityContext:
        runAsUser: 0
      volumeMounts:
      - name: nfs-test
        mountPath: /nfs
      command:
      - sh
      - -c
      - (chmod 0775 /nfs; chgrp 100 /nfs)
  containers:
  - name: alpine
    image: alpine
    volumeMounts:
      - name: nfs-test
        mountPath: /nfs
    command:
      - tail
      - -f
      - /dev/null
---
apiVersion: v1
kind: Pod
metadata:
  name: alpine-efs-2
  labels:
    name: alpine
spec:
  volumes:
  - name: nfs-test
    nfs:
      server: fs-xxxxxxxx.efs.us-east-1.amazonaws.com
      path: /
  securityContext:
    supplementalGroups:
      - 100
    fsGroup: 100
    # runAsGroup: 100
    runAsUser: 405
  initContainers:
    - name: nfs-fixer
      image: alpine
      securityContext:
        runAsUser: 0
      volumeMounts:
      - name: nfs-test
        mountPath: /nfs
      command:
      - sh
      - -c
      - (chmod 0775 /nfs; chgrp 100 /nfs)
  containers:
  - name: alpine
    image: alpine
    volumeMounts:
      - name: nfs-test
        mountPath: /nfs
    command:
      - tail
      - -f
      - /dev/null

The same seems to be true for cifs mounts created through a custom volume driver: juliohm1978/kubernetes-cifs-volumedriver#8

Edit: it looks like Kubernetes does very little magic when mounting volumes. The individual volume drivers have to respect the fsGroup configuration set in the pod, and the NFS provider doesn't do that as of now.

Is https://github.com/kubernetes-incubator/external-storage/tree/master/nfs-client the place where this could be fixed?
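
For what it's worth, for CSI-based volumes whether kubelet applies fsGroup at all is advertised by the driver itself, via the fsGroupPolicy field of its CSIDriver object. A sketch of what a driver that opts in would register, using the NFS CSI driver's name as an example (check the actual driver's manifests for what it really sets):

apiVersion: storage.k8s.io/v1
kind: CSIDriver
metadata:
  name: nfs.csi.k8s.io
spec:
  # "File" tells kubelet to always apply the pod's fsGroup to the volume's
  # contents; other values are "None" and "ReadWriteOnceWithFSType" (the default).
  fsGroupPolicy: File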

No solution after around one and a half years? Can't believe it.


Maybe this issue needs to be taken to another repository. Is https://github.com/kubernetes-incubator/external-storage the right place for it?

https://kubernetes.io/docs/tasks/configure-pod-container/security-context/

fsGroupChangePolicy: "Always"

Refer to the link above. But it seems the feature is only available from Kubernetes 1.18 onward, if I'm not mistaken.

fsGroupChangePolicy: "Always"

The docs are not totally clear about this, but I understand that this is already the default behaviour.

By default, Kubernetes recursively changes ownership and permissions for the contents of each volume to match the fsGroup specified in a Pod's securityContext when that volume is mounted.

The section also indicates that not every volume type necessarily supports changing permissions:

This field only applies to volume types that support fsGroup controlled ownership and permissions.
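
For volume types that do support it, the policy only controls when the recursive chown happens. A minimal sketch of a pod-level securityContext using it (note this still has no effect on plain NFS mounts, since the NFS plugin ignores fsGroup entirely):

  securityContext:
    runAsUser: 10000
    fsGroup: 10000
    # "OnRootMismatch" skips the recursive walk when the volume root already
    # has the expected owner and permissions; "Always" is the default.
    fsGroupChangePolicy: "OnRootMismatch"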

+1

The same issue for AWS EBS gp2 volumes

+1

I just ran into this issue today as well. Is there any workaround yet besides using an initContainer?

+1 - facing this issue too!

+1 - facing this issue

  • Block storage (e.g. iSCSI, Ceph RBD, ...): use fsGroup to control access
  • Shared storage (e.g. NFS, GlusterFS): use supplementalGroups instead

Give me a like if I saved your day.
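
For example, if the export on the NFS server side is already group-owned by GID 3000, a sketch like this (IDs are illustrative) lets a non-root pod write without any chown; supplementalGroups just adds the GID to the container processes and, unlike fsGroup, never tries to change ownership, so the server-side ownership has to already be correct:

  securityContext:
    runAsUser: 2000
    supplementalGroups:
      - 3000   # must match the group that owns the export on the server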


Has anyone been able to do this without an init container? Really hoping to avoid that if possible.

I would rather avoid init containers; in fact, I don't like any kind of scripts in the k8s manifests!
So if you don't want to use an init container, you can do something like this:

e.g. for NFS volumes (this assumes you have control over the NFS server):
On the NFS server, have something like this in /etc/exports:
/srv/vol1 *(rw,sync,all_squash,insecure,no_subtree_check,anonuid=2000,anongid=3000)
Then create /srv/vol1 to match the above:

sudo chown -R 2000:3000 /srv/vol1
sudo chmod -R 775 /srv/vol1

In the pod, use a securityContext to match the above:

  securityContext:
    runAsUser: 2000
    runAsGroup: 3000   # matches anongid on the export
    fsGroup: 3000
    fsGroupChangePolicy: Always

I like this better than allowing init containers to run as root (a PodSecurityPolicy may prevent that as well).
This is also a better way of dishing out volumes with a well-known uid:gid that anyone can predictably use.

e.g. you can also use the same technique with hostPath-based volumes.
On a k8s host, have a dir, e.g. /data/vol1, with matching permissions:

sudo chown -R 2000:3000 /data/vol1
sudo chmod -R 775 /data/vol1
ls -l /data/
total 4
drwxrwxr-x 2 2000 3000 4096 Jul  3 01:51 vol1

Alternatively, if you want to use managed persistent volumes like AWS/GCP/portworxVolume etc., it will depend on whether they support fsGroup.

I disabled all sudo privileges for pod users for security reasons.
So I can't configure the permissions of the mount point, because Kubernetes won't let me,
and I can't chown/chmod the mount point, because my pod user can't sudo.
How do I solve this problem?



+1 - facing this issue


Hit this issue too. For a POC I just attached my share to a VM and manually chowned it, but for prod that's probably not OK.

Just recently set up a cluster on Linode and I can't believe it; this feels incomplete. I know this has nothing to do with Linode, but I just want to add some context. The primary way to mount a PVC on Linode is to buy their volumes, which require a minimum of 10 GB and are limited to 8 per Linode. I thought I was smart when I found the Rook NFS workaround. Everything would have been perfect, except none of my databases could be provisioned because I kept getting a permission denied error. Looking deeper into it, I came across this issue. Because I am using a Postgres operator (I tried Kubegres and PGO), there doesn't seem to be a way to specify an init container. This means that every time I provision a database (or a replica), I need to shell into my Linode, find the PVC and manually change the permissions. I really appreciate the work the community has done on Kubernetes, and the fact that it is FOSS, but this really seems like an enormous issue that is being completely ignored.

True, this is something that should be fixed 👍

Looks like it is working for me when specifying all of runAsUser, runAsGroup and fsGroup (version 1.24.1).



Can anyone else confirm what @ramihoudroge said, that 1.24.1 works?

I've also found this thread, https://devops.stackexchange.com/questions/13939/how-to-allow-a-non-root-user-to-write-to-a-mounted-efs-in-eks, which mentions EFS access points.
Has anyone had success with this?
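
An EFS access point can enforce a POSIX owner (uid/gid) and an owned root directory on the server side, so nothing needs to chown inside the pod. A sketch of a static PV using the EFS CSI driver, with placeholder IDs (I haven't verified this myself, so treat it as a starting point):

apiVersion: v1
kind: PersistentVolume
metadata:
  name: efs-pv
spec:
  capacity:
    storage: 5Gi
  accessModes:
    - ReadWriteMany
  csi:
    driver: efs.csi.aws.com
    # "<filesystem id>::<access point id>"; both IDs are placeholders
    volumeHandle: fs-12345678::fsap-0123456789abcdef0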


Also having this issue with permission denied, with a MongoDB container NFS-mounting an EFS volume in AWS.
Using EKS 1.24 with AWS EFS.
https://stackoverflow.com/questions/75670387/error-executing-postinstallation-eperm-operation-not-permitted-utime-bitn


I ran into this exact issue with a static PV using the default NFS mount.
There are no NFS mount options that can change the permissions, and the securityContext.fsGroup setting is ignored without any output.
Unfortunately, the initContainer approach is not an option for me.
Can anything be done about this issue?

@yingding have you found any workaround?

@radirobi97 If you can use the initContainers approach from #260 (comment), it will work.
I still had this issue with a pod from an ML system which I do not have control over. Ultimately, I switched to object storage and gave up on the default NFS mount of a static PV.
But I think the dynamic NFS CSI driver should not have this static-PV issue.
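
For reference, with the CSI NFS driver (kubernetes-csi/csi-driver-nfs) volumes are provisioned dynamically from a StorageClass roughly like the sketch below (server and share are placeholders); whether fsGroup is then honored depends on the fsGroupPolicy the driver registers:

apiVersion: storage.k8s.io/v1
kind: StorageClass
metadata:
  name: nfs-csi
provisioner: nfs.csi.k8s.io
parameters:
  server: nfs-server.example.com   # placeholder NFS server
  share: /export                   # placeholder exported path
reclaimPolicy: Delete
mountOptions:
  - nfsvers=4.1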


The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Reopen this issue with /reopen
  • Mark this issue as fresh with /remove-lifecycle rotten
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/close not-planned

@k8s-triage-robot: Closing this issue, marking it as "Not Planned".

In response to this:

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Reopen this issue with /reopen
  • Mark this issue as fresh with /remove-lifecycle rotten
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/close not-planned

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

/remove-lifecycle rotten

/reopen

@rmunn: You can't reopen an issue/PR unless you authored it or you are a collaborator.

In response to this:

/reopen

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.