cilium-operator is missing RBAC permission to remove `node.cilium.io/agent-not-ready` taint #15464

maximumG · 2023-06-02T16:27:01Z

/kind bug

1. What kops version are you running? The command `kops version` will display
this information.

1.26

2. What Kubernetes version are you running? kubectl version will print the
version if a cluster is running or provide the Kubernetes version specified as
a kops flag.

1.25.9

3. What cloud provider are you using?

AWS

4. What commands did you run? What is the simplest way to reproduce this issue?

Add the node.cilium.io/agent-not-ready taint to a kops instance group.
Enable Cilium debug logging in the kops config.
Check the cilium-operator logs, which report that it cannot patch node resources:

level=debug msg="Removing Node Taint" nodeName=i-0de0ebddfb1f1a317 subsys=watchers taint=node.cilium.io/agent-not-ready
level=debug msg="Controller func execution time: 1.045493ms" name=mark-k8s-node-i-0de0ebddfb1f1a317-as-available subsys=controller uuid=0a29a389-734c-4430-9ed2-5311fc45cd4a
level=debug msg="Controller run failed" consecutiveErrors=8 error="nodes \"i-0de0ebddfb1f1a317\" is forbidden: User \"system:serviceaccount:kube-system:cilium-operator\" cannot patch resource \"nodes\" in API group \"\" at the cluster scope" name=mark-k8s-node-i-0de0ebddfb1f1a317-as-available subsys=controller uuid=0a29a389-734c-4430-9ed2-5311fc45cd4a
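For illustration, the first reproduction step might look like the InstanceGroup fragment below. This is a sketch under assumptions: the instance group name `nodes-example` is hypothetical, and the taint value and effect (`true:NoExecute`) follow the form suggested in the Cilium documentation rather than anything stated in this report.

```yaml
# Hypothetical kops InstanceGroup fragment; only the fields relevant to the taint are shown.
apiVersion: kops.k8s.io/v1alpha2
kind: InstanceGroup
metadata:
  name: nodes-example   # assumed name, not taken from the report
spec:
  role: Node
  taints:
  # Assumed value and effect; the report names only the taint key.
  - node.cilium.io/agent-not-ready=true:NoExecute
```

With a taint like this in place, new nodes stay unschedulable for ordinary pods until something with permission to patch the node removes it.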

5. What happened after the commands executed?

No pods can be scheduled on the nodes anymore, because the Cilium taint cannot be removed.

6. What did you expect to happen?

The cilium-operator pods should be allowed to patch nodes so that they can remove the Cilium taint.

9. Anything else we need to know?

We added this taint as recommended by the Cilium installation guide, to prevent pods from being scheduled before the CNI is working.

It seems that the ClusterRole definition for the cilium-operator ServiceAccount does not match the one in the official Cilium Helm chart. In the kops template, the rule for nodes grants only list and watch:

kops/upup/models/cloudup/resources/addons/networking.cilium.io/k8s-1.16-v1.12.yaml.template

Lines 419 to 579 in 1bef619

    
      name: cilium-operator
    rules:
    - apiGroups:
      - ""
      resources:
      - pods
      verbs:
      - get
      - list
      - watch
    - apiGroups:
      - discovery.k8s.io
      resources:
      - endpointslices
      verbs:
      - get
      - list
      - watch
    - apiGroups:
      - ""
      resources:
      - nodes
      verbs:
      - list
      - watch
    - apiGroups:
      - ""
      resources:
      # to perform the translation of a CNP that contains `ToGroup` to its endpoints
      - services
      - endpoints
      # to check apiserver connectivity
      - namespaces
      verbs:
      - get
      - list
      - watch
    - apiGroups:
      - cilium.io
      resources:
      - ciliumnetworkpolicies
      - ciliumclusterwidenetworkpolicies
      verbs:
      # Create auto-generated CNPs and CCNPs from Policies that have 'toGroups'
      - create
      - update
      - deletecollection
      # To update the status of the CNPs and CCNPs
      - patch
      - get
      - list
      - watch
    - apiGroups:
      - cilium.io
      resources:
      - ciliumnetworkpolicies/status
      - ciliumclusterwidenetworkpolicies/status
      verbs:
      # Update the auto-generated CNPs and CCNPs status.
      - patch
      - update
    - apiGroups:
      - cilium.io
      resources:
      - ciliumendpoints
      - ciliumidentities
      verbs:
      # To perform garbage collection of such resources
      - delete
      - list
      - watch
    - apiGroups:
      - cilium.io
      resources:
      - ciliumidentities
      verbs:
      # To synchronize garbage collection of such resources
      - update
    - apiGroups:
      - cilium.io
      resources:
      - ciliumnodes
      verbs:
      - create
      - update
      - get
      - list
      - watch
      # To perform CiliumNode garbage collector
      - delete
    - apiGroups:
      - cilium.io
      resources:
      - ciliumnodes/status
      verbs:
      - update
    - apiGroups:
      - cilium.io
      resources:
      - ciliumendpointslices
      - ciliumenvoyconfigs
      verbs:
      - create
      - update
      - get
      - list
      - watch
      - delete
      - patch
    - apiGroups:
      - apiextensions.k8s.io
      resources:
      - customresourcedefinitions
      verbs:
      - create
      - get
      - list
      - watch
    - apiGroups:
      - apiextensions.k8s.io
      resources:
      - customresourcedefinitions
      verbs:
      - update
      resourceNames:
      - ciliumloadbalancerippools.cilium.io
      - ciliumbgppeeringpolicies.cilium.io
      - ciliumclusterwideenvoyconfigs.cilium.io
      - ciliumclusterwidenetworkpolicies.cilium.io
      - ciliumegressgatewaypolicies.cilium.io
      - ciliumegressnatpolicies.cilium.io
      - ciliumendpoints.cilium.io
      - ciliumendpointslices.cilium.io
      - ciliumenvoyconfigs.cilium.io
      - ciliumexternalworkloads.cilium.io
      - ciliumidentities.cilium.io
      - ciliumlocalredirectpolicies.cilium.io
      - ciliumnetworkpolicies.cilium.io
      - ciliumnodes.cilium.io
    - apiGroups:
      - cilium.io
      resources:
      - ciliumloadbalancerippools
      verbs:
      - get
      - list
      - watch
    - apiGroups:
      - cilium.io
      resources:
      - ciliumloadbalancerippools/status
      verbs:
      - patch
    - apiGroups:
      - coordination.k8s.io
      resources:
      - leases
      verbs:
      - create
      - get
      - update
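
As a sketch of what appears to be missing, a rule along the following lines would grant the permission named in the error message (patch on nodes in the core API group). This is an illustration derived from the log output above, not a copy of the Helm chart's rule; the exact upstream wording, and whether further resources such as node status subresources are also needed, should be checked against the chart for the Cilium version in use.

```yaml
# Illustrative addition to the cilium-operator ClusterRole rules.
# Derived from the "cannot patch resource nodes" error; not verified against the upstream chart.
- apiGroups:
  - ""
  resources:
  # Needed so the operator can remove the node.cilium.io/agent-not-ready taint
  - nodes
  verbs:
  - patch
```

Keeping this as a separate rule, rather than adding patch to the existing list/watch rule for nodes, would leave the read-only rule unchanged and make the purpose of the write permission explicit.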


k8s-triage-robot · 2024-01-21T19:26:58Z

The Kubernetes project currently lacks enough contributors to adequately respond to all issues.

This bot triages un-triaged issues according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue as fresh with /remove-lifecycle stale
Close this issue with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

k8s-triage-robot · 2024-02-20T19:31:37Z

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues.

This bot triages un-triaged issues according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue as fresh with /remove-lifecycle rotten
Close this issue with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle rotten

k8s-triage-robot · 2024-03-21T20:20:50Z

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Reopen this issue with /reopen
Mark this issue as fresh with /remove-lifecycle rotten
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/close not-planned

k8s-ci-robot · 2024-03-21T20:20:55Z

@k8s-triage-robot: Closing this issue, marking it as "Not Planned".


k8s-ci-robot added the kind/bug Categorizes issue or PR as related to a bug. label Jun 2, 2023

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jan 21, 2024

k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Feb 20, 2024

k8s-ci-robot closed this as not planned Won't fix, can't repro, duplicate, stale Mar 21, 2024
