Recent changes to v1_node_condition.py causing ValueError for MCEHardwareErrors on Oracle OKE cloud platform #1733

koaps · 2022-03-01T23:30:07Z

Changes for 23.3.0: b227345

Includes a hard coded set of conditions which may not include all supported conditions for different cloud providers:

https://github.com/kubernetes-client/python/blob/master/kubernetes/client/models/v1_node_condition.py#L217

On Oracle's OKE cloud platform, it includes an additional condition: MCEHardwareErrors

Which is causing a ValueError when the 23.3.0 client is used to list nodes:

  File "/usr/local/lib/python3.6/site-packages/kubernetes/client/models/v1_node_condition.py", line 221, in type
    .format(type, allowed_values)
ValueError: Invalid value for `type` (MCEHardwareErrors), must be one of ['DiskPressure', 'MemoryPressure', 'NetworkUnavailable', 'PIDPressure',

Node conditions should be more dynamic based on the cloud provider used.

The text was updated successfully, but these errors were encountered:

julianseeger · 2022-03-02T11:55:19Z

Same for GCP/GKE:

.../lib/python3.9/site-packages/kubernetes/client/models/v1_pod_readiness_gate.py", line 78, in condition_type
    raise ValueError(
ValueError: Invalid value for `condition_type` (cloud.google.com/load-balancer-neg-ready), must be one of ['ContainersReady', 'Initialized', 'PodScheduled', 'Ready']

This is about pod conditions or pod readiness gates, but it looks to me like is has the same root cause as the node condition issue of @koaps

Edit: it happened first after upgrading from 21.7.0 to 23.3.0

chrislinan · 2022-03-02T13:42:46Z

I have the same issue :

File "/usr/lib/python3.6/site-packages/kubernetes/client/models/v1_node_condition.py", line 221, in type

[2022-03-02T09:48:00.771Z] E                 .format(type, allowed_values)

[2022-03-02T09:48:00.771Z] E             ValueError: Invalid value for `type` (FrequentDockerRestart), must be one of ['DiskPressure', 'MemoryPressure', 'NetworkUnavailable', 'PIDPressure', 'Ready']

kpulgam · 2022-03-03T00:48:33Z

Piling on here .. Getting similar error in AWS world in combination with AWS load balancer controller which injects readiness gate to pods .
Below is the error I see :

File "../lib/python3.9/site-packages/kubernetes/client/models/v1_pod_readiness_gate.py", line 52, in __init__
   self.condition_type = condition_type
 File "../lib/python3.9/site-packages/kubernetes/client/models/v1_pod_readiness_gate.py", line 78, in condition_type
   raise ValueError(
ValueError: Invalid value for `condition_type` (target-health.elbv2.k8s.aws/k8s-<name>-b77a905f9d), must be one of ['ContainersReady', 'Initialized', 'PodScheduled', 'Ready']

sybnex · 2022-03-18T10:33:28Z

Same for Azure/AKS

ValueError: Invalid value for type (ContainerRuntimeProblem), must be one of ['DiskPressure', 'MemoryPressure', 'NetworkUnavailable', 'PIDPressure', 'Ready']

roycaihw · 2022-03-28T16:44:02Z

This is being fixed in upstream. We will cut a new 1.23 client to backport the fix once the PR kubernetes/kubernetes#108740 is merged

iamkarlson · 2022-04-07T08:12:10Z

Is there any mitigation available while a permanent fix is being developed yet?

everpeace · 2022-04-11T03:49:24Z

Is there any mitigation available while a permanent fix is being developed yet?

We faced the issue too. We would like to know a workaround until a new version will be released.

k8s-triage-robot · 2022-07-10T03:51:01Z

The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue or PR as fresh with /remove-lifecycle stale
Mark this issue or PR as rotten with /lifecycle rotten
Close this issue or PR with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

iamkarlson · 2022-07-14T09:43:39Z

/remove-lifecycle stale

stan-sz · 2022-09-27T09:48:48Z

@roycaihw has this been backported?

SergeyKanzhelev · 2022-11-10T21:44:14Z

I see the change that was breaking it was reverted in #1789

k8s-triage-robot · 2023-02-08T22:40:42Z

The Kubernetes project currently lacks enough contributors to adequately respond to all issues.

This bot triages un-triaged issues according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue as fresh with /remove-lifecycle stale
Close this issue with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

iamkarlson · 2023-02-23T21:07:51Z

/remove-lifecycle stale

SergeyKanzhelev · 2023-02-23T21:11:05Z

/remove-lifecycle stale

Do you still see this issue? I didn't check the last release myself recently, but last time I checked, code was fixed.

k8s-triage-robot · 2023-05-24T21:46:10Z

The Kubernetes project currently lacks enough contributors to adequately respond to all issues.

This bot triages un-triaged issues according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue as fresh with /remove-lifecycle stale
Close this issue with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

k8s-triage-robot · 2023-06-23T22:34:58Z

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues.

This bot triages un-triaged issues according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue as fresh with /remove-lifecycle rotten
Close this issue with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle rotten

k8s-triage-robot · 2024-01-19T01:59:51Z

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Reopen this issue with /reopen
Mark this issue as fresh with /remove-lifecycle rotten
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/close not-planned

k8s-ci-robot · 2024-01-19T01:59:53Z

@k8s-triage-robot: Closing this issue, marking it as "Not Planned".

In response to this:

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues according to the following rules:

After 90d of inactivity, lifecycle/stale is applied

After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied

After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Reopen this issue with /reopen

Mark this issue as fresh with /remove-lifecycle rotten

Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/close not-planned

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

koaps added the kind/bug Categorizes issue or PR as related to a bug. label Mar 1, 2022

ApproximateIdentity mentioned this issue Mar 4, 2022

client.CoreV1Api().list_node() does not work #1735

Closed

iameskild mentioned this issue Mar 7, 2022

[BUG] - kubernetes-client latest version breaks 02-infrastructure nebari-dev/nebari#1147

Closed

yliaog mentioned this issue Mar 10, 2022

fixing listing nodes #1739

Closed

jiahuif mentioned this issue Mar 10, 2022

remove enum markers on types without validation kubernetes/kubernetes#108639

Merged

novakivskiy added a commit to montikids/kubernetes-client-python that referenced this issue Apr 20, 2022

fix problem from issue kubernetes-client#1733

726ebd9

orlandothoeny mentioned this issue Apr 21, 2022

Error when saving Node Sources on GCP rundeck-plugins/kubernetes#130

Open

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jul 10, 2022

k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jul 14, 2022

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Feb 8, 2023

k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Feb 23, 2023

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label May 24, 2023

k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Jun 23, 2023

k8s-ci-robot closed this as not planned Won't fix, can't repro, duplicate, stale Jan 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Recent changes to v1_node_condition.py causing ValueError for MCEHardwareErrors on Oracle OKE cloud platform #1733

Recent changes to v1_node_condition.py causing ValueError for MCEHardwareErrors on Oracle OKE cloud platform #1733

koaps commented Mar 1, 2022

julianseeger commented Mar 2, 2022 •

edited

chrislinan commented Mar 2, 2022

kpulgam commented Mar 3, 2022

sybnex commented Mar 18, 2022 •

edited

roycaihw commented Mar 28, 2022

iamkarlson commented Apr 7, 2022

everpeace commented Apr 11, 2022

k8s-triage-robot commented Jul 10, 2022

iamkarlson commented Jul 14, 2022

stan-sz commented Sep 27, 2022

SergeyKanzhelev commented Nov 10, 2022 •

edited

k8s-triage-robot commented Feb 8, 2023

iamkarlson commented Feb 23, 2023

SergeyKanzhelev commented Feb 23, 2023

k8s-triage-robot commented May 24, 2023

k8s-triage-robot commented Jun 23, 2023

k8s-triage-robot commented Jan 19, 2024

k8s-ci-robot commented Jan 19, 2024

Recent changes to v1_node_condition.py causing ValueError for MCEHardwareErrors on Oracle OKE cloud platform #1733

Recent changes to v1_node_condition.py causing ValueError for MCEHardwareErrors on Oracle OKE cloud platform #1733

Comments

koaps commented Mar 1, 2022

julianseeger commented Mar 2, 2022 • edited

chrislinan commented Mar 2, 2022

kpulgam commented Mar 3, 2022

sybnex commented Mar 18, 2022 • edited

roycaihw commented Mar 28, 2022

iamkarlson commented Apr 7, 2022

everpeace commented Apr 11, 2022

k8s-triage-robot commented Jul 10, 2022

iamkarlson commented Jul 14, 2022

stan-sz commented Sep 27, 2022

SergeyKanzhelev commented Nov 10, 2022 • edited

k8s-triage-robot commented Feb 8, 2023

iamkarlson commented Feb 23, 2023

SergeyKanzhelev commented Feb 23, 2023

k8s-triage-robot commented May 24, 2023

k8s-triage-robot commented Jun 23, 2023

k8s-triage-robot commented Jan 19, 2024

k8s-ci-robot commented Jan 19, 2024

julianseeger commented Mar 2, 2022 •

edited

sybnex commented Mar 18, 2022 •

edited

SergeyKanzhelev commented Nov 10, 2022 •

edited