Recent changes to v1_node_condition.py causing ValueError for MCEHardwareErrors on Oracle OKE cloud platform #1733
Comments
Same for GCP/GKE:
This is about pod conditions or pod readiness gates, but it looks to me like it has the same root cause as the node condition issue reported by @koaps. Edit: it first happened after upgrading from 21.7.0 to 23.3.0. |
I have the same issue:
|
Piling on here. I am getting a similar error on AWS in combination with the AWS Load Balancer Controller, which injects a readiness gate into pods.
|
Same for Azure/AKS
|
This is being fixed upstream. We will cut a new 1.23 client to backport the fix once the PR kubernetes/kubernetes#108740 is merged. |
Is there any mitigation available while a permanent fix is being developed? |
We hit this issue too. We would like to know of a workaround until a new version is released. |
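One possible mitigation, offered as a sketch rather than an official fix: skip the client's model deserialization and read the raw JSON instead, so the condition type never reaches the validating setter. This relies on the generated client's `_preload_content=False` option, which returns the undecoded HTTP response. The helper names `conditions_by_node` and `list_node_conditions` are hypothetical, and loading credentials with `config.load_kube_config()` is an assumption about the environment.

```python
import json


def conditions_by_node(raw_body):
    """Map node name -> {condition type: status} from a raw NodeList JSON body.

    Works on plain dicts, so unknown condition types such as
    MCEHardwareErrors pass through without any validation.
    """
    node_list = json.loads(raw_body)
    result = {}
    for item in node_list.get("items", []):
        name = item.get("metadata", {}).get("name", "")
        conditions = item.get("status", {}).get("conditions", [])
        result[name] = {c["type"]: c["status"] for c in conditions}
    return result


def list_node_conditions():
    """Hypothetical wrapper: fetch nodes without building model objects."""
    # Imported lazily so the parsing helper above works without the client installed.
    from kubernetes import client, config

    config.load_kube_config()  # assumption: a local kubeconfig is available
    v1 = client.CoreV1Api()
    # _preload_content=False returns the raw response instead of V1NodeList.
    response = v1.list_node(_preload_content=False)
    return conditions_by_node(response.data)
```

The trade-off is that callers get dictionaries rather than `V1Node` objects, so any code that relies on model attributes would need adjusting. Pinning the client to the last version that worked for you (21.7.0 is mentioned above) is another option.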
The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs. This bot triages issues and PRs according to the following rules:
You can:
Please send feedback to sig-contributor-experience at kubernetes/community. /lifecycle stale |
/remove-lifecycle stale |
@roycaihw has this been backported? |
I see the change that was breaking it was reverted in #1789 |
/lifecycle stale |
/remove-lifecycle stale |
Do you still see this issue? I haven't checked the latest release recently, but the last time I checked, the code was fixed. |
/lifecycle stale |
/lifecycle rotten |
/close not-planned |
@k8s-triage-robot: Closing this issue, marking it as "Not Planned". |
The changes for 23.3.0 (b227345) include a hard-coded set of node conditions, which may not cover every condition supported by different cloud providers:
https://github.com/kubernetes-client/python/blob/master/kubernetes/client/models/v1_node_condition.py#L217
Oracle's OKE cloud platform reports an additional node condition:
MCEHardwareErrors
This causes a ValueError when the 23.3.0 client is used to list nodes:
The set of accepted node conditions should not be fixed. It should accommodate conditions defined by the cloud provider in use.
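To illustrate the failure mode, here is a minimal sketch that assumes the generated setter validates the condition type against a fixed list. `StrictCondition`, `PermissiveCondition`, and `ALLOWED_TYPES` are hypothetical stand-ins, not the client's actual code, and the list of values is illustrative only.

```python
# Illustrative list only; the real values live in v1_node_condition.py.
ALLOWED_TYPES = (
    "DiskPressure",
    "MemoryPressure",
    "NetworkUnavailable",
    "PIDPressure",
    "Ready",
)


class StrictCondition:
    """Hypothetical model that rejects any type outside a fixed list."""

    def __init__(self, type):
        self.type = type  # goes through the validating setter below

    @property
    def type(self):
        return self._type

    @type.setter
    def type(self, value):
        if value not in ALLOWED_TYPES:
            raise ValueError(
                "Invalid value for `type` ({0}), must be one of {1}".format(
                    value, list(ALLOWED_TYPES)
                )
            )
        self._type = value


class PermissiveCondition:
    """Hypothetical model that only requires the type to be non-empty."""

    def __init__(self, type):
        if not type:
            raise ValueError("Invalid value for `type`, must not be empty")
        self.type = type


if __name__ == "__main__":
    try:
        StrictCondition("MCEHardwareErrors")
    except ValueError as exc:
        print("strict model rejected provider condition:", exc)
    print("permissive model accepted:", PermissiveCondition("MCEHardwareErrors").type)
```

With the strict variant, a single provider-specific condition on any node is enough to make the whole list call fail during deserialization, which matches the behaviour reported here.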