[Failing Job] post-kubernetes-push-image-etcd #105284

Closed
spiffxp opened this issue Sep 27, 2021 · 10 comments · Fixed by #105484
Labels

  • area/release-eng: Issues or PRs related to the Release Engineering subproject
  • kind/failing-test: Categorizes issue or PR as related to a consistently or frequently failing test.
  • priority/important-soon: Must be staffed and worked on either currently, or very soon, ideally in time for the next release.
  • sig/api-machinery: Categorizes an issue or PR as relevant to SIG API Machinery.
  • sig/release: Categorizes an issue or PR as relevant to SIG Release.
  • triage/accepted: Indicates an issue or PR is ready to be actively worked on.


@spiffxp
Member

spiffxp commented Sep 27, 2021

Which jobs are failing:

  • post-kubernetes-push-image-etcd

Which test(s) are failing:

n/a

Since when has it been failing:

2020-08-11

Testgrid link:

https://testgrid.k8s.io/sig-k8s-infra-gcb#post-kubernetes-push-image-etcd

Reason for failure:

docker buildx version
docker: 'buildx' is not a docker command.
See 'docker --help'

Anything else we need to know:

Started failing when #104116 merged

No new versions of etcd are getting pushed until this gets fixed, which means the following PRs haven't fully taken effect:

Relevant SIG

/sig api-machinery

/priority critical-urgent
/milestone v1.23
/assign @saschagrunert @jpbetz
FYI @kubernetes/release-engineering

@spiffxp spiffxp added the kind/failing-test Categorizes issue or PR as related to a consistently or frequently failing test. label Sep 27, 2021
@k8s-ci-robot k8s-ci-robot added sig/api-machinery Categorizes an issue or PR as relevant to SIG API Machinery. priority/critical-urgent Highest priority. Must be actively worked on as someone's top priority right now. labels Sep 27, 2021
@k8s-ci-robot k8s-ci-robot added this to the v1.23 milestone Sep 27, 2021
@k8s-ci-robot k8s-ci-robot added the needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. label Sep 27, 2021
@spiffxp
Member Author

spiffxp commented Sep 27, 2021

/sig release
/area release-eng

@k8s-ci-robot k8s-ci-robot added sig/release Categorizes an issue or PR as relevant to SIG Release. area/release-eng Issues or PRs related to the Release Engineering subproject labels Sep 27, 2021
@spiffxp spiffxp changed the title [Failing Job] post-kubernetes-push-image-etcd/ [Failing Job] post-kubernetes-push-image-etcd Sep 27, 2021
@spiffxp
Member Author

spiffxp commented Sep 27, 2021

I've opened #105285 to roll back the offending commit. I'd like to keep this issue open until we can figure out the correct fix-forward.

@fedebongio
Contributor

/triage accepted

@k8s-ci-robot k8s-ci-robot added triage/accepted Indicates an issue or PR is ready to be actively worked on. and removed needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. labels Sep 28, 2021
@spiffxp
Member Author

spiffxp commented Sep 28, 2021

https://prow.k8s.io/view/gs/kubernetes-jenkins/logs/post-kubernetes-push-image-etcd/1442594901840629760 was triggered by the merge of #105285, which pushed gcr.io/k8s-staging-etcd/etcd:3.5.0-3

So that image is now current and includes whatever the PRs linked in the description may have tried to change (excluding #104116).

@spiffxp
Member Author

spiffxp commented Sep 28, 2021

@pacoxu does this need promotion at all or are we good with the existing k8s.gcr.io/etcd:3.5.0-0 image?

@spiffxp
Member Author

spiffxp commented Sep 29, 2021

/remove-priority critical-urgent
/priority important-soon
The job is back to passing

@k8s-ci-robot k8s-ci-robot added priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release. and removed priority/critical-urgent Highest priority. Must be actively worked on as someone's top priority right now. labels Sep 29, 2021
@pacoxu
Member

pacoxu commented Sep 30, 2021

@pacoxu does this need promotion at all or are we good with the existing k8s.gcr.io/etcd:3.5.0-0 image?

The recent updates relate to the Debian base image update and the Go upgrade.
A promotion would be nice to have, but k8s.gcr.io/etcd:3.5.0-0 is good enough to use.

@saschagrunert
Member

saschagrunert commented Oct 4, 2021

Yeah, it looks like the env in the build job does not search for docker CLI plugins in the /root/.docker/cli-plugins path, which makes buildx unavailable during the run.
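
For anyone debugging something similar, here is a minimal sketch of how one might check whether a plugin binary is visible in the per-user plugin directory. It assumes the docker CLI derives that directory from $HOME (as `$HOME/.docker/cli-plugins`) and that plugin binaries are named `docker-<name>`; `has_user_cli_plugin` is a hypothetical helper, not part of docker or of the build job.

```shell
#!/bin/sh
# Sketch only: has_user_cli_plugin is a hypothetical helper.
# It reports whether an executable docker CLI plugin exists in the
# per-user plugin directory, which is assumed to be derived from $HOME.
has_user_cli_plugin() {
  plugin_name="$1"
  plugin_path="${HOME}/.docker/cli-plugins/docker-${plugin_name}"
  if [ -x "${plugin_path}" ]; then
    echo "found: ${plugin_path}"
    return 0
  fi
  echo "missing: ${plugin_path}"
  return 1
}

# Example (not executed here):
#   has_user_cli_plugin buildx
```

If $HOME in the build step points somewhere other than /root, this check would report the plugin as missing even though the binary is installed under /root/.docker/cli-plugins.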

@saschagrunert
Member

I think we have to set the $HOME environment variable like we already do in other build jobs. I'm proposing the fix in #105484.
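
As a rough illustration of the idea (not the actual change in #105484, which is not shown here), a build step could run docker with $HOME pointed at the directory that holds the plugins. `run_with_home` is a hypothetical wrapper, and `/root` is assumed from the plugin path mentioned above.

```shell
#!/bin/sh
# Sketch only: run_with_home is a hypothetical wrapper.
# It runs the given command with HOME overridden for that command only,
# leaving the caller's HOME untouched.
run_with_home() {
  home_dir="$1"
  shift
  env "HOME=${home_dir}" "$@"
}

# Example (not executed here), assuming the plugins live under /root:
#   run_with_home /root docker buildx version
```

Overriding HOME per command keeps the change local to the docker invocation. Setting it once in the job's environment, as suggested above, would have the same effect for every step.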

6 participants