[Flaky test] ci-kubernetes-unit #110962

Closed
Nivedita-coder opened this issue Jul 5, 2022 · 7 comments
Assignees
Labels
kind/flake Categorizes issue or PR as related to a flaky test. sig/testing Categorizes an issue or PR as relevant to SIG Testing. triage/accepted Indicates an issue or PR is ready to be actively worked on.

Comments

@Nivedita-coder

Which jobs are flaking?

release-master-blocking

  • ci-kubernetes-unit

Which tests are flaking?

  • ci-kubernetes-unit.Overall
  • k8s.io/kubernetes/pkg/kubelet/nodeshutdown: TestLocalStorage
  • k8s.io/kubernetes/pkg/kubelet/nodeshutdown: TestFeatureEnabled
  • k8s.io/kubernetes/pkg/scheduler/framework/plugins/volumebinding.TestBindPodVolumes
  • k8s.io/kubernetes/pkg/scheduler/framework/plugins/volumebinding.TestBindPodVolumes/binding-static-pv-succeeds-after-time

Since when has it been flaking?

07-04 22:51 EEST

Testgrid link

https://testgrid.k8s.io/sig-release-master-blocking#ci-kubernetes-unit

Reason for failure (if possible)

{Failed
=== CONT
    testing.go:1312: race detected during execution of test
FAIL
FAIL k8s.io/kubernetes/pkg/kubelet/nodeshutdown 2.384s
}

Anything else we need to know?

No response

Relevant SIG(s)

/sig testing
cc @kubernetes/ci-signal

@Nivedita-coder Nivedita-coder added the kind/flake Categorizes issue or PR as related to a flaky test. label Jul 5, 2022
@k8s-ci-robot k8s-ci-robot added sig/testing Categorizes an issue or PR as relevant to SIG Testing. needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. and removed needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. labels Jul 5, 2022
@endocrimes
Member

/triage accepted
/assign

@k8s-ci-robot k8s-ci-robot added triage/accepted Indicates an issue or PR is ready to be actively worked on. and removed needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. labels Jul 5, 2022
@MadhavJivrajani
Contributor

@pohly maybe you can help provide insights here (at least for the nodeshutdown tests since they are related to race conditions due to logging)!

Logs with details about the race: https://prow.k8s.io/view/gs/kubernetes-jenkins/logs/ci-kubernetes-unit/1544926366053437440

It seems the race arises because t is passed to the test logger and is still used for writing log output here:

func NewTestContext(tl TL) (logr.Logger, context.Context) {

Read happens here:
https://github.com/golang/go/blob/3a7cec27738c51347f4e5a466410a0b2089391b7/src/testing/testing.go#L882

Write:
https://github.com/golang/go/blob/3a7cec27738c51347f4e5a466410a0b2089391b7/src/testing/testing.go#L1424-L1426

@pohly
Contributor

pohly commented Jul 7, 2022

This is a dupe of #110854. A revert is pending as a stop-gap, and I expect an updated klog soon that is more resilient against leaked goroutines (the actual problem).
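
For readers unfamiliar with the "leaked goroutines" problem mentioned here: a goroutine that outlives its test can still call the test's logger, which then touches t while the testing package is finishing the test. The following is only a minimal sketch of one way a log sink can guard against that, by detaching from t in a cleanup hook. It is not the klog code. TL here is a hypothetical two-method interface (not the real TL from the snippet above), and safeSink, fakeT, and finish are likewise made up for illustration. fakeT stands in for testing.T so the example runs as a plain program.

```go
package main

import (
	"fmt"
	"sync"
)

// TL is a hypothetical stand-in for the small part of testing.TB used here.
type TL interface {
	Log(args ...any)
	Cleanup(func())
}

// safeSink forwards messages to a TL only while the test is still running.
type safeSink struct {
	mu      sync.Mutex
	t       TL       // set to nil once the test has finished
	dropped []string // messages that arrived after the test finished
}

func newSafeSink(t TL) *safeSink {
	s := &safeSink{t: t}
	// Detach from t when the test ends, so a leaked goroutine
	// can no longer reach it through this sink.
	t.Cleanup(func() {
		s.mu.Lock()
		defer s.mu.Unlock()
		s.t = nil
	})
	return s
}

// Info logs through t if the test is still running and otherwise
// keeps the message aside instead of touching t.
func (s *safeSink) Info(msg string) {
	s.mu.Lock()
	defer s.mu.Unlock()
	if s.t == nil {
		s.dropped = append(s.dropped, msg)
		return
	}
	s.t.Log(msg)
}

// fakeT is an illustrative replacement for testing.T: it records log
// calls and runs registered cleanups when finish is called.
type fakeT struct {
	logs     []string
	cleanups []func()
}

func (f *fakeT) Log(args ...any)   { f.logs = append(f.logs, fmt.Sprint(args...)) }
func (f *fakeT) Cleanup(fn func()) { f.cleanups = append(f.cleanups, fn) }

// finish runs cleanups in reverse registration order, as testing.T does.
func (f *fakeT) finish() {
	for i := len(f.cleanups) - 1; i >= 0; i-- {
		f.cleanups[i]()
	}
}

func main() {
	ft := &fakeT{}
	sink := newSafeSink(ft)

	sink.Info("during test") // forwarded to ft
	ft.finish()              // the test "completes"
	sink.Info("after test")  // what a leaked goroutine would do

	fmt.Println(len(ft.logs), len(sink.dropped)) // prints "1 1"
}
```

The mutex means a late log call either finishes before the cleanup detaches the sink or sees nil afterwards. The leaked goroutine itself is still a bug in the test; this only keeps it from racing on t.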

@shuheiktgw
Contributor

shuheiktgw commented Jul 7, 2022

Hello @pohly 👋 I'm a 1.25 CI Signal Shadow. Could you let us know whether this issue should be a blocker for the 1.25.0-alpha.3 release on July 9th? I believe it is not a blocker, but I'm asking just in case.

@pohly
Contributor

pohly commented Jul 8, 2022

The klog update went in, so the race in the logging path should be gone now. The kubelet shutdown test still has other data races, but they don't seem to occur in CI.

@Nivedita-coder
Author

job is passing :)
/close

@k8s-ci-robot
Contributor

@Nivedita-coder: Closing this issue.
