Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Flaky PartitionTransitionImplTest.shouldAbortOngoingTransitionWhenNewTransitionIsRequested #18445

Open
npepinpe opened this issue May 13, 2024 · 1 comment
Assignees
Labels
component/zeebe Related to the Zeebe component/team kind/flake Categorizes issue or PR as related to a flaky test

Comments

@npepinpe
Copy link
Member

Summary

  • How often does the test fail?
    • Twice
  • Does it block your work?
    • No
  • Do we suspect that it is a real failure?
    • Likely not

Failures

Example assertion failure
java.lang.IndexOutOfBoundsException: Index 0 out of bounds for length 0
	at java.base/jdk.internal.util.Preconditions.outOfBounds(Preconditions.java:100)
	at java.base/jdk.internal.util.Preconditions.outOfBoundsCheckIndex(Preconditions.java:106)
	at java.base/jdk.internal.util.Preconditions.checkIndex(Preconditions.java:302)
	at java.base/java.util.Objects.checkIndex(Objects.java:385)
	at java.base/java.util.ArrayList.remove(ArrayList.java:551)
	at io.camunda.zeebe.broker.system.partitions.impl.PartitionTransitionProcess.lambda$proceedWithTransition$1(PartitionTransitionProcess.java:88)
	at io.camunda.zeebe.scheduler.testing.TestConcurrencyControl.run(TestConcurrencyControl.java:50)
@npepinpe npepinpe added kind/flake Categorizes issue or PR as related to a flaky test component/zeebe Related to the Zeebe component/team labels May 13, 2024
@oleschoenburg
Copy link
Member

I couldn't reproduce this yet. The exception says that PartitionTransitionProcess tries to remove from pendingSteps but it's empty. As far as I can tell, this can only happen if PartitionTransitionProcess isn't thread safe. Since the test uses TestConcurrencyControl which behaves differently to the production version, I suspect a concurrency issue that is test-only and difficult to trigger.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
component/zeebe Related to the Zeebe component/team kind/flake Categorizes issue or PR as related to a flaky test
Projects
None yet
Development

No branches or pull requests

2 participants