Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Cannot reserve XY bytes of direct buffer memory #9713

Closed
Zelldon opened this issue Jul 6, 2022 · 2 comments
Closed

Cannot reserve XY bytes of direct buffer memory #9713

Zelldon opened this issue Jul 6, 2022 · 2 comments
Labels
area/reliability Marks an issue as related to improving the reliability of our software (i.e. it behaves as expected) kind/bug Categorizes an issue or PR as a bug severity/high Marks a bug as having a noticeable impact on the user with no known workaround

Comments

@Zelldon
Copy link
Member

Zelldon commented Jul 6, 2022

Describe the bug
We seeing errors "Out of memory, exiting now because we can't recover from OOM. Consider adjusting memory limits." on prod.

Error: https://console.cloud.google.com/errors/detail/CKSrwcbo6qihLg;service=zeebe;time=P7D?project=camunda-cloud-240911
error

The cluster seems to be completely borked. Impact as high since it looks like that cluster is unusable.

To Reproduce
I have no clue yet, it seems to be an election loop. You can see that in the metrics but also in the logs where the term is really high.

Transition to FOLLOWER on term 353 - transitioning StreamProcessor

Expected behavior

Log/Stacktrace

Full Stacktrace

java.lang.OutOfMemoryError: Cannot reserve 75498884 bytes of direct buffer memory (allocated: 2003030408, limit: 2075918336)
	at java.nio.Bits.reserveMemory(Unknown Source) ~[?:?]
	at java.nio.DirectByteBuffer.<init>(Unknown Source) ~[?:?]
	at java.nio.ByteBuffer.allocateDirect(Unknown Source) ~[?:?]
	at io.camunda.zeebe.util.allocation.DirectBufferAllocator.allocate(DirectBufferAllocator.java:20) ~[zeebe-util-8.0.2.jar:8.0.2]
	at io.camunda.zeebe.util.allocation.BufferAllocators.allocateDirect(BufferAllocators.java:16) ~[zeebe-util-8.0.2.jar:8.0.2]
	at io.camunda.zeebe.dispatcher.DispatcherBuilder.initAllocatedBuffer(DispatcherBuilder.java:147) ~[zeebe-dispatcher-8.0.2.jar:8.0.2]
	at io.camunda.zeebe.dispatcher.DispatcherBuilder.build(DispatcherBuilder.java:93) ~[zeebe-dispatcher-8.0.2.jar:8.0.2]
	at io.camunda.zeebe.logstreams.impl.log.LogStreamImpl.createAndScheduleWriteBuffer(LogStreamImpl.java:340) ~[zeebe-logstreams-8.0.2.jar:8.0.2]
	at io.camunda.zeebe.logstreams.impl.log.LogStreamImpl.lambda$openAppender$9(LogStreamImpl.java:297) ~[zeebe-logstreams-8.0.2.jar:8.0.2]
	at io.camunda.zeebe.util.sched.ActorJob.invoke(ActorJob.java:79) ~[zeebe-util-8.0.2.jar:8.0.2]
	at io.camunda.zeebe.util.sched.ActorJob.execute(ActorJob.java:44) ~[zeebe-util-8.0.2.jar:8.0.2]
	at io.camunda.zeebe.util.sched.ActorTask.execute(ActorTask.java:122) ~[zeebe-util-8.0.2.jar:8.0.2]
	at io.camunda.zeebe.util.sched.ActorThread.executeCurrentTask(ActorThread.java:97) ~[zeebe-util-8.0.2.jar:8.0.2]
	at io.camunda.zeebe.util.sched.ActorThread.doWork(ActorThread.java:80) ~[zeebe-util-8.0.2.jar:8.0.2]
	at io.camunda.zeebe.util.sched.ActorThread.run(ActorThread.java:189) ~[zeebe-util-8.0.2.jar:8.0.2]

Environment:

  • OS: saas
  • Zeebe Version: 8.0.2
  • Configuration: G3-Trial
@Zelldon Zelldon added kind/bug Categorizes an issue or PR as a bug severity/high Marks a bug as having a noticeable impact on the user with no known workaround area/reliability Marks an issue as related to improving the reliability of our software (i.e. it behaves as expected) team/distributed labels Jul 6, 2022
@oleschoenburg
Copy link
Member

This cluster ran into #9115 a while ago : Logs

@menski
Copy link
Contributor

menski commented Jul 8, 2022

Closing for now, the cluster is in an unhealthy state, and the error didn't happen in another cluster. We are fixing an issue related to memory leaks, which might be related.

@menski menski closed this as completed Jul 8, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/reliability Marks an issue as related to improving the reliability of our software (i.e. it behaves as expected) kind/bug Categorizes an issue or PR as a bug severity/high Marks a bug as having a noticeable impact on the user with no known workaround
Projects
None yet
Development

No branches or pull requests

3 participants