-
Notifications
You must be signed in to change notification settings - Fork 556
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Endless loop in deleting #8103
Comments
@deepthidevaki found the cause #8099 (comment) 🙇 |
What's the impact on the cluster? Do we have any idea? Would it recover? |
Hard to say. I mean it is stuck on OOD, with the endless loop it blocks useful resources to do snapshots on other partitions I can imagine. Restart might help to get out of the loop, but I think in general the cluster would need manual help, like increasing disk, delete data or something. In general it should not end in OOD in the first place and of course it should not end in an endless loop try to clean up data, where is not possible. |
It is not an "endless loop" in which the thread is stuck for ever in this operation. Whenever follower receives an AppendRequest and it tries to write to the journal, it received OOD exception, and then it tries to "compact", which does nothing. So it is mostly just a wasteful work, and annoying logs. It will recover if the broker recovers from OOD. |
Thanks @deepthidevaki for clarifying this, haven't checked the code before. |
Let's remove the compact behavior here, as the follower can only compact on a new snapshot anyway. |
Never occurred again I will close this for now |
We can have an endless loop in deleting:
Originally posted by @Zelldon in #8099 (comment)
The text was updated successfully, but these errors were encountered: