New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
tokio-runtime segfault #1993
Comments
Debugging this further, it seems that anytime I get close to maxing out the RAM it crashes. |
We haven't seen this issue on any other hardware, so it would be super interesting to know what's unique about your setup that's causing this. It might be a bug in Tokio related to the specific (janky?) hypervisor your VPS provider uses. You could run something like Some of the mysterious database errors we've seen have also been from people running under a hypervisor, which might just be a coincidence, but I'm not sure. The bug may happen to be fixed by the Tokio 0.3 change (which is almost ready for release in v1.0.2) |
I'm running on Contabo VPS. $ ./slabbed-or-not
Not running under any known container type
Hypervisor: KVM I'll keep an eye for the Tokio 0.3 release :) |
The tokio 0.3 release has been merged. Let us know if the issue persists. I believe one cause of this issue was running with a high peer count (likely more than the computer can handle). |
I'm going to close this issue, assuming it has been resolved. Please re-open if the issue persists. |
It does appear a bit more stable, but still seeing the same issue on 2383bfe with 50 peers. I will try to ramp down to 30 to see if it improves. |
@AgeManning FYI, it doesn't look like I have the ability to reopen. |
hmm are you running any strange hardware? What OS? |
I only saw one reported issue of seg faults in tokio recently and looks like it's been fixed in 0.3: tokio-rs/tokio#3019 |
@AgeManning running Ubuntu 20.04 on a VPS. 8GB ram and 4 xeon vCores. The crash is relatively consistent, I've reprovisioned a few times only to find the same error. Maybe this week I can figure out how to get a proper core dump to share. |
@lightclient which lighthouse version are you running? |
Are you building the binary locally on the box? Can you try running the portable version and see if it also happens there
and use the binary at |
@AgeManning okay, I'll give that a shot. I've been building locally and have been using the optimized version. |
Closing this since I assumed it's been fixed by upgrading |
Description
I encountered some db corruption and I believe the root cause is lighthouse crashes regularly due to an issue with the tokio runtime. I'm continuing to debug. One concern is that the VPS I am running on doesn't have stellar stability reviews. I'm wondering if that is causing the runtime to go awry. However, I have managed to maintain 100% uptime on my go-ethereum nodes.
Version
I've run into this issue on both 5828ff1, v1.0, and v1.0.1.
Present Behaviour
The tokio-runtime segfaults after a period of time:
The text was updated successfully, but these errors were encountered: