Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

simulate hanging disk i/o (theory: raft does not trigger leader election) #164

Open
stapelberg opened this issue Apr 19, 2022 · 1 comment

Comments

@stapelberg
Copy link
Contributor

On 2022-April-19, one of the RobustIRC servers showed what looked like symptoms of a hanging storage controller.

The server was still reachable over the network, but any reads/writes would just hang. (I observed this with an interactive shell session I still had open.)

Interestingly enough, it seems like Raft triggers a leader election when the server is unreachable on the network, but doesn’t detect when the leader can’t make progress.

We should try to simulate this and see if there’s anything we can do to improve the current failure mode.

@stapelberg
Copy link
Contributor Author

Can reproduce, reported upstream at hashicorp/raft#503

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant