You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Describe the bug
We're encountering an odd case where a Vault leader is being replaced by a single follower that is restarted in a cluster of 3. We've deployed Vault to Kubernetes using the Helm chart and it's using the internal raft storage. The restart was performed with kubectl delete pod.
Details
Debug logs from the leader
2022-02-18T10:04:17.403Z [INFO] storage.raft: aborting pipeline replication: peer="{Voter aed4746c-6e41-ba17-dd97-848b8b58d11b vault-2.vault-internal:8201}"
2022-02-18T10:04:17.491Z [ERROR] storage.raft: failed to appendEntries to: peer="{Voter aed4746c-6e41-ba17-dd97-848b8b58d11b vault-2.vault-internal:8201}" error=EOF
2022-02-18T10:04:17.505Z [DEBUG] core.cluster-listener: creating rpc dialer: address=vault-2.vault-internal:8201 alpn=raft_storage_v1 host=raft-c15b9662-2d50-11da-29c3-6b62fa2e11fa
2022-02-18T10:04:17.508Z [ERROR] storage.raft: failed to heartbeat to: peer=vault-2.vault-internal:8201 error="dial tcp 172.16.32.53:8201: connect: connection refused"
2022-02-18T10:04:17.583Z [DEBUG] core.cluster-listener: creating rpc dialer: address=vault-2.vault-internal:8201 alpn=raft_storage_v1 host=raft-c15b9662-2d50-11da-29c3-6b62fa2e11fa
2022-02-18T10:04:17.629Z [DEBUG] core.cluster-listener: creating rpc dialer: address=vault-2.vault-internal:8201 alpn=raft_storage_v1 host=raft-c15b9662-2d50-11da-29c3-6b62fa2e11fa
2022-02-18T10:04:17.903Z [WARN] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=500.514487ms
2022-02-18T10:04:18.383Z [WARN] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=980.221753ms
2022-02-18T10:04:18.852Z [WARN] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=1.448886321s
2022-02-18T10:04:19.351Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=1.948375865s
2022-02-18T10:04:19.785Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=2.381793723s
2022-02-18T10:04:20.274Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=2.871663404s
2022-02-18T10:04:20.662Z [ERROR] storage.raft: failed to heartbeat to: peer=vault-2.vault-internal:8201 error="dial tcp 172.16.32.53:8201: connect: no route to host"
2022-02-18T10:04:20.662Z [ERROR] storage.raft: failed to appendEntries to: peer="{Voter aed4746c-6e41-ba17-dd97-848b8b58d11b vault-2.vault-internal:8201}" error="dial tcp 172.16.32.53:8201: connect: no route to host"
2022-02-18T10:04:20.742Z [DEBUG] core.cluster-listener: creating rpc dialer: address=vault-2.vault-internal:8201 alpn=raft_storage_v1 host=raft-c15b9662-2d50-11da-29c3-6b62fa2e11fa
2022-02-18T10:04:20.768Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=3.364927462s
2022-02-18T10:04:20.778Z [DEBUG] core.cluster-listener: creating rpc dialer: address=vault-2.vault-internal:8201 alpn=raft_storage_v1 host=raft-c15b9662-2d50-11da-29c3-6b62fa2e11fa
2022-02-18T10:04:21.246Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=3.843671429s
2022-02-18T10:04:21.723Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=4.319956192s
2022-02-18T10:04:22.210Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=4.807359124s
2022-02-18T10:04:22.675Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=5.272142026s
2022-02-18T10:04:23.152Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=5.749080432s
2022-02-18T10:04:23.640Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=6.237388002s
2022-02-18T10:04:23.734Z [ERROR] storage.raft: failed to appendEntries to: peer="{Voter aed4746c-6e41-ba17-dd97-848b8b58d11b vault-2.vault-internal:8201}" error="dial tcp 172.16.32.53:8201: connect: no route to host"
2022-02-18T10:04:23.734Z [ERROR] storage.raft: failed to heartbeat to: peer=vault-2.vault-internal:8201 error="dial tcp 172.16.32.53:8201: connect: no route to host"
2022-02-18T10:04:23.814Z [DEBUG] core.cluster-listener: creating rpc dialer: address=vault-2.vault-internal:8201 alpn=raft_storage_v1 host=raft-c15b9662-2d50-11da-29c3-6b62fa2e11fa
2022-02-18T10:04:23.858Z [DEBUG] core.cluster-listener: creating rpc dialer: address=vault-2.vault-internal:8201 alpn=raft_storage_v1 host=raft-c15b9662-2d50-11da-29c3-6b62fa2e11fa
2022-02-18T10:04:24.083Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=6.680564421s
2022-02-18T10:04:24.551Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=7.148301984s
2022-02-18T10:04:24.992Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=7.588839969s
2022-02-18T10:04:25.465Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=8.062711818s
2022-02-18T10:04:25.963Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=8.560429476s
2022-02-18T10:04:26.431Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=9.027937456s
2022-02-18T10:04:26.806Z [ERROR] storage.raft: failed to appendEntries to: peer="{Voter aed4746c-6e41-ba17-dd97-848b8b58d11b vault-2.vault-internal:8201}" error="dial tcp 172.16.32.53:8201: connect: no route to host"
2022-02-18T10:04:26.918Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=9.515057505s
2022-02-18T10:04:26.939Z [DEBUG] core.cluster-listener: creating rpc dialer: address=vault-2.vault-internal:8201 alpn=raft_storage_v1 host=raft-c15b9662-2d50-11da-29c3-6b62fa2e11fa
2022-02-18T10:04:27.383Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=9.979825297s
2022-02-18T10:04:27.852Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=10.449708301s
2022-02-18T10:04:28.307Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=10.903817242s
2022-02-18T10:04:28.777Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=11.374681422s
2022-02-18T10:04:29.255Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=11.852191499s
2022-02-18T10:04:29.693Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=12.28998255s
2022-02-18T10:04:29.878Z [ERROR] storage.raft: failed to heartbeat to: peer=vault-2.vault-internal:8201 error="dial tcp 172.16.32.53:8201: connect: no route to host"
2022-02-18T10:04:29.878Z [ERROR] storage.raft: failed to appendEntries to: peer="{Voter aed4746c-6e41-ba17-dd97-848b8b58d11b vault-2.vault-internal:8201}" error="dial tcp 172.16.32.53:8201: connect: no route to host"
2022-02-18T10:04:30.010Z [DEBUG] core.cluster-listener: creating rpc dialer: address=vault-2.vault-internal:8201 alpn=raft_storage_v1 host=raft-c15b9662-2d50-11da-29c3-6b62fa2e11fa
2022-02-18T10:04:30.058Z [DEBUG] core.cluster-listener: creating rpc dialer: address=vault-2.vault-internal:8201 alpn=raft_storage_v1 host=raft-c15b9662-2d50-11da-29c3-6b62fa2e11fa
2022-02-18T10:04:30.128Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=12.725059148s
2022-02-18T10:04:30.593Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=13.190626414s
2022-02-18T10:04:31.066Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=13.663332836s
2022-02-18T10:04:31.554Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=14.151433352s
2022-02-18T10:04:31.992Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=14.589504492s
2022-02-18T10:04:32.474Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=15.071567854s
2022-02-18T10:04:32.950Z [ERROR] storage.raft: failed to appendEntries to: peer="{Voter aed4746c-6e41-ba17-dd97-848b8b58d11b vault-2.vault-internal:8201}" error="dial tcp 172.16.32.53:8201: connect: no route to host"
2022-02-18T10:04:32.962Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=15.558984804s
2022-02-18T10:04:33.181Z [DEBUG] core.cluster-listener: creating rpc dialer: address=vault-2.vault-internal:8201 alpn=raft_storage_v1 host=raft-c15b9662-2d50-11da-29c3-6b62fa2e11fa
2022-02-18T10:04:33.440Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=16.037516496s
2022-02-18T10:04:33.882Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=16.479150749s
2022-02-18T10:04:34.375Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=16.972016773s
2022-02-18T10:04:34.846Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=17.443499502s
2022-02-18T10:04:35.341Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=17.938211882s
2022-02-18T10:04:35.789Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=18.386742135s
2022-02-18T10:04:36.022Z [ERROR] storage.raft: failed to heartbeat to: peer=vault-2.vault-internal:8201 error="dial tcp 172.16.32.53:8201: connect: no route to host"
2022-02-18T10:04:36.022Z [ERROR] storage.raft: failed to appendEntries to: peer="{Voter aed4746c-6e41-ba17-dd97-848b8b58d11b vault-2.vault-internal:8201}" error="dial tcp 172.16.32.53:8201: connect: no route to host"
2022-02-18T10:04:36.226Z [DEBUG] core.cluster-listener: creating rpc dialer: address=vault-2.vault-internal:8201 alpn=raft_storage_v1 host=raft-c15b9662-2d50-11da-29c3-6b62fa2e11fa
2022-02-18T10:04:36.237Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=18.834601049s
2022-02-18T10:04:36.278Z [ERROR] storage.raft: failed to heartbeat to: peer=vault-2.vault-internal:8201 error="dial tcp: lookup vault-2.vault-internal on 10.100.10.100:53: no such host"
2022-02-18T10:04:36.438Z [DEBUG] core.cluster-listener: creating rpc dialer: address=vault-2.vault-internal:8201 alpn=raft_storage_v1 host=raft-c15b9662-2d50-11da-29c3-6b62fa2e11fa
2022-02-18T10:04:36.440Z [ERROR] storage.raft: failed to appendEntries to: peer="{Voter aed4746c-6e41-ba17-dd97-848b8b58d11b vault-2.vault-internal:8201}" error="dial tcp: lookup vault-2.vault-internal on 10.100.10.100:53: no such host"
2022-02-18T10:04:36.595Z [DEBUG] core.cluster-listener: creating rpc dialer: address=vault-2.vault-internal:8201 alpn=raft_storage_v1 host=raft-c15b9662-2d50-11da-29c3-6b62fa2e11fa
2022-02-18T10:04:36.597Z [ERROR] storage.raft: failed to heartbeat to: peer=vault-2.vault-internal:8201 error="dial tcp: lookup vault-2.vault-internal on 10.100.10.100:53: no such host"
2022-02-18T10:04:36.701Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=19.29823513s
2022-02-18T10:04:37.082Z [DEBUG] core.cluster-listener: creating rpc dialer: address=vault-2.vault-internal:8201 alpn=raft_storage_v1 host=raft-c15b9662-2d50-11da-29c3-6b62fa2e11fa
2022-02-18T10:04:37.085Z [ERROR] storage.raft: failed to heartbeat to: peer=vault-2.vault-internal:8201 error="dial tcp: lookup vault-2.vault-internal on 10.100.10.100:53: no such host"
2022-02-18T10:04:37.142Z [DEBUG] core.cluster-listener: creating rpc dialer: address=vault-2.vault-internal:8201 alpn=raft_storage_v1 host=raft-c15b9662-2d50-11da-29c3-6b62fa2e11fa
2022-02-18T10:04:37.144Z [ERROR] storage.raft: failed to appendEntries to: peer="{Voter aed4746c-6e41-ba17-dd97-848b8b58d11b vault-2.vault-internal:8201}" error="dial tcp: lookup vault-2.vault-internal on 10.100.10.100:53: no such host"
2022-02-18T10:04:37.162Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=19.758994095s
2022-02-18T10:04:37.647Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=20.244676534s
2022-02-18T10:04:37.893Z [DEBUG] core.cluster-listener: creating rpc dialer: address=vault-2.vault-internal:8201 alpn=raft_storage_v1 host=raft-c15b9662-2d50-11da-29c3-6b62fa2e11fa
2022-02-18T10:04:37.895Z [ERROR] storage.raft: failed to heartbeat to: peer=vault-2.vault-internal:8201 error="dial tcp: lookup vault-2.vault-internal on 10.100.10.100:53: no such host"
2022-02-18T10:04:38.146Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=20.743018444s
2022-02-18T10:04:38.517Z [DEBUG] core.cluster-listener: creating rpc dialer: address=vault-2.vault-internal:8201 alpn=raft_storage_v1 host=raft-c15b9662-2d50-11da-29c3-6b62fa2e11fa
2022-02-18T10:04:38.519Z [ERROR] storage.raft: failed to appendEntries to: peer="{Voter aed4746c-6e41-ba17-dd97-848b8b58d11b vault-2.vault-internal:8201}" error="dial tcp: lookup vault-2.vault-internal on 10.100.10.100:53: no such host"
2022-02-18T10:04:38.604Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=21.201239805s
2022-02-18T10:04:39.100Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=21.697686171s
2022-02-18T10:04:39.283Z [DEBUG] core.cluster-listener: creating rpc dialer: address=vault-2.vault-internal:8201 alpn=raft_storage_v1 host=raft-c15b9662-2d50-11da-29c3-6b62fa2e11fa
2022-02-18T10:04:39.285Z [ERROR] storage.raft: failed to heartbeat to: peer=vault-2.vault-internal:8201 error="dial tcp: lookup vault-2.vault-internal on 10.100.10.100:53: no such host"
2022-02-18T10:04:39.591Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=22.188032397s
2022-02-18T10:04:40.053Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=22.650398418s
2022-02-18T10:04:40.524Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=23.121134016s
2022-02-18T10:04:40.999Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=23.596327017s
2022-02-18T10:04:41.146Z [DEBUG] core.cluster-listener: creating rpc dialer: address=vault-2.vault-internal:8201 alpn=raft_storage_v1 host=raft-c15b9662-2d50-11da-29c3-6b62fa2e11fa
2022-02-18T10:04:41.148Z [ERROR] storage.raft: failed to appendEntries to: peer="{Voter aed4746c-6e41-ba17-dd97-848b8b58d11b vault-2.vault-internal:8201}" error="dial tcp: lookup vault-2.vault-internal on 10.100.10.100:53: no such host"
2022-02-18T10:04:41.476Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=24.073632945s
2022-02-18T10:04:41.937Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=24.534144542s
2022-02-18T10:04:42.011Z [DEBUG] core.cluster-listener: creating rpc dialer: address=vault-2.vault-internal:8201 alpn=raft_storage_v1 host=raft-c15b9662-2d50-11da-29c3-6b62fa2e11fa
2022-02-18T10:04:42.014Z [ERROR] storage.raft: failed to heartbeat to: peer=vault-2.vault-internal:8201 error="dial tcp 172.16.38.163:8201: connect: connection refused"
2022-02-18T10:04:42.397Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=24.994504709s
2022-02-18T10:04:42.833Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=25.430536679s
2022-02-18T10:04:43.325Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=25.921880386s
2022-02-18T10:04:43.814Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=26.411563481s
2022-02-18T10:04:44.278Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=26.874840614s
2022-02-18T10:04:44.766Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=27.363277322s
2022-02-18T10:04:45.238Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=27.835264267s
2022-02-18T10:04:45.717Z [DEBUG] core.cluster-listener: performing server cert lookup
2022-02-18T10:04:45.736Z [DEBUG] storage.raft.raft-net: accepted connection: local-address=vault-0.vault-internal:8201 remote-address=172.16.38.163:51474
2022-02-18T10:04:45.737Z [WARN] storage.raft: rejecting vote request since we have a leader: from=vault-2.vault-internal:8201 leader=vault-0.vault-internal:8201
2022-02-18T10:04:45.738Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=28.335421268s
2022-02-18T10:04:46.201Z [DEBUG] storage.raft: failed to contact: server-id=aed4746c-6e41-ba17-dd97-848b8b58d11b time=28.798085525s
2022-02-18T10:04:46.327Z [DEBUG] core.cluster-listener: creating rpc dialer: address=vault-2.vault-internal:8201 alpn=raft_storage_v1 host=raft-c15b9662-2d50-11da-29c3-6b62fa2e11fa
2022-02-18T10:04:46.336Z [DEBUG] core.cluster-listener: performing client cert lookup
2022-02-18T10:04:46.344Z [ERROR] storage.raft: peer has newer term, stopping replication: peer="{Voter aed4746c-6e41-ba17-dd97-848b8b58d11b vault-2.vault-internal:8201}"
2022-02-18T10:04:46.344Z [INFO] storage.raft: entering follower state: follower="Node at vault-0.vault-internal:8201 [Follower]" leader=
2022-02-18T10:04:46.344Z [WARN] core: leadership lost, stopping active operation
2022-02-18T10:04:46.344Z [INFO] core: pre-seal teardown starting
2022-02-18T10:04:46.344Z [INFO] storage.raft: aborting pipeline replication: peer="{Voter 8a322099-8e67-3c92-3687-fd0bd8bd8b10 vault-1.vault-internal:8201}"
2022-02-18T10:04:46.344Z [DEBUG] storage.raft.autopilot: state update routine is now stopped
2022-02-18T10:04:46.344Z [DEBUG] storage.raft.autopilot: autopilot is now stopped
2022-02-18T10:04:46.844Z [INFO] core: stopping raft active node
2022-02-18T10:04:46.844Z [DEBUG] expiration: stop triggered
2022-02-18T10:04:46.845Z [DEBUG] expiration: finished stopping
2022-02-18T10:04:46.845Z [INFO] rollback: stopping rollback manager
2022-02-18T10:04:46.845Z [INFO] core: pre-seal teardown complete
2022-02-18T10:04:46.863Z [ERROR] core: clearing leader advertisement failed: error="node is not the leader"
2022-02-18T10:04:46.863Z [ERROR] core: unlocking HA lock failed: error="node is not the leader"
2022-02-18T10:04:46.939Z [DEBUG] core: parsing information for new active node: active_cluster_addr=https://vault-0.vault-internal:8201 active_redirect_addr=https://172.16.23.221:8200
2022-02-18T10:04:46.939Z [DEBUG] core: refreshing forwarding connection
2022-02-18T10:04:46.939Z [DEBUG] core: clearing forwarding clients
2022-02-18T10:04:46.939Z [DEBUG] core: done clearing forwarding clients
2022-02-18T10:04:46.939Z [DEBUG] core: done refreshing forwarding connection
2022-02-18T10:04:46.940Z [DEBUG] core.cluster-listener: creating rpc dialer: address=vault-0.vault-internal:8201 alpn=req_fw_sb-act_v1 host=fw-ba9cdaac-d2c2-350e-2008-50bfea8641bf
2022-02-18T10:04:46.942Z [WARN] core.cluster-listener: no TLS config found for ALPN: ALPN=["req_fw_sb-act_v1"]
2022-02-18T10:04:46.942Z [DEBUG] core.cluster-listener: error handshaking cluster connection: error="unsupported protocol"
2022-02-18T10:04:46.943Z [DEBUG] core: forwarding: error sending echo request to active node: error="rpc error: code = Unavailable desc = connection error: desc = \"transport: Error while dialing remote error: tls: internal error\""
2022-02-18T10:04:47.146Z [WARN] core.cluster-listener: no TLS config found for ALPN: ALPN=["req_fw_sb-act_v1"]
2022-02-18T10:04:47.146Z [DEBUG] core.cluster-listener: error handshaking cluster connection: error="unsupported protocol"
2022-02-18T10:04:47.235Z [DEBUG] storage.raft: lost leadership because received a requestVote with a newer term
2022-02-18T10:04:47.387Z [DEBUG] core.cluster-listener: performing server cert lookup
2022-02-18T10:04:47.402Z [DEBUG] storage.raft.raft-net: accepted connection: local-address=vault-0.vault-internal:8201 remote-address=172.16.38.163:51514
2022-02-18T10:04:49.440Z [DEBUG] core: parsing information for new active node: active_cluster_addr=https://vault-2.vault-internal:8201 active_redirect_addr=https://172.16.38.163:8200
2022-02-18T10:04:49.440Z [DEBUG] core: refreshing forwarding connection
2022-02-18T10:04:49.440Z [DEBUG] core: clearing forwarding clients
2022-02-18T10:04:49.440Z [DEBUG] core: done clearing forwarding clients
2022-02-18T10:04:49.440Z [DEBUG] core: forwarding: stopping heartbeating
2022-02-18T10:04:49.440Z [DEBUG] core: done refreshing forwarding connection
2022-02-18T10:04:49.440Z [DEBUG] core.cluster-listener: creating rpc dialer: address=vault-2.vault-internal:8201 alpn=req_fw_sb-act_v1 host=fw-7012efed-b8b7-d213-4d80-7e33705a8c05
2022-02-18T10:04:49.451Z [DEBUG] core.cluster-listener: performing client cert lookup
We are experiencing this behavior as well. We expected the cluster to be online and serving in HA mode in the event that a passive pod is lost however what we observe is that occasionally (no real rhyme or reason AFAICT) when the passive pod returns it seems to trigger an entire leadership election and cause then entire cluster to become momentarily unavailable until quorum amongst the raft members is re-established. This doesn't behave like the HA-design that we expected (i.e. we expected reads and writes to the cluster to still work since the rest of the raft members are still in quorum and there's nothing theoretically wrong with the cluster -- except that the new follower node doesn't like the old status-quo for some unknown reason so it forces a leader election).
Describe the bug
We're encountering an odd case where a Vault leader is being replaced by a single follower that is restarted in a cluster of 3. We've deployed Vault to Kubernetes using the Helm chart and it's using the internal raft storage. The restart was performed with
kubectl delete pod
.Details
Debug logs from the leader
Debug logs from the restarted node
Expected behavior
From reading the docs, we expected that the restarted node should re-join the cluster as a follower and not force an election
Environment:
vault status
):1.9.2
vault version
):1.9.2
0.18.0
Vault server configuration file(s):
The text was updated successfully, but these errors were encountered: