
Kubectl: Keepalive connection to API server for exec and logs. #94301

Closed

nielsbasjes opened this issue Aug 28, 2020 · 24 comments
Assignees
Labels
good first issue: Denotes an issue ready for a new contributor, according to the "help wanted" guidelines.
help wanted: Denotes an issue that needs help from a contributor. Must meet "help wanted" guidelines.
kind/feature: Categorizes issue or PR as related to a new feature.
lifecycle/rotten: Denotes an issue or PR that has aged beyond stale and will be auto-closed.
priority/backlog: Higher priority than priority/awaiting-more-evidence.
sig/api-machinery: Categorizes an issue or PR as relevant to SIG API Machinery.
sig/cli: Categorizes an issue or PR as relevant to SIG CLI.

Comments

@nielsbasjes

What would you like to be added:

A feature (enabled by default) where any "long running" kubectl command (like exec and logs) sends a periodic keep-alive signal to the API server.

At this point such a keepalive is only available for the proxy command (via its --keepalive flag), where it is disabled by default.

I propose making this keepalive part of exec and logs as well (and possibly other commands), with a default value of 5s.

Why is this needed:

When running a Kubernetes cluster in highly available mode (i.e. multiple API servers behind a load balancer), it is common for client-side inactivity timeouts to be set on the load balancer so that it can clean up stale connections.

The recommended HAProxy configuration (which is what I am running right now) sets the client and server timeouts to 20s:
https://github.com/kubernetes/kubeadm/blob/master/docs/ha-considerations.md#haproxy-configuration
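For context, the timeout lines in question from that configuration look roughly like this (an excerpt; only the settings relevant here are shown):

```
defaults
    # Idle timeouts on both sides of the proxy: an idle kubectl
    # connection is cut after 20 seconds without traffic.
    timeout client 20s
    timeout server 20s
```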

What I ran into: I was following the logs of a pod (i.e. kubectl logs -f) while the process at hand took a "long time" between log messages (I am debugging a new application). After the configured timeout (the mentioned 20 seconds) the connection would be dropped and I had to restart logs -f to see the rest of the messages.

Similarly, exec-ing a shell into a pod to examine what is happening, looking something up on a web page, and then returning to the shell to find it has been closed because I was idle for more than 20 seconds is not very productive.

See also: #58486

@nielsbasjes nielsbasjes added the kind/feature Categorizes issue or PR as related to a new feature. label Aug 28, 2020
@k8s-ci-robot k8s-ci-robot added the needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. label Aug 28, 2020
@nielsbasjes
Author

/sig api-machinery
/sig cli

@k8s-ci-robot k8s-ci-robot added sig/api-machinery Categorizes an issue or PR as relevant to SIG API Machinery. sig/cli Categorizes an issue or PR as relevant to SIG CLI. and removed needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. labels Aug 28, 2020
@nielsbasjes
Author

Note that at this point the only available mitigation for these practical problems is to increase the timeouts on the load balancer (HAProxy in my case).
Yet this is not desirable because:

  1. truly stale connections will remain in the load balancer's memory for longer, which may overload things in busy/large installations.
  2. connections will still be closed after the longer timeout, even when that is not wanted.

@fedebongio
Contributor

/assign @lavalamp
(who volunteered to find the related earlier issue)

@lavalamp
Member

lavalamp commented Sep 1, 2020

I think #94170 was the one I was thinking of. It adds the feature, but we would need to turn it on in various places. Probably both the kubectl <-> apiserver and the apiserver <-> kubelet paths would need this enabled.
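For illustration, a client could opt in roughly like this (a minimal sketch; I am assuming the NewRoundTripperWithConfig constructor and PingPeriod field in k8s.io/apimachinery's spdy package, whose exact shape may vary across Kubernetes versions):

```go
package main

import (
	"time"

	"k8s.io/apimachinery/pkg/util/httpstream/spdy"
)

func main() {
	// Build an SPDY round tripper that emits a ping frame every 5s, so
	// idle-timeout middleboxes (e.g. an HAProxy in front of the API
	// servers) keep seeing traffic on exec/attach/portforward streams.
	rt, err := spdy.NewRoundTripperWithConfig(spdy.RoundTripperConfig{
		PingPeriod: 5 * time.Second,
	})
	if err != nil {
		panic(err)
	}
	_ = rt // wire into remotecommand/portforward as appropriate
}
```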

@lavalamp
Member

lavalamp commented Sep 1, 2020

I don't think there's much harm in just turning it on; I don't really think we need a flag. A ping every 20s or so shouldn't break the bank.

@lavalamp lavalamp added good first issue Denotes an issue ready for a new contributor, according to the "help wanted" guidelines. help wanted Denotes an issue that needs help from a contributor. Must meet "help wanted" guidelines. labels Sep 1, 2020
@lavalamp lavalamp removed their assignment Sep 1, 2020
@lavalamp lavalamp added the priority/backlog Higher priority than priority/awaiting-more-evidence. label Sep 1, 2020
@caesarxuchao
Member

caesarxuchao commented Sep 1, 2020

#94170 might fix this.

edit: lavalamp beat me

@lavalamp
Member

lavalamp commented Sep 1, 2020 via email

@nielsbasjes
Author

Given that the recommended timeout is 20 seconds, I would set the default ping interval to something like 5 or 10 seconds.

@djzager

djzager commented Sep 3, 2020

/assign

@Nit123

Nit123 commented Sep 17, 2020

/assign

@vinayvenkat

/assign

@joshfix

joshfix commented Dec 3, 2020

+1 This would be super useful.

@bevank

bevank commented Dec 3, 2020

+1

@knight42
Member

knight42 commented Dec 5, 2020

Hi, I have filed #97083 to enable SPDY pings to address this issue.

@nielsbasjes
Author

As I understand it, #97083 addresses the exec issue but not the logs -f issue.

@knight42
Member

knight42 commented Dec 5, 2020

As I understand it, #97083 addresses the exec issue but not the logs -f issue.

More precisely, #97083 should address exec and portforward.

As for logs -f, I think this is a different problem, because logs -f actually sends a plain HTTP request to the REST API, and the connection might be terminated if the kubectl client, the apiserver, or the load balancer between them thinks the connection has been idle for a specific period.

IMHO, to address the logs -f issue we would need to reconnect until the user interrupts.
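Something like the following client-go loop is the idea (a rough sketch only: the namespace and pod name are placeholders, and a real implementation would also need to track PodLogOptions.SinceTime so that a reconnect does not replay lines that were already printed):

```go
package main

import (
	"context"
	"io"
	"os"
	"time"

	corev1 "k8s.io/api/core/v1"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/tools/clientcmd"
)

func main() {
	cfg, err := clientcmd.BuildConfigFromFlags("", clientcmd.RecommendedHomeFile)
	if err != nil {
		panic(err)
	}
	cs, err := kubernetes.NewForConfig(cfg)
	if err != nil {
		panic(err)
	}
	for {
		// Re-issue the follow request; the previous stream dies whenever
		// the client, the apiserver, or a load balancer drops the idle
		// connection.
		req := cs.CoreV1().Pods("default").GetLogs("my-pod",
			&corev1.PodLogOptions{Follow: true})
		if stream, err := req.Stream(context.Background()); err == nil {
			io.Copy(os.Stdout, stream) // copies until the connection drops
			stream.Close()
		}
		time.Sleep(time.Second) // brief backoff before re-attaching
	}
}
```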

@tiloso
Contributor

tiloso commented Dec 7, 2020

Hey, for logs -f the issue has been addressed in #95981 as far as I understand (at least for environments that use HTTP/2).
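For anyone curious about the mechanism: #95981 builds on the HTTP/2-level health check in golang.org/x/net/http2, where the transport sends PING frames on idle connections and closes the ones that do not answer. A bare sketch of that knob (the timeout values here are illustrative, not necessarily the ones client-go configures):

```go
package main

import (
	"net/http"
	"time"

	"golang.org/x/net/http2"
)

func main() {
	t := &http2.Transport{
		// Send a PING frame if the connection has seen no frames for 30s...
		ReadIdleTimeout: 30 * time.Second,
		// ...and close the connection if no PING ack arrives within 15s.
		PingTimeout: 15 * time.Second,
	}
	client := &http.Client{Transport: t}
	_ = client // use like any other *http.Client
}
```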

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-contributor-experience at kubernetes/community.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Mar 7, 2021
@fejta-bot

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-contributor-experience at kubernetes/community.
/lifecycle rotten

@k8s-ci-robot k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Apr 6, 2021
@nielsbasjes
Author

As far as I understand right now:

@djzager / @vinayvenkat / @Nit123: does that mean this issue has been fully resolved (i.e. can it be closed)?
Or is there something remaining?

@fejta-bot

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-contributor-experience at kubernetes/community.
/close

@k8s-ci-robot
Contributor

@fejta-bot: Closing this issue.

In response to this:

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-contributor-experience at kubernetes/community.
/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@Reifier

Reifier commented Nov 15, 2022

/reopen

@k8s-ci-robot
Contributor

@Reifier: You can't reopen an issue/PR unless you authored it or you are a collaborator.

In response to this:

/reopen

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
