log: noderesourcetopology: per-flow additional log #289

ffromani · 2021-11-10T13:32:25Z

All the following applies first and foremost to the
NodeTopologyMatch plugin, but can be easily generalized.

Troubleshooting the behaviour of the components requires almost
always increasing the log level to peek into the flow.
This is due by the fact that, by default, components want to run
with concise logging to avoid log churn and spam.

But increasing the log level very often requires a configuration
change and a component reboot, which is nontrivial to do.
More than that, restarting requires changing the cluster state,
which can make torubleshooting harder, or just longer, while we
reproduce the state.

More than that, the log level is a global, or in the best case
scenario a per-module setting. On the other hand, we are often
interested in the behaviour of a specific flow, possibly across
components, rather than the behaviour of some components.

Increasing the log level then creates unnecessary and uninteresting
logs, which can actually make harder to track the flow.

It would thus be nice to have the option to dynamically enable
detailed logging per-flow across components, avoiding component restart.

I'm not aware of loggers package, including klog, implementing this
feature, but we can have a pretty close approximation with this PR.

We add a new annotation: pods can opt-in including this annotation
in the new behaviour. This allows extra verbosity only on selected
cases (pods).
We add utility function to tune the verbosiness of the logs with
the aforementioned annotation, overriding the default verbosiness level.

IOW, log messages can be emitted if either

the component (/module) verbosity is high enough
OR
a pod opts in adding the special annotation.

Finally, we consume this new feature in the NodeTopologyMatch plugin,
to both help troubleshooting and to demonstrate its usage.

CAVEATs:

the feature cannot be turned off (it should probably deserve a global
disable flag?)
potentially malicious actors can trigger extra logging adding
unnecessary annotations.
- but these actors need to be able to send pods to the scheduler,
  so they have already means to overload the component and/or cause
  extra logging. Not sure this is a practical concern, highligthing
  for full transparency.

Signed-off-by: Francesco Romani fromani@redhat.com

k8s-ci-robot · 2021-11-10T13:32:32Z

Welcome @fromanirh!

It looks like this is your first PR to kubernetes-sigs/scheduler-plugins 🎉. Please refer to our pull request process documentation to help your PR have a smooth ride to approval.

You will be prompted by a bot to use commands during the review process. Do not be afraid to follow the prompts! It is okay to experiment. Here is the bot commands documentation.

You can also check if kubernetes-sigs/scheduler-plugins has its own contribution guidelines.

You may want to refer to our testing guide if you run into trouble with your tests not passing.

If you are having difficulty getting your pull request seen, please follow the recommended escalation practices. Also, for tips and tricks in the contribution process you may want to read the Kubernetes contributor cheat sheet. We want to make sure your contribution gets all the attention it needs!

Thank you, and welcome to Kubernetes. 😃

k8s-ci-robot · 2021-11-10T13:32:32Z

Hi @fromanirh. Thanks for your PR.

I'm waiting for a kubernetes-sigs member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

ffromani · 2021-11-10T13:32:35Z

/hold

ffromani · 2021-11-10T13:33:01Z

/cc @Tal-or @swatisehgal @AlexeyPerevalov

k8s-ci-robot · 2021-11-10T13:33:03Z

@fromanirh: GitHub didn't allow me to request PR reviews from the following users: Tal-or, AlexeyPerevalov.

Note that only kubernetes-sigs members and repo collaborators can review this PR, and authors cannot review their own PRs.

In response to this:

/cc @Tal-or @swatisehgal @AlexeyPerevalov

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Huang-Wei · 2021-11-10T18:40:52Z

@fromanirh the motivation sounds reasonable to me. Have you talked with sig-instrumentation about promoting it to be a generic klog primitive?

ffromani · 2021-11-10T18:51:04Z

@fromanirh the motivation sounds reasonable to me. Have you talked with sig-instrumentation about promoting it to be a generic klog primitive?

not yet, but I plan to.

ffromani · 2021-11-15T08:27:37Z

/hold cancel
internal test passed

ffromani · 2021-11-15T08:28:13Z

conversation with sig-instrumentation in progress. I'm reviewing klog's history, will discuss the contribution to klog afterwards.

denkensk · 2021-11-17T03:54:44Z

/ok-to-test

Huang-Wei · 2021-11-17T21:04:03Z

/approve

Will leave /lgtm to @Tal-or @swatisehgal @AlexeyPerevalov.

k8s-ci-robot · 2021-11-17T21:04:16Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: fromanirh, Huang-Wei

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [Huang-Wei]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

Tal-or · 2021-11-18T14:34:19Z

It would be nice to have a unit test for this new feature. The Added-value of the unit-test is also an easy way to demonstrate this feature's functionalities.

One more thing - IIUC in order to activate this feature (besides having the annotation) we need to have -v=2 as a command-line argument.
Do we plan to have a follow-up PR with this change?

Besides that, it looks ok to me but I don't have lgtm permissions, so I leave it to the reviewers.

pohly · 2021-11-18T15:17:44Z

I am working on a KEP for contextual logging that I intend to propose for Kubernetes 1.24. That KEP will also address kubernetes/kubernetes#91633 (comment)

With contextual logging, what you are proposing here can be solved without adding more semantic to each log call. It's based on the idea that a logger is passed into a function and then the function does all logging with that logger.

The code that starts processing a pod then can:

add the pod as additional value to the logger, so all log message further down the call chain will include it (the issue above)
change the verbosity of log messages: an important pod could be logged with V(2) and less important ones with V(5) (your objective here)

pohly · 2021-11-18T15:21:18Z

log messages can be emitted if ... a pod opts in adding the special annotation

This probably was pointed out already, but that has security implications. It allows a normal user of a cluster to increase logging of a system component. Probably it won't enable a denial of service attack, but it's worth calling out.

ffromani · 2021-11-18T15:22:45Z

I am working on a KEP for contextual logging that I intend to propose for Kubernetes 1.24. That KEP will also address kubernetes/kubernetes#91633 (comment)

With contextual logging, what you are proposing here can be solved without adding more semantic to each log call. It's based on the idea that a logger is passed into a function and then the function does all logging with that logger.

The code that starts processing a pod then can:
* add the pod as additional value to the logger, so all log message further down the call chain will include it (the issue above)

* change the verbosity of log messages: an important pod could be logged with V(2) and less important ones with V(5) (your objective here)

awesome news! this sounds super interesting and seems to solve my use case brilliantly! I'll be following your KEP, thanks for pointing this out!

Considering this information maybe @Huang-Wei wants to revisit the approval.

ffromani · 2021-11-18T15:24:04Z

log messages can be emitted if ... a pod opts in adding the special annotation

This probably was pointed out already, but that has security implications. It allows a normal user of a cluster to increase logging of a system component. Probably it won't enable a denial of service attack, but it's worth calling out.

Indeed. I mentioned in the CAVEATs section of the commit message, which in retrospect I should probably must have made more evident. I'm curious to learn how you will handle this factor in your KEP!

ffromani · 2021-11-18T15:24:27Z

/hold
per #289 (comment)

pohly · 2021-11-18T15:38:59Z

I'm curious to learn how you will handle this factor in your KEP!

The KEP is just about the logging infrastructure that would enable this use case here. The scheduler would still need to look at pod attributes or some additional configuration to determine which pods are important and which are less important. So the security problem remains open.

Huang-Wei · 2021-11-18T17:58:37Z

Given @pohly is working on a generic solution that's endorsed by k/k, I think we can wait and adopt once it's live. @fromanirh WDYT?

pohly · 2021-11-19T08:27:27Z

Well, not endorsed yet and even once it is, it will take time to get it working completely. Help with both (KEP review/approval and the technical work) will be very welcome.

But I agree, let's first discuss that KEP in the 1.24 time frame before introducing specialized solutions.

ffromani · 2021-11-19T08:58:19Z

Given @pohly is working on a generic solution that's endorsed by k/k, I think we can wait and adopt once it's live. @fromanirh WDYT?

I surely agree that the general solution @pohly is working on is a better approach long term (and in general). I look forward to the KEP and to see the code, and I'd love to help in this effort however I can.

For the specific need of this scheduler plugin, however, I still do see a value in this PR, because:

time to get it merged and available in this project is a factor. With this feature in, we can still collect valuable feedback and still improve the troubleshooting experience
it allows us to experiment with how we make available contextual logging to the users (xref: log: noderesourcetopology: per-flow additional log #289 (comment)). Would this annotation approach be a promising way? we can get some feedback from the field running a limited experiment.

My take is that this PR can serve these purposes. The challanges are:

make very clear this is a intentionally limited feature of this plugin, because the long term solution for everything will be the contextual logging. IOW, drop immediately any generalization attempt like I hinted.
have someone volunteer to remove this code once the generic approach is available (that's easy: I volunteer).

TL;DR: if we label this feature as experimental, limit to this specific plugin, it has still enough value because it allows the project to gather feedback about how to enable users to consume contextual logging (which UX is good and which is not) and allow us to actually have better logging in the short term until we transition to the long term solution. I volunteer to remove this code at that point in time.

swatisehgal · 2021-11-19T13:54:07Z

+1. There is definitely value in this feature in the short-term and explicitly stating that it is an experimental feature sounds like a good approach to me. How do we want to make it explicit that this is an experimental capability? Might be worth having that in the annotation name (experimental.verbose.scheduling.sigs.k8s.io) itself?

All the following applies first and foremost to the NodeTopologyMatch plugin, but can be easily generalized. Troubleshooting the behaviour of the components requires almost always increasing the log level to peek into the flow. This is due by the fact that, by default, components want to run with concise logging to avoid log churn and spam. But increasing the log level very often requires a configuration change and a component reboot, which is nontrivial to do. More than that, restarting requires changing the cluster state, which can make torubleshooting harder, or just longer, while we reproduce the state. More than that, the log level is a global, or in the best case scenario a per-module setting. On the other hand, we are often interested in the behaviour of a specific flow, possibly across components, rather than the behaviour of some components. Increasing the log level then creates unnecessary and uninteresting logs, which can actually make harder to track the flow. It would thus be nice to have the option to dynamically enable detailed logging per-flow across components, avoiding component restart. I'm not aware of loggers package, including klog, implementing this feature, but we can have a pretty close approximation with this PR. We add a new annotation: pods can opt-in including this annotation in the new behaviour. This allows extra verbosity only on selected cases (pods). We add utility function to tune the verbosiness of the logs with the aforementioned annotation, overriding the default verbosiness level. IOW, log messages can be emitted if *either* - the component (/module) verbosity is high enough OR - a pod opts in adding the special annotation. Finally, we consume this new feature in the NodeTopologyMatch plugin, to both help troubleshooting and to demonstrate its usage. CAVEATs: - the feature cannot be turned off (it should probably deserve a global disable flag?) - potentially malicious actors can trigger extra logging adding unnecessary annotations. - but these actors need to be able to send pods to the scheduler, so they have already means to overload the component and/or cause extra logging. Not sure this is a practical concern, highligthing for full transparency. Signed-off-by: Francesco Romani <fromani@redhat.com>

pohly · 2021-12-07T12:53:54Z

I have published a PR with the KEP and intend to discuss it at this week's SIG Instrumentation meeting: kubernetes/enhancements#3078

The first step will be to decide whether the SIG wants to tackle this problem.

I've used this issue as basis for one of the use cases in the KEP.

k8s-ci-robot · 2022-01-20T18:34:39Z

@fromanirh: PR needs rebase.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

ffromani · 2022-02-28T07:44:42Z

considering the ehnancements planned for the next kube cycle and the concerns emerged, I'm closing this PR. I'll be proposing more fine grained logs (to address the same need this PR tackles, but from a different angle) with a later PR.

k8s-ci-robot added the cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. label Nov 10, 2021

k8s-ci-robot added the needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. label Nov 10, 2021

k8s-ci-robot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Nov 10, 2021

k8s-ci-robot requested review from cwdsuzhou and denkensk November 10, 2021 13:32

k8s-ci-robot added the size/M Denotes a PR that changes 30-99 lines, ignoring generated files. label Nov 10, 2021

k8s-ci-robot requested a review from swatisehgal November 10, 2021 13:33

ffromani force-pushed the pod-sched-verbose-annotation branch 2 times, most recently from 31d9cf3 to d1a55fa Compare November 10, 2021 13:37

ffromani force-pushed the pod-sched-verbose-annotation branch 3 times, most recently from c3f7aee to a78f080 Compare November 11, 2021 17:20

k8s-ci-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Nov 13, 2021

ffromani force-pushed the pod-sched-verbose-annotation branch from a78f080 to 387b10c Compare November 15, 2021 08:27

k8s-ci-robot removed the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Nov 15, 2021

k8s-ci-robot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Nov 15, 2021

k8s-ci-robot added ok-to-test Indicates a non-member PR verified by an org member that is safe to test. and removed needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. labels Nov 17, 2021

k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Nov 17, 2021

ffromani force-pushed the pod-sched-verbose-annotation branch from 387b10c to d075b6f Compare November 18, 2021 15:19

k8s-ci-robot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. and removed size/M Denotes a PR that changes 30-99 lines, ignoring generated files. labels Nov 18, 2021

k8s-ci-robot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Nov 18, 2021

k8s-ci-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Nov 24, 2021

ffromani force-pushed the pod-sched-verbose-annotation branch from d075b6f to 29f4db0 Compare November 25, 2021 13:23

k8s-ci-robot removed the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Nov 25, 2021

ffromani force-pushed the pod-sched-verbose-annotation branch from 29f4db0 to ef69c8e Compare November 25, 2021 13:52

ffromani force-pushed the pod-sched-verbose-annotation branch from ef69c8e to d3ab779 Compare November 25, 2021 14:15

k8s-ci-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Jan 20, 2022

pohly mentioned this pull request Feb 2, 2022

KEP-3077: contextual logging kubernetes/enhancements#3078

Merged

ffromani closed this Feb 28, 2022

ffromani deleted the pod-sched-verbose-annotation branch December 16, 2022 10:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

log: noderesourcetopology: per-flow additional log #289

log: noderesourcetopology: per-flow additional log #289

ffromani commented Nov 10, 2021 •

edited

k8s-ci-robot commented Nov 10, 2021

k8s-ci-robot commented Nov 10, 2021

ffromani commented Nov 10, 2021

ffromani commented Nov 10, 2021

k8s-ci-robot commented Nov 10, 2021

Huang-Wei commented Nov 10, 2021

ffromani commented Nov 10, 2021

ffromani commented Nov 15, 2021

ffromani commented Nov 15, 2021

denkensk commented Nov 17, 2021

Huang-Wei commented Nov 17, 2021

k8s-ci-robot commented Nov 17, 2021

Tal-or commented Nov 18, 2021

pohly commented Nov 18, 2021

pohly commented Nov 18, 2021

ffromani commented Nov 18, 2021

ffromani commented Nov 18, 2021

ffromani commented Nov 18, 2021

pohly commented Nov 18, 2021

Huang-Wei commented Nov 18, 2021

pohly commented Nov 19, 2021

ffromani commented Nov 19, 2021

swatisehgal commented Nov 19, 2021

pohly commented Dec 7, 2021

k8s-ci-robot commented Jan 20, 2022

ffromani commented Feb 28, 2022

log: noderesourcetopology: per-flow additional log #289

log: noderesourcetopology: per-flow additional log #289

Conversation

ffromani commented Nov 10, 2021 • edited

k8s-ci-robot commented Nov 10, 2021

k8s-ci-robot commented Nov 10, 2021

ffromani commented Nov 10, 2021

ffromani commented Nov 10, 2021

k8s-ci-robot commented Nov 10, 2021

Huang-Wei commented Nov 10, 2021

ffromani commented Nov 10, 2021

ffromani commented Nov 15, 2021

ffromani commented Nov 15, 2021

denkensk commented Nov 17, 2021

Huang-Wei commented Nov 17, 2021

k8s-ci-robot commented Nov 17, 2021

Tal-or commented Nov 18, 2021

pohly commented Nov 18, 2021

pohly commented Nov 18, 2021

ffromani commented Nov 18, 2021

ffromani commented Nov 18, 2021

ffromani commented Nov 18, 2021

pohly commented Nov 18, 2021

Huang-Wei commented Nov 18, 2021

pohly commented Nov 19, 2021

ffromani commented Nov 19, 2021

swatisehgal commented Nov 19, 2021

pohly commented Dec 7, 2021

k8s-ci-robot commented Jan 20, 2022

ffromani commented Feb 28, 2022

ffromani commented Nov 10, 2021 •

edited