Add input information to fusion definitions for trace inspection and debugging #388
Conversation
That's a really interesting idea, and it would be a very useful addition to query the last used inputs for a specific fusion region. Since this is an nvFuser-executor-specific change, I don't think Thunder is the right place to put these queries.
How would you generalize it to all other execution functions and regions?
Thunder allows getting the `nvfuser.FusionDefinition` object with `execution_trace.python_ctx()["nvFusion0"].last_used`, so maybe this particular code should be a method of `nvfuser.FusionDefinition`?
```python
def __call__(self, *args):
    fd = self.get_fd(to_descriptors(args))
    self.last_used = fd
    self.last_inputs_meta = [(i.size(), i.stride(), i.dtype, i.device) for i in args if isinstance(i, torch.Tensor)]
```
What's the average size of args in GPT models? How much overhead does this line add to the execution?
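To get a feel for the overhead question, here is a rough sketch (my own, not from the PR) that times the metadata-capturing comprehension with `timeit`; the tensor count and shapes are made up and only stand in for a typical fusion's argument list:

```python
import timeit
import torch

# Hypothetical argument list; real fusions may pass more or fewer tensors.
args = [torch.empty(2, 8, 64) for _ in range(16)]

def capture_meta(args):
    # Same comprehension as in FusionDefinitionWrapper.__call__ above.
    return [
        (t.size(), t.stride(), t.dtype, t.device)
        for t in args
        if isinstance(t, torch.Tensor)
    ]

per_call = timeit.timeit(lambda: capture_meta(args), number=10_000) / 10_000
print(f"~{per_call * 1e6:.1f} us per call for {len(args)} tensors")
```

The calls here only read cached tensor attributes, so the cost should scale linearly with the number of tensor arguments rather than with their sizes.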
```python
# @@ -413,6 +415,31 @@ def __call__(self, *args):
def __repr__(self):
    return f"FusionDefinitionWrapper({self.name})"

def last_inputs(self) -> str:
```
plus @kevinstephano to consider if nvfuser itself would like to expose something like this. I think we discussed this briefly before, but maybe we could query nvFuser's Python API for something like "last reproduction" and that would include this information? Maybe the reproduction could be structured so it could be a Python string + contain structured information about the inputs?
For `last_inputs` here, I think we should consider saving non-tensor inputs directly and only saving the metadata of tensor inputs. Non-tensor inputs can only be numbers or sequences of numbers, right? It'd be nice if `last_inputs` returned a valid set of inputs to the fusion, even creating the requested tensors based on the metadata. We could also rename this function to something like `print_last_inputs`.
I'm OK with this logic being in Thunder for now if it's helpful, even if we decide to expose it through nvfuser. When the nvfuser exposure is available we can remove this. Let's just add a warning when it's used that it's experimental and may change in the future.
How does that sound, @riccardofelluga? @IvanYashchuk? @kevinstephano?
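The suggestion above about recreating the requested tensors from saved metadata could be sketched like this; `inputs_from_meta` is a hypothetical helper name, assuming metadata tuples of `(size, stride, dtype, device)` as captured in this PR:

```python
import torch

def inputs_from_meta(meta):
    # empty_strided allocates uninitialized tensors whose shape, stride,
    # dtype, and device match the recorded metadata; the values are
    # arbitrary, which is often acceptable for re-running a fusion.
    return [
        torch.empty_strided(size, stride, dtype=dtype, device=device)
        for size, stride, dtype, device in meta
    ]

# Hypothetical metadata, as FusionDefinitionWrapper.__call__ would record it.
meta = [(torch.Size([4, 8]), (8, 1), torch.float32, torch.device("cpu"))]
tensors = inputs_from_meta(meta)
```

A real implementation might additionally fill the tensors with random values of the right dtype when exact contents matter (e.g. for debugging numerics rather than shapes).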
What does this PR do?
Fixes #387.
This PR adds information about the inputs of a fusion definition so that it can be retrieved by inspecting the trace. It is also preparation for #205, where a tutorial on how to read this information will be published.
Quickly, from a trace:
will print something like:
I'm open to changing the `str` output to returning a list of tensors; however, for debugging it's usually enough to have a string.