
Fix unidirectionally broadcast the shapes steps #663

Open
wants to merge 2 commits into main
Conversation

huningxin
Contributor

@huningxin huningxin commented Apr 29, 2024

Fix #662


@fdwr
Collaborator

fdwr commented Apr 29, 2024

The current definition is correct given the interpretation that the first/input tensor (A) is broadcast to the second/target tensor (B), and the text elsewhere in the spec matches that interpretation. So for expand, the inputShape (A) is broadcast to newShape (B), and for prelu, the slope (A) is broadcast to the input (B) shape. Current text:

partial interface MLGraphBuilder {
  MLOperand expand(MLOperand input, sequence<[EnforceRange] unsigned long> newShape);
};
...
Set outputDescriptor.dimensions to the result of unidirectionally broadcasting the shapes input’s shape and newShape.
...
partial interface MLGraphBuilder {
  MLOperand prelu(MLOperand input, MLOperand slope);
};
...
Set descriptor.dimensions to the result of unidirectionally broadcasting the shapes slope’s shape and input’s shape.
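That interpretation can be sketched in Python (a rough illustration of the spec's current steps, not spec text; the function name and failure convention here are made up):

```python
def unidirectional_broadcast(shape_a, shape_b):
    """Unidirectionally broadcast shape_a to shape_b.

    shape_a (e.g. expand's input shape, or prelu's slope shape) must be
    broadcastable to shape_b (newShape, or prelu's input shape).
    Returns shape_b on success, None on failure.
    """
    # Pad shape_a on the left with 1s so both shapes have equal rank.
    padded_a = [1] * (len(shape_b) - len(shape_a)) + list(shape_a)
    if len(padded_a) != len(shape_b):
        return None  # shape_a has higher rank than shape_b
    for dim_a, dim_b in zip(padded_a, shape_b):
        # Current internal step: dimA must equal dimB or be 1.
        if dim_a != dim_b and dim_a != 1:
            return None
    return list(shape_b)

# expand: input's shape [3, 1] is broadcast to newShape [2, 3, 4]
assert unidirectional_broadcast([3, 1], [2, 3, 4]) == [2, 3, 4]
# prelu: slope's shape [1] is broadcast to input's shape [2, 3]
assert unidirectional_broadcast([1], [2, 3]) == [2, 3]
# Failure: a dimension of 4 cannot stretch to 3
assert unidirectional_broadcast([4], [2, 3]) is None
```

Note the asymmetry: the result is always the second (target) shape, and only dimensions of the first shape may be 1-stretched.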

So if we're going to flip the interpretation, then we'd need to adjust the wording in two places too:

- Set outputDescriptor.dimensions to the result of unidirectionally broadcasting the shapes input’s shape and newShape.
+ Set outputDescriptor.dimensions to the result of unidirectionally broadcasting the shapes newShape and input’s shape.
...
- Set descriptor.dimensions to the result of unidirectionally broadcasting the shapes slope’s shape and input’s shape.
+ Set descriptor.dimensions to the result of unidirectionally broadcasting the shapes input’s shape and slope’s shape.

I see that ONNX, rather than using the parameter order (input -> output) or (original shape -> target shape), lists the parameters as (target shape <- original shape), which feels oddly backwards to me, but I can accept it (technically both definitions are functionally equivalent, just with a different parameter order). The bigger point of confusion is that A and B are simply unclear names, and we could pick some that are clearer. How about originalShape and targetShape, or oldShape and newShape, or another less ambiguous pair?

@huningxin
Contributor Author

huningxin commented Apr 29, 2024

@fdwr

So if we're going to flip the interpretation, then we'd need to adjust the wording in two places too:

We also need to change another caller "unidirectionally broadcastable":

-shapeA is unidirectionally broadcastable to shapeB if [unidirectionally broadcasting the shapes](https://www.w3.org/TR/webnn/#unidirectionally-broadcasting-the-shapes) shapeA and shapeB does not result in failure.
+shapeA is unidirectionally broadcastable to shapeB if [unidirectionally broadcasting the shapes](https://www.w3.org/TR/webnn/#unidirectionally-broadcasting-the-shapes) shapeB and shapeA does not result in failure.

And we need to flip one internal step of "unidirectionally broadcast the shapes":

-If dimA is not equal to dimB and dimA is not equal to 1, then return failure.
+If dimB is not equal to dimA and dimB is not equal to 1, then return failure.
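For illustration, here is a Python sketch of the flipped interpretation (not spec text; the function name is made up): with the target shape as the first argument, the padding and the internal comparison both move to the second argument, and the result is unchanged apart from the argument order.

```python
def broadcast_target_from_source(shape_a, shape_b):
    """Flipped interpretation: shape_b (second argument) is broadcast
    to shape_a (first argument), matching ONNX's parameter order."""
    # Pad shape_b, the source, on the left with 1s to match ranks.
    padded_b = [1] * (len(shape_a) - len(shape_b)) + list(shape_b)
    if len(padded_b) != len(shape_a):
        return None  # source shape has higher rank than the target
    for dim_a, dim_b in zip(shape_a, padded_b):
        # Flipped internal step: dimB must equal dimA or be 1.
        if dim_b != dim_a and dim_b != 1:
            return None
    return list(shape_a)

# Same results as the current definition with the arguments swapped:
assert broadcast_target_from_source([2, 3, 4], [3, 1]) == [2, 3, 4]
assert broadcast_target_from_source([2, 3], [4]) is None
```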

@huningxin
Contributor Author

@fdwr

How about originalShape and targetShape or oldShape and newShape or another less ambiguous pair?

+1. I suppose this ambiguity comes from readers not being able to easily tell the broadcasting direction. How about using fromShape and toShape?

@huningxin
Contributor Author

The bigger point of confusion is that A and B are simply unclear names, and we could pick some that are clearer.

7f2c783 tries to use shapeFrom and shapeTo. Hope they are clearer. Please take another look. Thanks!

index.bs Outdated
1. Let |dimB| be |paddedB|[|index|].
1. [=list/For each=] |index| in [=the range=] 0 to |sizeB|, exclusive:
1. Let |dimA| be |paddedA|[|index|].
1. Let |dimB| be |shapeB|[|index|].
1. If |dimA| is not equal to |dimB| and |dimA| is not equal to 1, then return failure.
Contributor


I want to defer to @fdwr but call out:

If the claim "The current definition is correct given the interpretation of ..." is correct, and this change is turning into a strict (1) reorder and (2) rename of the params, then it seems like this line needs to be altered to maintain the current behavior.

Also, @fdwr's note about updating the "call sites" needs to happen as well?

@inexorabletash
Contributor

Can we move forward here? I'd recommend starting from a fresh change (force-push) that just renames the parameters from A and B (or is that B and A?) to shapeFrom and shapeTo and then we can decide if we want to swap the order or make other changes.

Development

Successfully merging this pull request may close these issues.

"Unidirectionally broadcast the shapes" steps have issues
3 participants