Add Split-20 - change uneven split behavior to be more torch-like #5321

p-wysocki · 2023-06-15T15:02:51Z

Description

Introduce Split-20, which performs uneven tensor split in torch's manner: instead of reducing only the size of the last dimension of the output, it reduces the last n dimensions' sized by 1 each.

This was done using a mode attribute, which has 2 possible values: torch (default) and legacy. It's up to discussion if it should stay like that or maybe it should be renamed or changed to a boolean.

Motivation and Context

Issue: #4742

TODO

Update reference evaluator
Confirm the correctness of setting a 0-size dimension in ONNX (test_split_uneven_split_3d_4)
Add more test cases and assert backend tests are correct

Signed-off-by: p-wysocki <przemyslaw.wysocki@intel.com>

onnx/defs/tensor/defs.cc

xadupre · 2023-06-15T15:12:19Z

onnx/defs/tensor/defs.cc

+            "Uneven split mode. "
+            "Possible values are 'torch' (default) and 'legacy'.",
+            AttributeProto::STRING,
+            std::string("torch"))


Should we keep the legacy as default?

As far as I know torch is the desired one, and I set legacy to be default in version converter when coming from opset 19 to 20. I can change it.

Good question. I think the current choice is ok. Especially if it is only the torch exporter that uses this, and presumably the torch converter needs the new option, not the old one.

Signed-off-by: p-wysocki <przemyslaw.wysocki@intel.com>

gramalingam · 2023-06-15T15:49:20Z

onnx/defs/tensor/defs.cc

+                    split.push_back(chunk_size);
+                  }
+                } else {
+                  int chunk_size = (split_dim_value / num_outputs) + 1;


Actually, this doesn't always work. Eg., if we split 7 values into 5, then this will produce 4, 4, 4, 4, -1 which doesn't make sense. We should do 4, 3, 0, 0, 0, I think. We may need to change the description accordingly. In the other mode, we will do 2, 2, 1, 1, 1.

Signed-off-by: p-wysocki <przemyslaw.wysocki@intel.com>

onnx/defs/tensor/defs.cc

onnx/version_converter/convert.h

gramalingam · 2023-07-26T19:04:33Z

onnx/backend/test/case/node/split.py

+            "Split",
+            inputs=["input"],
+            outputs=["output_1", "output_2", "output_3"],
+            axis=0,


Instead of changing the axis, it would make sense to change the new attribute introduced minimize_diff or whatever it is called. At least, I am looking for some test-case to show the difference that attribute makes, whether that is this test or some other test.

gramalingam · 2023-08-21T22:35:11Z

Hi, is this ready for review? Would it be possible to complete this PR for the upcoming release?

justinchuby · 2023-08-30T04:14:30Z

@p-wysocki

Signed-off-by: p-wysocki <przemyslaw.wysocki@intel.com>

Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

onnx/defs/tensor/defs.cc

Signed-off-by: p-wysocki <przemyslaw.wysocki@intel.com>

onnx/test/shape_inference_test.py

Signed-off-by: p-wysocki <przemyslaw.wysocki@intel.com>

Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

p-wysocki · 2023-10-05T14:06:01Z

Added new tests, generated docs and test files are on the way - I'm currently having some issues with protobuf in my environment, which stops me from generating them.

gramalingam · 2023-10-05T15:54:27Z

It will also have to be moved to a later opset (21, I guess?)

zhenhuaw-me · 2023-12-05T05:41:24Z

Spec like mode:torch requires background of how Torch implements it.

We can make it more specific by:

Keep num_outputs, and change the semantic to match numpy.array_split and torch.tensor_split

If indices_or_sections is an integer n or a zero dimensional long tensor with value n, input is split into n sections along dimension dim. If input is divisible by n along dimension dim, each section will be of equal size, input.size(dim) / n. If input is not divisible by n, the sizes of the first int(input.size(dim) % n) sections will have size int(input.size(dim) / n) + 1, and the rest will have size int(input.size(dim) / n).

Introduce split_size, which is the same as torch.split:

If split_size_or_sections is an integer type, then tensor will be split into equally sized chunks (if possible). Last chunk will be smaller if the tensor size along the given dimension dim is not divisible by split_size.

Add Split20

6db44ff

Signed-off-by: p-wysocki <przemyslaw.wysocki@intel.com>

p-wysocki requested review from a team as code owners June 15, 2023 15:02

xadupre reviewed Jun 15, 2023

View reviewed changes

onnx/defs/tensor/defs.cc Outdated Show resolved Hide resolved

xadupre reviewed Jun 15, 2023

View reviewed changes

Minor change

9c6938f

Signed-off-by: p-wysocki <przemyslaw.wysocki@intel.com>

p-wysocki marked this pull request as draft June 15, 2023 15:33

gramalingam reviewed Jun 15, 2023

View reviewed changes

gramalingam added the operator Issues related to ONNX operators label Jul 5, 2023

p-wysocki added 3 commits July 6, 2023 12:52

Merge remote-tracking branch 'upstream/main' into split20

2aa2fd1

Signed-off-by: p-wysocki <przemyslaw.wysocki@intel.com>

Rename mode to minimize_diff

3b8cfca

Signed-off-by: p-wysocki <przemyslaw.wysocki@intel.com>

attempt to fix build

aab6d3f

Signed-off-by: p-wysocki <przemyslaw.wysocki@intel.com>

xadupre reviewed Jul 26, 2023

View reviewed changes

onnx/defs/tensor/defs.cc Outdated Show resolved Hide resolved

xadupre reviewed Jul 26, 2023

View reviewed changes

onnx/defs/tensor/defs.cc Outdated Show resolved Hide resolved

gramalingam reviewed Jul 26, 2023

View reviewed changes

onnx/version_converter/convert.h Outdated Show resolved Hide resolved

gramalingam reviewed Jul 26, 2023

View reviewed changes

justinchuby added this to the 1.15 milestone Aug 30, 2023

p-wysocki added 3 commits September 22, 2023 11:25

Merge remote-tracking branch 'upstream/main' into split20

b0010da

some CR changes

a911415

Signed-off-by: p-wysocki <przemyslaw.wysocki@intel.com>

Update docs

4a0ab8e

Signed-off-by: p-wysocki <przemyslaw.wysocki@intel.com>

justinchuby added the auto update doc Generate md/proto files automatically using the CI pipeline label Sep 22, 2023

p-wysocki and others added 2 commits September 22, 2023 16:44

hopefully fix build on CI

7c50c89

Signed-off-by: p-wysocki <przemyslaw.wysocki@intel.com>

CI:apply auto updated documentation/backend test data

2587995

Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

justinchuby reviewed Sep 22, 2023

View reviewed changes

onnx/defs/tensor/defs.cc Outdated Show resolved Hide resolved

p-wysocki added 3 commits October 3, 2023 15:36

Merge remote-tracking branch 'upstream/main' into split20

6d57f33

Try to fix compilation on CI

c5c6a75

Signed-off-by: p-wysocki <przemyslaw.wysocki@intel.com>

cast vector size to int64

a39060b

Signed-off-by: p-wysocki <przemyslaw.wysocki@intel.com>

github-advanced-security bot found potential problems Oct 3, 2023

View reviewed changes

onnx/test/shape_inference_test.py Fixed Show fixed Hide fixed

onnx/test/shape_inference_test.py Fixed Show fixed Hide fixed

onnx/test/shape_inference_test.py Fixed Show fixed Hide fixed

onnx/test/shape_inference_test.py Fixed Show fixed Hide fixed

p-wysocki and others added 3 commits October 5, 2023 15:51

Change attribute to req, add tests

9a4b19e

Signed-off-by: p-wysocki <przemyslaw.wysocki@intel.com>

remove whitespaces

9af3089

Signed-off-by: p-wysocki <przemyslaw.wysocki@intel.com>

CI:apply auto updated documentation/backend test data

aed30e7

Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

justinchuby modified the milestones: 1.15, 1.16 Nov 4, 2023

liqunfu mentioned this pull request Dec 4, 2023

Operator spec of Split operator's attribute num_outputs is wrong #5766

Open

justinchuby modified the milestones: 1.16, 1.17 Feb 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Split-20 - change uneven split behavior to be more torch-like #5321

Add Split-20 - change uneven split behavior to be more torch-like #5321

p-wysocki commented Jun 15, 2023

xadupre Jun 15, 2023

p-wysocki Jun 15, 2023

gramalingam Jun 21, 2023

gramalingam Jun 15, 2023

gramalingam Jul 26, 2023

gramalingam commented Aug 21, 2023

justinchuby commented Aug 30, 2023

p-wysocki commented Oct 5, 2023

gramalingam commented Oct 5, 2023

zhenhuaw-me commented Dec 5, 2023

Add Split-20 - change uneven split behavior to be more torch-like #5321

Are you sure you want to change the base?

Add Split-20 - change uneven split behavior to be more torch-like #5321

Conversation

p-wysocki commented Jun 15, 2023

Description

Motivation and Context

TODO

xadupre Jun 15, 2023

Choose a reason for hiding this comment

p-wysocki Jun 15, 2023

Choose a reason for hiding this comment

gramalingam Jun 21, 2023

Choose a reason for hiding this comment

gramalingam Jun 15, 2023

Choose a reason for hiding this comment

gramalingam Jul 26, 2023

Choose a reason for hiding this comment

gramalingam commented Aug 21, 2023

justinchuby commented Aug 30, 2023

p-wysocki commented Oct 5, 2023

gramalingam commented Oct 5, 2023

zhenhuaw-me commented Dec 5, 2023