
Tensor Dialect canonicalizer FoldTensorCastProducerOp can produce invalid IR #91265

Closed
christopherbate opened this issue May 6, 2024 · 7 comments · Fixed by #92681

@christopherbate
Contributor

Reproducer:

```mlir
// RUN: mlir-opt -canonicalize
func.func @materialize_in_destination_tensor_cast(%arg0: tensor<4xf32>, %arg1: index) -> tensor<?xf32> {
  %0 = bufferization.alloc_tensor(%arg1) : tensor<?xf32>
  %1 = tensor.cast %arg0 : tensor<4xf32> to tensor<?xf32>
  %2 = bufferization.materialize_in_destination %1 in %0 : (tensor<?xf32>, tensor<?xf32>) -> tensor<?xf32>
  return %2 : tensor<?xf32>
}
```

FoldTensorCastProducerOp, defined in lib/Dialect/Tensor/IR/TensorOps.cpp, assumes that any operand of an op implementing DestinationStyleOpInterface can absorb a tensor.cast that erases static information from the type. However, that assumption is not encoded anywhere in the contract of DestinationStyleOpInterface. For example, `bufferization.materialize_in_destination` requires that the shapes of its DPS input and DPS output operands match, so the canonicalizer produces IR that fails verification:

```
within split at test/Dialect/Bufferization/canonicalize.mlir:392 offset :6:8: error: 'bufferization.materialize_in_destination' op failed to verify that all of {source, dest} have same shape
  %2 = bufferization.materialize_in_destination %1 in %0 : (tensor<?xf32>, tensor<?xf32>) -> tensor<?xf32>
       ^
within split at test/Dialect/Bufferization/canonicalize.mlir:392 offset :6:8: note: see current operation: %1 = "bufferization.materialize_in_destination"(%arg0, %0) : (tensor<4xf32>, tensor<?xf32>) -> tensor<?xf32>
```
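For context, the core of the folding logic looks roughly like the sketch below. This is a simplified illustration, not the exact upstream code: it only handles DPS inputs, while the real pattern also folds casts on DPS inits and recomputes result types accordingly.

```cpp
#include "mlir/Dialect/Tensor/IR/Tensor.h"
#include "mlir/IR/PatternMatch.h"
#include "mlir/Interfaces/DestinationStyleOpInterface.h"

using namespace mlir;

// Simplified sketch of FoldTensorCastProducerOp (illustrative only).
struct FoldTensorCastProducerOpSketch
    : public OpInterfaceRewritePattern<DestinationStyleOpInterface> {
  using OpInterfaceRewritePattern::OpInterfaceRewritePattern;

  LogicalResult matchAndRewrite(DestinationStyleOpInterface op,
                                PatternRewriter &rewriter) const override {
    // Collect DPS inputs whose producing tensor.cast merely erased static
    // shape information; canFoldIntoConsumerOp checks exactly that.
    SmallVector<std::pair<OpOperand *, Value>> foldable;
    for (OpOperand &operand : op->getOpOperands()) {
      if (!op.isDpsInput(&operand))
        continue;
      auto castOp = operand.get().getDefiningOp<tensor::CastOp>();
      if (castOp && tensor::canFoldIntoConsumerOp(castOp))
        foldable.push_back({&operand, castOp.getSource()});
    }
    if (foldable.empty())
      return failure();
    // Swap in the more static cast sources. Nothing here asks the consumer
    // whether its verifier accepts the new operand types -- that missing
    // check is exactly the bug reported in this issue.
    rewriter.modifyOpInPlace(op, [&] {
      for (auto [operand, source] : foldable)
        operand->set(source);
    });
    return success();
  }
};
```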
thexujie pushed a commit to thexujie/llvm-project that referenced this issue May 7, 2024
…ith `bufferization.materialize_in_destination`

Attempts to address a bug pointed out in llvm/llvm-project#91265
by relaxing the requirement for source/dest shapes to match in the
`bufferization.materialize_in_destination` operation. The relaxation
allows differences in static vs dynamic dims but still rejects cases
where the shapes are statically known to be different.
christopherbate added a commit to christopherbate/llvm-project that referenced this issue May 7, 2024
Attempts to address a bug pointed out in llvm#91265
by moving the FoldTensorCastProducerOp canonicalizer definition upward into
the MLIRDialectUtils library. Since MLIRDialectUtils can't depend on any
dialect, the canonicalizer had to change slightly, and a templated version
is introduced.

Then, we need to add this canonicalization routine where it was used before,
except for places where it is incorrect, as pointed out in the bug.
Based on cursory inspection of the TableGen definitions, only
`bufferization.materialize_in_destination` should *not* have the
canonicalizer, but existing tests passed when the canonicalizer was
only added for `tensor.pack|unpack|extract_slice` and the LinalgOp interface.
I went ahead and added tests where they were missing.
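A rough, hypothetical sketch of what such a templated, dialect-independent variant could look like (the names and the control-predicate design are illustrative, not necessarily what the PR does):

```cpp
#include <functional>

#include "mlir/IR/PatternMatch.h"
#include "mlir/Interfaces/DestinationStyleOpInterface.h"

using namespace mlir;

// Hypothetical: the concrete cast op is a template parameter and the
// "is this cast absorbable?" check is a callback, so the utility library
// needs no dependency on the tensor dialect.
template <typename CastOpTy>
struct FoldCastProducerOp
    : public OpInterfaceRewritePattern<DestinationStyleOpInterface> {
  using ControlFn = std::function<bool(CastOpTy)>;

  FoldCastProducerOp(MLIRContext *ctx, ControlFn controlFn)
      : OpInterfaceRewritePattern<DestinationStyleOpInterface>(ctx),
        controlFn(std::move(controlFn)) {}

  LogicalResult matchAndRewrite(DestinationStyleOpInterface op,
                                PatternRewriter &rewriter) const override {
    bool changed = false;
    for (OpOperand &operand : op->getOpOperands()) {
      auto castOp = operand.get().getDefiningOp<CastOpTy>();
      if (!castOp || !controlFn(castOp))
        continue;
      // Replace the operand with the cast's (more static) source value.
      rewriter.modifyOpInPlace(
          op, [&] { operand.set(castOp->getOperand(0)); });
      changed = true;
    }
    return success(changed);
  }

  ControlFn controlFn;
};
```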
matthias-springer added a commit that referenced this issue May 19, 2024
…ze_in_destination` op

This commit relaxes the verifier of `bufferization.materialize_in_destination` such that mixed static/dynamic dimensions are allowed for the source and destination operands. E.g., `tensor<5xf32>` and `tensor<?xf32>` are now compatible, but it is assumed that the dynamic dimension is `5` at runtime.

This commit fixes #91265.
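In other words, the relaxed rule is: two shapes are compatible if they have the same rank and every dimension pair is either equal or has at least one dynamic side. A minimal sketch of that check, assuming nothing about the actual verifier code:

```cpp
#include "mlir/IR/BuiltinTypes.h"
#include "llvm/ADT/STLExtras.h"

using namespace mlir;

// Illustrative helper: a dynamic dimension (?) is compatible with any static
// size, so tensor<5xf32> vs. tensor<?xf32> passes, while tensor<4xf32> vs.
// tensor<5xf32> is rejected.
static bool areShapesCastCompatible(ArrayRef<int64_t> a, ArrayRef<int64_t> b) {
  if (a.size() != b.size())
    return false;
  for (auto [da, db] : llvm::zip_equal(a, b))
    if (da != db && !ShapedType::isDynamic(da) && !ShapedType::isDynamic(db))
      return false;
  return true;
}
```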
@matthias-springer
Member

matthias-springer commented May 19, 2024

#92681 fixes the issue for materialize_in_destination. A more general fix would make the canonicalization pattern opt-in for each op that supports it. A few ops are already hard-coded as exclusions in the canonicalization pattern (e.g., scf.forall), but it is not always possible or practical to add more ops to that list.

(Generalizing materialize_in_destination as in #92681 still makes sense because memref.copy also supports mixed static/dynamic dimensions.)

@joker-eph
Collaborator

This looks more like a workaround than a fix to me: a canonicalization shouldn't change the type of an SSA value without more information.

@matthias-springer
Member

I agree, it's likely still broken for other ops. I tried to make this pattern opt-in for each op that supports it: only ops that support the pattern add it in their getCanonicalizationPatterns function, with the pattern rooted at only the specified op name.

This requires a few more cleanups first, so I haven't gotten to it yet. E.g., tensor.pack currently uses hasCanonicalizeMethod, but we want hasCanonicalizer so that I can add the pattern. (I think the two cannot be used at the same time?)

@christopherbate
Contributor Author

christopherbate commented May 20, 2024

> I agree, it's likely still broken for other ops. I tried to make this pattern opt-in for each op that supports it: only ops that support the pattern add it in their getCanonicalizationPatterns function, with the pattern rooted at only the specified op name.

@matthias-springer @joker-eph I opened PR #91382 two weeks ago to do just this. That PR makes the pattern opt-in.

@matthias-springer
Member

Do you mean #91382?

I think there's a simpler solution (without that many templates): add an additional StringRef opName constructor to FoldTensorCastProducerOp (passed through to the Pattern constructor) such that the pattern applies only to the specified op name. (Same for op interfaces; in that case you need a TypeID.)

Then have a public function that populates the pattern for a given op name or interface type ID. That "populate function" can then be called from the getCanonicalizationPatterns functions of the ops that support this kind of transformation.
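A hypothetical sketch of that suggestion (the constructor shape and the populate-function name are illustrative, not upstream API):

```cpp
#include "mlir/IR/PatternMatch.h"

using namespace mlir;

// Illustrative: root the pattern at a single op name so it only ever fires
// on ops that explicitly opted in.
struct FoldTensorCastProducerOp : public RewritePattern {
  FoldTensorCastProducerOp(StringRef opName, MLIRContext *ctx)
      : RewritePattern(opName, /*benefit=*/1, ctx) {}

  LogicalResult matchAndRewrite(Operation *op,
                                PatternRewriter &rewriter) const override {
    // Body elided: same folding logic as today, but reachable only through
    // the op name passed at construction time.
    return failure();
  }
};

// Hypothetical populate function, to be called from the
// getCanonicalizationPatterns hook of each op that supports absorbing a
// shape-erasing tensor.cast.
void populateFoldTensorCastPatterns(RewritePatternSet &patterns,
                                    StringRef opName) {
  patterns.add<FoldTensorCastProducerOp>(opName, patterns.getContext());
}
```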

@matthias-springer
Member

If you have time, I'd recommend two PRs: one that makes the canonicalizer of tensor.pack/unpack pattern-based (let hasCanonicalizer = 1), and another that adds the opName/interfaceID constructor to FoldTensorCastProducerOp and adds the pattern to all necessary getCanonicalizationPatterns functions.
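Assuming the populate function sketched above, the second PR would mostly amount to hooking it into each supporting op's canonicalization hook, roughly like this for tensor.pack (illustrative):

```cpp
// Once tensor.pack declares `let hasCanonicalizer = 1` in ODS, it gets this
// hook, where the cast-folding pattern can be registered alongside the
// patterns that replace its old canonicalize method.
void PackOp::getCanonicalizationPatterns(RewritePatternSet &results,
                                         MLIRContext *context) {
  populateFoldTensorCastPatterns(results, PackOp::getOperationName());
}
```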

@christopherbate
Contributor Author

> Do you mean #91382?

Oops, yeah, thanks.

> If you have time, I'd recommend two PRs: one that makes the canonicalizer of tensor.pack/unpack pattern-based (let hasCanonicalizer = 1), and another that adds the opName/interfaceID constructor to FoldTensorCastProducerOp and adds the pattern to all necessary getCanonicalizationPatterns functions.

OK, I should be able to work on it on Friday.

matthias-springer added a commit that referenced this issue Jun 1, 2024
…ze_in_destination` op (#92681)

@EugeneZelenko EugeneZelenko added mlir:bufferization Bufferization infrastructure and removed mlir:tensor labels Jun 1, 2024
HendrikHuebner pushed a commit to HendrikHuebner/llvm-project that referenced this issue Jun 2, 2024
…ze_in_destination` op (llvm#92681)
