
[Sparse] add Fused Attention kernel and API for SparseCsrTensor #43966

Merged
merged 2 commits into PaddlePaddle:develop from sparse_attention Jul 5, 2022

Conversation

@zhwesky2010 zhwesky2010 (Contributor) commented Jun 30, 2022

PR types

New features

PR changes

OPs

Describe

This PR adds a fused attention API and kernel for SparseCsrTensor:

paddle.incubate.sparse.nn.functional.attention(query, key, value, sparse_mask, key_padding_mask, attn_mask)

import paddle

batch_size = 16
num_heads = 16
seq_len = 512
head_dim = 32

query = paddle.rand([batch_size, num_heads, seq_len, head_dim])
key = paddle.rand([batch_size, num_heads, seq_len, head_dim])
value = paddle.rand([batch_size, num_heads, seq_len, head_dim])

query.stop_gradient = False
key.stop_gradient = False
value.stop_gradient = False

# Dropout zeroes a random subset of positions; the surviving nonzero entries
# define the attention sparsity pattern, stored in CSR format.
mask = paddle.nn.functional.dropout(paddle.ones([seq_len, seq_len])).expand([batch_size*num_heads, seq_len, seq_len])
sp_mask = mask.to_sparse_csr()

# Optional dense 0/1 masks: kp_mask over the keys ([batch_size, seq_len]),
# attn_mask over the attention positions ([seq_len, seq_len]).
kp_mask = paddle.randint(0, 2, [batch_size, seq_len]).astype('float32')
attn_mask = paddle.randint(0, 2, [seq_len, seq_len]).astype('float32')

output = paddle.incubate.sparse.nn.functional.attention(query, key, value, sp_mask, kp_mask, attn_mask)
# kp_mask and attn_mask are optional
output = paddle.incubate.sparse.nn.functional.attention(query, key, value, sp_mask)
output.backward()
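
For reference, the fused kernel computes standard scaled dot-product attention, with the score matrix additionally restricted to the nonzero positions of sp_mask. Below is a minimal dense sketch of the math, for illustration only: it assumes 0 entries in kp_mask/attn_mask mark excluded positions, and it does not replicate the sparse restriction.

import math

# Dense reference sketch (not the PR's kernel).
scores = paddle.matmul(query, key, transpose_y=True) / math.sqrt(head_dim)
# Assumption: a 0 in kp_mask/attn_mask excludes that position from the softmax.
bias = (kp_mask.unsqueeze([1, 2]) * attn_mask - 1.0) * 1e9
probs = paddle.nn.functional.softmax(scores + bias, axis=-1)
out_dense = paddle.matmul(probs, value)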

The NVIDIA interfaces this API calls, such as cusparseDnMatSetStridedBatch and cusparseCsrSetStridedBatch, are only available from CUDA 11.7, so CI cannot run the tests directly. Results of running the unit tests locally:

(screenshot: local unit test results, infoflow 2022-07-04 14-43-28)
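
As a side note, here is a sketch of how such a test can be gated on the runtime CUDA version. The cuda_version helper below is hypothetical, assuming paddle.version.cuda() returns the CUDA version string of a GPU build:

import unittest
import paddle

def cuda_version():
    # Hypothetical helper: parse paddle.version.cuda(), e.g. "11.7" -> (11, 7).
    return tuple(int(x) for x in paddle.version.cuda().split('.')[:2])

# Skip unless Paddle is a CUDA build with the CUDA 11.7+ batched cuSPARSE APIs.
@unittest.skipIf(
    not paddle.is_compiled_with_cuda() or cuda_version() < (11, 7),
    "cusparseDnMatSetStridedBatch/cusparseCsrSetStridedBatch require CUDA 11.7")
class TestSparseAttention(unittest.TestCase):
    ...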

@zhwesky2010 zhwesky2010 changed the title add fused_attention kernel for SparseTensor [Sparse]add fused_attention kernel for SparseTensor Jun 30, 2022
@zhwesky2010 zhwesky2010 changed the title [Sparse]add fused_attention kernel for SparseTensor [Sparse] add fused_attention API and kernel of SparseCsrTensor Jul 1, 2022
@zhwesky2010 zhwesky2010 changed the title [Sparse] add fused_attention API and kernel of SparseCsrTensor [Sparse] add SparseCsrTensor fused_attention kernel and API Jul 1, 2022
@zhwesky2010 zhwesky2010 force-pushed the sparse_attention branch 3 times, most recently from 5f7862e to 8a0e330 July 4, 2022 07:19
DenseTensor* dkey,
DenseTensor* dvalue) {
PD_THROW(
"Only support 'fused_attention' CPU backward kernel of SparseTensor now");
Contributor:

GPU?

Contributor Author:

Done

@@ -0,0 +1,146 @@
# Copyright (c) 2021 PaddlePaddle Authors. All Rights Reserved.
Contributor:

2021->2022

Contributor Author:

Done

@zhwesky2010 zhwesky2010 merged commit 59813de into PaddlePaddle:develop Jul 5, 2022
@zhwesky2010 zhwesky2010 changed the title [Sparse] add SparseCsrTensor fused_attention kernel and API [Sparse] add Fused attention kernel and API for SparseCsrTensor Jul 5, 2022
@zhwesky2010 zhwesky2010 changed the title [Sparse] add Fused attention kernel and API for SparseCsrTensor [Sparse] add Fused Attention kernel and API for SparseCsrTensor Jul 5, 2022