Add test for torchscripting nn.TransformerEncoder, including fast path #79796

erichan1 · 2022-06-17T17:35:56Z

Summary:
Add a smoke test that checks whether TransformerEncoder crashes for any combination of the params [with_no_grad, use_torchscript, training].

Motivation: the TransformerEncoder fast path (so with_no_grad=True) combined with use_torchscript=True would crash with the error that NestedTensor doesn't have size. The fast path automatically generates a NestedTensor as a perf optimization, and torchscript attempts to find intermediate tensor sizes while it optimizes. NestedTensor has not implemented a size method, so the size lookup fails.
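The enumeration described above can be sketched in plain Python. This is a minimal illustration, not the test added by this PR: the three parameter names come from the summary, while `enumerate_cases`, `smoke_test`, `run_case`, and `fake_run_case` are hypothetical names, and the stand-in only imitates the reported crash condition.

```python
import itertools
from typing import Callable, Dict, List, Tuple

# Parameter names taken from the PR summary; everything else is illustrative.
PARAM_NAMES = ("with_no_grad", "use_torchscript", "training")


def enumerate_cases() -> List[Dict[str, bool]]:
    """Return every True/False combination of the three parameters."""
    return [
        dict(zip(PARAM_NAMES, values))
        for values in itertools.product([True, False], repeat=len(PARAM_NAMES))
    ]


def smoke_test(
    run_case: Callable[..., None],
) -> List[Tuple[Dict[str, bool], Exception]]:
    """Call run_case once per combination and collect any exception raised.

    run_case is a hypothetical callable that accepts the three keyword
    arguments and runs the model once.
    """
    failures = []
    for case in enumerate_cases():
        try:
            run_case(**case)
        except Exception as exc:  # crash-only check: any exception is a failure
            failures.append((case, exc))
    return failures


def fake_run_case(with_no_grad: bool, use_torchscript: bool, training: bool) -> None:
    """Stand-in for the real model call (assumption, not PyTorch behavior).

    It raises only for the combination the summary reports as crashing.
    """
    if with_no_grad and use_torchscript:
        raise RuntimeError("NestedTensor does not implement size")


if __name__ == "__main__":
    for case, exc in smoke_test(fake_run_case):
        print(case, "->", exc)
```

In a real test, `run_case` would presumably build an nn.TransformerEncoder, switch it with `.train()` or `.eval()`, optionally wrap it with `torch.jit.script`, and run the forward pass under `torch.no_grad()` when `with_no_grad` is set. Collecting failures instead of stopping at the first one shows every crashing combination in a single run.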

This test goes together with the fix in #79480.

Test Plan:

buck build --show-output mode/opt -c fbcode.enable_gpu_sections=true -c fbcode.nvcc_arch=a100 mode/inplace  //caffe2/test:transformers

./fbcode/buck-out/gen/caffe2/test/transformers#binary.par

Test runs and passes together with the changes from the PR above (I made another diff on top of this with those changes). Does not pass without the fix.

Reviewed By: mikekgfb

Differential Revision: D37222923

…ch#79576)

Summary:
Pull Request resolved: pytorch#79576

Includes both forced and incremental decoding.

Add a new file test_transformers.py to put transformers tests in and move away from huge monolithic test_nn.py. A todo item is to move existing transformer tests from test_nn.py to test_transformers.py.

Add a numerical test comparing torch.nn._transformer_decoder_layer_fwd and fairseq decoder. Both decoders use the weights of a common nn.TransformerEncoder.

Test Plan:
```
buck build --show-output mode/opt -c fbcode.enable_gpu_sections=true -c fbcode.nvcc_arch=a100 mode/inplace //caffe2/test:transformers
./fbcode/buck-out/gen/caffe2/test/transformers#binary.par
```
Test runs and passes!

Differential Revision: D37157391

fbshipit-source-id: 3b3f1c7fdac8269278982e0dcc2d32ff6b63547d

facebook-github-bot · 2022-06-17T17:36:04Z

🔗 Helpful links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/79796

✅ No Failures (0 Pending)

As of commit 6d647a8 (more details on the Dr. CI page):

💚 💚 Looks good so far! There are no failures yet. 💚 💚

facebook-github-bot · 2022-06-17T17:36:41Z

This pull request was exported from Phabricator. Differential Revision: D37222923

zrphercule

This LGTM, thanks!

erichan1 · 2022-06-17T21:59:30Z

@pytorchbot merge

pytorchmergebot · 2022-06-17T22:00:44Z

@pytorchbot successfully started a merge job. Check the current status here

github-actions · 2022-06-17T22:01:22Z

Hey @erichan1.
You've committed this PR, but it does not have both a 'release notes: ...' and 'topics: ...' label. Please add one of each to the PR. The 'release notes: ...' label should represent the part of PyTorch that this PR changes (fx, autograd, distributed, etc) and the 'topics: ...' label should represent the kind of PR it is (not user facing, new feature, bug fix, perf improvement, etc). The list of valid labels can be found here for the 'release notes: ...' and here for the 'topics: ...'.
For changes that are 'topic: not user facing' there is no need for a release notes label.

erichan1 · 2022-06-20T16:07:21Z

This PR also included #79576 (also approved). I intended to merge the two as stacked PRs, but by mistake I merged only this one, which contained the code for both.

* Add test for torchscripting nn.TransformerEncoder, including fast path (#79796) (#79796)

  Summary: Add test just to check if TransformerEncoder will crash when enumerating over params [with_no_grad, use_torchscript, training].

  Motivation for this was that TransformerEncoder fast path (so with_no_grad=True) and use_torchscript=True would crash with the issue that NestedTensor doesn't have size. This was caused because the TransformerEncoder fast path generates a NestedTensor automatically as a perf optimization and torchscript attempts to find intermediate tensor sizes while it optimizes. But NestedTensor has not implemented a size method, so things fail.

  This test goes together with this fix #79480

  Pull Request resolved: #79796
  Approved by: https://github.com/zrphercule

  Test Plan: contbuild & OSS CI, see https://hud.pytorch.org/commit/pytorch/pytorch/06274d7a487bf7995da77b9df9b5c1f7dc13f35b

  Test plan from GitHub:
  ```
  buck build --show-output mode/opt -c fbcode.enable_gpu_sections=true -c fbcode.nvcc_arch=a100 mode/inplace //caffe2/test:transformers
  ./fbcode/buck-out/gen/caffe2/test/transformers#binary.par
  ```
  Test runs and passes together with the changes from the PR above (I made another diff on top of this with those changes). Does not pass without the fix.

  Reviewed By: mikekgfb
  Differential Revision: D37222923
  Pulled By: erichan1
  fbshipit-source-id: 5a16e7d240cb51c0a613d16a79931d41122aba8b

* disable src mask for transformer and multiheadattention fastpath (#81277) (#81277)

  Summary: Disable fastpath if src_mask passed to TransformerEncoderLayer and MultiheadAttention.
  - Refactored test_transformerencoder from test_nn.py to test_transformers.py. Added a src_mask test there.
  - Added a specific src_mask test in test_transformers.py

  Fixes #81129

  Pull Request resolved: #81277
  Approved by: https://github.com/zrphercule

  Test Plan: contbuild & OSS CI, see https://hud.pytorch.org/commit/pytorch/pytorch/23088fcfdf77632d4e6db4d35ce62735ca6622d2

  Reviewed By: DanilBaibak
  Differential Revision: D37919513
  Pulled By: erichan1
  fbshipit-source-id: 0697d789634775136897fdb6a310356a6a45030d

* remove decoder tests for feature not in 1.12

* remove unnecessary changes from #77903 to make changes more minimal

erichan1 added 2 commits June 17, 2022 10:31

erichan1 requested a review from a team as a code owner June 17, 2022 17:35

facebook-github-bot added the cla signed label Jun 17, 2022

facebook-github-bot added the fb-exported label Jun 17, 2022

erichan1 requested a review from zrphercule June 17, 2022 17:48

erichan1 mentioned this pull request Jun 17, 2022

[JIT] Nested fix #79480

Closed

zrphercule approved these changes Jun 17, 2022

pytorchmergebot added the Merged label Jun 17, 2022

pytorchmergebot closed this in 06274d7 Jun 17, 2022

erichan1 added the module: tests, release notes: jit, release notes: nn, topic: bug fixes, and topic: not user facing labels Jun 17, 2022

erichan1 mentioned this pull request Jun 20, 2022

Add numerical test comparing BetterDecoder and fairseq decoder #79576

Closed

erichan1 mentioned this pull request Jul 13, 2022

Transformer and CPU path with src_mask raises error with torch 1.12 #81129

Closed
