
Runtime Error raised by torch.nn.modules.activation.MultiheadAttention when bias=False, batch_first=True #88669

Closed
shakedbr opened this issue Nov 8, 2022 · 9 comments
Labels
oncall: transformer/mha Issues related to Transformers and MultiheadAttention
triaged This issue has been looked at by a team member, and triaged and prioritized into an appropriate module
Milestone
1.13.1

Comments

@shakedbr

shakedbr commented Nov 8, 2022

🐛 Describe the bug

Hi,

When you create a torch.nn.modules.activation.MultiheadAttention module with bias=False and batch_first=True, switch it to evaluation mode, and call the forward pass, an exception is raised:

import torch

x = torch.rand((1, 5, 10))
model = torch.nn.modules.activation.MultiheadAttention(10, 1, bias=False, batch_first=True)
model.eval()
model(x, x, x)
Traceback (most recent call last):
  File "/Users/test.py", line 376, in <module>
    model(x,x,x)
  File "/opt/homebrew/Caskroom/miniforge/base/envs/py39/lib/python3.9/site-packages/torch/nn/modules/module.py", line 1130, in _call_impl
    return forward_call(*input, **kwargs)
  File "/opt/homebrew/Caskroom/miniforge/base/envs/py39/lib/python3.9/site-packages/torch/nn/modules/activation.py", line 1107, in forward
    elif not all([(x.is_cuda or 'cpu' in str(x.device)) for x in tensor_args]):
  File "/opt/homebrew/Caskroom/miniforge/base/envs/py39/lib/python3.9/site-packages/torch/nn/modules/activation.py", line 1107, in <listcomp>
    elif not all([(x.is_cuda or 'cpu' in str(x.device)) for x in tensor_args]):
AttributeError: 'NoneType' object has no attribute 'is_cuda'

It seems that the following lines don't handle the case where a parameter is None (with bias=False, both in_proj_bias and out_proj.bias are None, and both are included in tensor_args).

elif not all([(x.is_cuda or 'cpu' in str(x.device)) for x in tensor_args]):
    why_not_fast_path = "some Tensor argument is neither CUDA nor CPU"
elif torch.is_grad_enabled() and any([x.requires_grad for x in tensor_args]):
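
For illustration, a None-tolerant version of that device check could look something like the sketch below. This is only a guess at the direction of a fix (the helper name is made up here), not the change that actually landed in PyTorch:

def tensors_on_cpu_or_cuda(tensor_args):
    # Hypothetical helper: skip None entries (in_proj_bias and out_proj.bias are None
    # when bias=False) instead of probing .is_cuda on them, which is what raises the
    # AttributeError above.
    return all(
        t.is_cuda or 'cpu' in str(t.device)
        for t in tensor_args
        if t is not None
    )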

Versions

[pip3] numpy==1.23.4
[pip3] torch==1.12.1
[pip3] torch-scatter==2.0.9
[pip3] torchaudio==0.12.1
[pip3] torchvision==0.2.2
[conda] numpy 1.23.4 py39hefdcf20_0 conda-forge
[conda] pytorch 1.12.1 py3.9_0 pytorch
[conda] torch-scatter 2.0.9 pypi_0 pypi
[conda] torchaudio 0.12.1 py39_cpu pytorch
[conda] torchvision 0.2.2 py_3 pytorch

cc @jbschlosser @bhosmer @cpuhrsch @erichan1

@shakedbr shakedbr changed the title Runtime Error raised by torch.nn.modules.activation.MultiheadAttention when bias=True, batch_first=True Runtime Error raised by torch.nn.modules.activation.MultiheadAttention when bias=False, batch_first=True Nov 8, 2022
@albanD albanD added the triaged and oncall: transformer/mha labels Nov 10, 2022
@cpuhrsch
Contributor

Thank you for opening the issue @shakedbr - does this issue persist with newer versions of PyTorch or nightlies?

@mikekgfb mikekgfb added this to the 1.13.1 milestone Nov 11, 2022
@shakedbr
Author

@cpuhrsch, this also happens in version 1.13.0 and in the nightly version 1.14.0.dev20221113, but in those versions you need an even number of heads to reproduce the bug, e.g.:

import torch

x = torch.rand((1, 5, 10))
model = torch.nn.modules.activation.MultiheadAttention(10, num_heads=2, bias=False, batch_first=True)
model.eval()
model(x, x, x)
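
Until a fixed build is available, one possible workaround (assuming the crashing fast-path check is only attempted when batch_first=True, which is what the check order in activation.py suggests) is to keep the default batch_first=False and transpose the inputs yourself:

import torch

x = torch.rand((1, 5, 10))  # (batch, seq, embed)
model = torch.nn.MultiheadAttention(10, num_heads=2, bias=False)  # batch_first defaults to False
model.eval()
# Transpose to (seq, batch, embed) for the module, then back for the caller.
out, attn_weights = model(x.transpose(0, 1), x.transpose(0, 1), x.transpose(0, 1))
out = out.transpose(0, 1)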

@cpuhrsch
Contributor

Thanks for checking @shakedbr - @mikekgfb has sent a fix and it looks like it'll be included in 1.13.1.

@malfet
Contributor

malfet commented Nov 28, 2022

@mikekgfb can you please link the fix to the issue?

@weiwangmeta
Contributor

Can this be closed given the above PR? cc @mikekgfb @atalman @malfet

@mikekgfb
Contributor

mikekgfb commented Dec 2, 2022

Also needs #88854

@weiwangmeta
Contributor

Also needs #88854

Thank you Michael. #89855 (comment) is the cherry-pick to release/1.13

@atalman
Contributor

atalman commented Dec 13, 2022

Closing since the cherry-pick is included in the release.

@atalman atalman closed this as completed Dec 13, 2022