
Device error on TokenClassificationPipeline #13816

Closed
2 tasks done
ierezell opened this issue Sep 30, 2021 · 6 comments

@ierezell (Contributor)

Environment info

  • transformers version: 4.11.0
  • Platform: Linux-5.14.8-arch1-1-x86_64-with-arch
  • Python version: 3.7.11
  • PyTorch version (GPU?): 1.9.1+cu102 (True)
  • Tensorflow version (GPU?): not installed (NA)
  • Flax version (CPU?/GPU?/TPU?): not installed (NA)
  • Jax version: not installed
  • JaxLib version: not installed
  • Using GPU in script?: True
  • Using distributed or parallel set-up in script?: False

Who can help

Library:

Information

Model I am using (Bert, XLNet ...):

The problem arises when using:

  • the official example scripts: (give details below)

The task I am working on is:

  • an official GLUE/SQUaD task: (give the name)

To reproduce

Steps to reproduce the behavior:

  1. Create a pipeline: `pipe = TokenClassificationPipeline(model=DistilBertForTokenClassification.from_pretrained("PATH"))`
  2. Pipe some text through it: `pipe(["My", "text", "tokens"])`
  3. Get a `TypeError: can't convert cuda:0 device type tensor to numpy. Use Tensor.cpu() to copy the tensor to host memory first.`
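The failing step can be reproduced outside the pipeline entirely; this is a minimal sketch of the underlying behavior in plain `torch` (nothing transformers-specific), guarded so it also runs on CPU-only machines:

```python
import torch

# The pipeline calls .numpy() directly on the model output. That works
# for CPU tensors, but a tensor living on a CUDA device cannot be
# converted without an explicit copy back to host memory.
t = torch.zeros(3)

if torch.cuda.is_available():
    t = t.to("cuda")
    try:
        t.numpy()
    except TypeError as e:
        # "can't convert cuda:0 device type tensor to numpy. Use
        # Tensor.cpu() to copy the tensor to host memory first."
        print(e)

# Calling .cpu() first works regardless of where the tensor lives.
print(t.cpu().numpy())  # [0. 0. 0.]
```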

Expected behavior

Be able to run the pipeline.

The pipeline should move the data (or the model) between GPU and CPU as needed, in both directions.

The traceback

In .venv/lib/python3.7/site-packages/transformers/pipelines/token_classification.py:209 in `_forward`:

```
    206 │   if self.framework == "tf":
    207 │       outputs = self.model(model_inputs.data)[0][0].numpy()
    208 │   else:
 ❱  209 │       outputs = self.model(**model_inputs)[0][0].numpy()   <== HERE
    210 │   return {
    211 │       "outputs": outputs,
    212 │       "special_tokens_mask": special_tokens_mask,
```
Placing a `.cpu()` before the `.numpy()` call would solve the problem.
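The fix amounts to inserting `.cpu()` before `.numpy()`. As a sketch of the pattern (the helper name `tensor_to_numpy` is illustrative, not part of the library):

```python
import torch

def tensor_to_numpy(t: torch.Tensor):
    # .detach() drops the autograd graph and .cpu() copies the tensor to
    # host memory; both are effectively no-ops when not needed, so this
    # is safe for tensors on any device.
    return t.detach().cpu().numpy()

logits = torch.randn(1, 7, 9)  # (batch, seq_len, num_labels)
outputs = tensor_to_numpy(logits[0])
print(outputs.shape)  # (7, 9)
```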

Thanks in advance for any help
Have a wonderful day

@LysandreJik (Member)

Nice catch! Would you like to open a PR with the fix?

@ierezell (Contributor, Author)

ierezell commented Sep 30, 2021

Yes, I can do it for only 6 characters

@ierezell (Contributor, Author)

Done, see the pull request above: #13819

I let the CI tests run, as there are no new features and I didn't want to burn my PC down running them locally :)
I made it quickly, but tell me if anything is not okay.

Have a great day

@mallorbc

mallorbc commented Sep 30, 2021

There is a similar issue later in the same file, at line 223:

```
    220 │   sentence = model_outputs["sentence"]
    221 │   input_ids = model_outputs["input_ids"][0]
    222 │   offset_mapping = model_outputs["offset_mapping"][0] if model_o…
 ❱  223 │   special_tokens_mask = model_outputs["special_tokens_mask"][0].numpy()
    224 │
    225 │   scores = np.exp(outputs) / np.exp(outputs).sum(-1, keepdims=Tr…
    226 │   pre_entities = self.gather_pre_entities(
```
@ierezell (Contributor, Author)

ierezell commented Oct 1, 2021

Thanks, I committed new changes.

@LysandreJik Do you want me to also add a test (all current tests are passing)?

In `tests/test_pipelines_token_classification.py`, something like:

```python
@require_torch_gpu
@slow
def test_correct_devices(self):
    sentence = "This dummy sentence checks if all the variables can be loaded on gpu and brought back to cpu"
    ner = pipeline(task="token-classification", model="distilbert-base-cased", device=0)
    # Should run end to end without a device-conversion TypeError
    ner(sentence)
```

@LysandreJik (Member)

I believe this was fixed by #13856, which also implemented tests.
