
AttributeError: 'NoneType' object has no attribute 'from_pretrained' #8864

Closed
louisabraham opened this issue Dec 1, 2020 · 6 comments · Fixed by #8881

Comments

@louisabraham

This code was working yesterday but doesn't work today:

from transformers import AutoTokenizer
AutoTokenizer("Helsinki-NLP/opus-mt-en-fr")
@jacampo

jacampo commented Dec 1, 2020

Same here a couple of hours ago

@LysandreJik
Member

LysandreJik commented Dec 1, 2020

  1. Hi, could you please provide the information related to your environment?

  2. When you say it was working yesterday but isn't working today, do you mean you've upgraded to version v4.0.0, which was released yesterday? If so, you are probably getting the following error message: AttributeError: 'NoneType' object has no attribute 'from_pretrained'. That happens when you do not have sentencepiece installed (a quick install-and-load sketch follows the example below).

  3. Are you sure this worked previously? It should never have worked, as AutoTokenizer cannot be initialized like this; it has to be instantiated via the from_pretrained method:

from transformers import AutoTokenizer
AutoTokenizer.from_pretrained("Helsinki-NLP/opus-mt-en-fr")

which works on v4.0.0 and on master, as long as you have SentencePiece installed.
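
A minimal sketch of the fix from point 2, assuming a fresh v4.0.0 install and using the model name from the original report (run pip install sentencepiece first):

from transformers import AutoTokenizer

# With sentencepiece installed, the Marian tokenizer loads as expected on v4.0.0.
tokenizer = AutoTokenizer.from_pretrained("Helsinki-NLP/opus-mt-en-fr")

# By default the encoded output contains plain Python lists rather than tensors.
print(tokenizer("Hello, world!")["input_ids"])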

@LysandreJik
Member

Putting a better error message in #8881.

@louisabraham
Author

louisabraham commented Dec 2, 2020

Right, I was using

AutoTokenizer.from_pretrained("Helsinki-NLP/opus-mt-en-fr")

Thanks, pip install sentencepiece fixed the issue!

It looks like the tokenizer previously output torch tensors and now outputs lists. Is this intended? It breaks existing code.

@LysandreJik
Member

Yes, this was a bug. Tokenizers are framework-agnostic and should not output a specific framework's tensor. The implementation of the Marian tokenizer was not respecting the API in that regard.

Tokenizers can still return torch tensors; you just need to specify that you want them:

tokenizer(xxx, return_tensors="pt")

I guess in your situation it has to do with prepare_seq2seq_batch:

tokenizer.prepare_seq2seq_batch(xxx, return_tensors="pt")
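
A short end-to-end sketch of both calls (the input strings here are made-up examples, not taken from the issue, and PyTorch must be installed for return_tensors="pt" to work):

from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("Helsinki-NLP/opus-mt-en-fr")

# Ask for PyTorch tensors explicitly; the default is plain Python lists.
encoded = tokenizer(["How are you?"], return_tensors="pt")
print(type(encoded["input_ids"]))  # <class 'torch.Tensor'>

# prepare_seq2seq_batch accepts the same flag for translation-style batches.
batch = tokenizer.prepare_seq2seq_batch(["How are you?"], return_tensors="pt")
print(batch["input_ids"].shape)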

@louisabraham
Author

Thanks!
