At least for some models, including CodeGen, I'm observing very inconsistent outputs from `ORTModelForCausalLM` when the same inputs carry different amounts of left padding (with the correct corresponding attention mask). In other words, this is the equivalent of the problem reported for vanilla transformers in huggingface/transformers#21080 and fixed in huggingface/transformers#21853 and huggingface/transformers#22069.
This comment alludes to the handling of `position_ids`, which I was wondering might be related.