
add_generation_prompt=False in Tokenizer.apply_chat_template has no effect #30893

Closed
3 of 4 tasks
AndreiMuresanu opened this issue May 18, 2024 · 6 comments


AndreiMuresanu commented May 18, 2024

System Info

Hi, I am trying to apply a chat template to an input without the generation prompt. However, setting add_generation_prompt=False appears to have no effect.

Who can help?

@ArthurZucker @Rocketknight1

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
  • My own task or dataset (give details below)

Reproduction

Minimal reproducible example:

from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("meta-llama/Meta-Llama-3-8B-Instruct")

chat = [
    {"role": "assistant", "content": "Hello, how are you?"},
]
formatted_chat = tokenizer.apply_chat_template(chat, add_generation_prompt=False, tokenize=False)
print(formatted_chat)

transformers==4.41.0

Expected behavior

Observed Output:

<|begin_of_text|><|start_header_id|>assistant<|end_header_id|>

Hello, how are you?<|eot_id|><|start_header_id|>assistant<|end_header_id|>


Expected Output:

<|begin_of_text|><|start_header_id|>assistant<|end_header_id|>

Hello, how are you?<|eot_id|>
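
A quick side-by-side check of the flag (a minimal diagnostic sketch, reusing the tokenizer and chat objects from the repro above):

for flag in (True, False):
    out = tokenizer.apply_chat_template(chat, add_generation_prompt=flag, tokenize=False)
    print(f"add_generation_prompt={flag}: {out!r}")

With a correctly written template, only the True case should end with a trailing <|start_header_id|>assistant<|end_header_id|> header.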
@Rocketknight1 (Member)

Hi @AndreiMuresanu, I just tried to reproduce this and add_generation_prompt=False worked correctly for me. Can you retry your code, and make sure you're updated to the latest version of transformers?
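
One quick sanity check is to print the version that is actually being imported, since it can differ from what pip reports in another environment:

import transformers

# The version string of the transformers package actually in use.
print(transformers.__version__)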

@Taeyoung-Jang

I have also encountered this issue.

model: Llama-3-8B-Instruct
packages: transformers==4.41.1, tokenizers==0.19.1

@Rocketknight1 (Member)

Can you try installing transformers from main with pip install git+https://github.com/huggingface/transformers.git? I can't reproduce this issue at all, and I'm curious if it's caused by an old cached version of the model, or some recent change, or something.
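
If a stale cache is the suspect, one way to rule it out is to force a fresh download of the tokenizer files (a sketch using the standard force_download flag of from_pretrained):

from transformers import AutoTokenizer

# Re-fetch the tokenizer files from the Hub, bypassing any locally
# cached copy that may still contain the old chat template.
tokenizer = AutoTokenizer.from_pretrained(
    "meta-llama/Meta-Llama-3-8B-Instruct",
    force_download=True,
)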


iseesaw commented May 27, 2024

same problem


iseesaw commented May 27, 2024

Solved. I was using the model's initial chat template, which has since been updated.

See Fix chat template to add generation prompt only if the option is selected (#9)

Rocketknight1 (Member) commented May 27, 2024

@iseesaw thanks for pointing that one out! This was an issue with the model's chat template, which has since been fixed upstream; redownloading the tokenizer resolves it. Going to close this issue now, since it's not a bug in transformers itself.
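
For anyone verifying the fix: the chat template is stored as a Jinja string on the tokenizer, so it can be inspected directly (a sketch; the exact Jinja source depends on which revision of the model repo you have downloaded):

from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("meta-llama/Meta-Llama-3-8B-Instruct")

# The updated template only appends the trailing assistant header inside
# a conditional, e.g. {% if add_generation_prompt %} ... {% endif %},
# so the flag name should appear in the template source.
print("add_generation_prompt" in tokenizer.chat_template)
print(tokenizer.chat_template)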
