`Display_Prompt = no`
My understanding is that this parameter should suppress the prompt in the model's output. It appears to do nothing, or I'm putting it in the wrong place. Does anyone know how to use it?
Can you share more details please?
Basically the prompt is always appearing in the output from the LLM.
This happens with both the 1B and 3B instruct models.
I am not using `pipeline`.
It happens whether or not I use a chat template with the system/user message format.
I can share some code tomorrow if that helps. I'm basically looking for a parameter to just get the response displayed.
@thegamecat just slice the output at the input length and take the rest as the response:
response = tokenizer.decode(output[0][len(inputs[0]):])
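For context, here is a minimal sketch of the full flow with transformers. The model name and prompt are placeholders (I'm assuming one of the 1B/3B instruct checkpoints mentioned above), but the slicing trick is the same:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder checkpoint; swap in the 1B or 3B instruct model you are using.
model_name = "meta-llama/Llama-3.2-1B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name, torch_dtype=torch.bfloat16, device_map="auto"
)

prompt = "What is the capital of France?"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

output = model.generate(**inputs, max_new_tokens=256)

# generate() returns the prompt tokens followed by the new tokens,
# so drop the first len(input_ids) tokens before decoding.
response = tokenizer.decode(
    output[0][inputs["input_ids"].shape[-1]:],
    skip_special_tokens=True,
)
print(response)
```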
The maximum number of tokens is already exceeded by that point.
What do you mean by exceeded? Just increase `max_new_tokens` to a larger value then.
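Something like this (same setup as the sketch above; 1024 is just an example value, pick whatever budget your responses need):

```python
# Larger generation budget so the response is not cut off;
# slicing off the prompt tokens works exactly as before.
output = model.generate(**inputs, max_new_tokens=1024)
response = tokenizer.decode(
    output[0][inputs["input_ids"].shape[-1]:],
    skip_special_tokens=True,
)
```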