fix prompt format for Llama-3.2-11B-Vision

#43

by chenhegu - opened Oct 17, 2024

base: refs/heads/main

←

from: refs/pr/43

Discussion Files changed

-2

chenhegu

Oct 17, 2024

The correct prompt format is

<|begin_of_text|><|image|>{user message}

based on the model card
The current example will generate wired output in some cases.

fix prompt format for Llama-3.2-11B-Vision46cd8afe

chenhegu

Oct 17, 2024

Also, this PR adds the BOS token by the tokenizer, this will make the tokenized input become:

<|begin_of_text|><|image|><|begin_of_text|>{user message}

if processor is used by the user and add_special_tokens=False isn't specified in processor.__call__.
This shouldn't be the default setting so I suggest to revert that.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Ready to merge

This branch is ready to get merged automatically.

· Sign up or log in to comment