How to increase max token output?
Hello,
While using Florence-2 for OCR tasks, the maximum token output is easily reached (around 400 words).
Is it possible to increase the maximum output? I have tried changing max_new_tokens, which has no effect.
I have also tried "max_position_embeddings", but it seems that changing this parameter would require retraining the model.
Is there any way to increase the number of output tokens?
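For context, this is roughly the call I am making (a minimal sketch based on the model card example; the model ID, image path, and prompt are just what I happen to use):

```python
import torch
from PIL import Image
from transformers import AutoModelForCausalLM, AutoProcessor

model_id = "microsoft/Florence-2-large"
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.float16, trust_remote_code=True
).to("cuda")
processor = AutoProcessor.from_pretrained(model_id, trust_remote_code=True)

image = Image.open("document_page.png")  # placeholder path
prompt = "<OCR>"
inputs = processor(text=prompt, images=image, return_tensors="pt").to("cuda", torch.float16)

generated_ids = model.generate(
    input_ids=inputs["input_ids"],
    pixel_values=inputs["pixel_values"],
    max_new_tokens=4096,  # raising this has no visible effect on the output length
    num_beams=3,
    do_sample=False,
)
text = processor.batch_decode(generated_ids, skip_special_tokens=False)[0]
result = processor.post_process_generation(
    text, task="<OCR>", image_size=(image.width, image.height)
)
print(result)
```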
It's not possible; the model was trained with a maximum of 1024 tokens.
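For what it's worth, the limit can be read directly from the checkpoint's config (a sketch; the `text_config.max_position_embeddings` attribute path is my assumption about how this particular config is laid out):

```python
from transformers import AutoConfig

config = AutoConfig.from_pretrained("microsoft/Florence-2-large", trust_remote_code=True)
# The language-model half of Florence-2 carries the positional-embedding limit;
# the attribute path assumes the standard layout of this checkpoint's config.json.
print(config.text_config.max_position_embeddings)  # expected to print 1024
```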
Thank you for the explanation, @lucasjin. I had noticed that my outputs sometimes stopped at around 1265 characters, and it now makes sense that this comes from the tokenization. I had set max_new_tokens to 4098 (as I process OCR for long documents), but the text always seemed to cut off at the same place. That behavior is consistent with the model's tokenization and its maximum token limit.
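For anyone else hitting this, a quick way to sanity-check where the cut-off comes from is to re-tokenize a truncated output and count the tokens (a rough sketch; the file path is a placeholder for a saved OCR result):

```python
from transformers import AutoProcessor

processor = AutoProcessor.from_pretrained(
    "microsoft/Florence-2-large", trust_remote_code=True
)

# Load a previously saved, truncated OCR result (placeholder path).
ocr_text = open("truncated_ocr_output.txt", encoding="utf-8").read()

# Re-tokenize the decoded text to see how many tokens it actually contains.
token_ids = processor.tokenizer(ocr_text)["input_ids"]
print(f"{len(ocr_text)} characters -> {len(token_ids)} tokens")  # lands near the 1024-token cap
```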