AutoModel.from_pretrained error in loading state_dict

by Srymaker - opened 3 days ago

Discussion

Srymaker

3 days ago

why is this? I have tried updating transformers to the latest version.

hehesang

3 days ago

same problem

Xiangtai

3 days ago

•

edited 3 days ago

I meet the same error. It seems that the text prediction head (weights and bias) shape in current transformers is [1152, 1152] while the weights the authors provided are [1536, 1152] to match the visual token output.

Xiangtai

3 days ago

Bugs are here in current transformer source code.

hehesang

3 days ago

this version (https://github.com/huggingface/transformers/releases/tag/v4.49.0-SigLIP-2) should fix the problem

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment