Model is downloaded twice with transformers

by dulacp - opened Jan 31

My understanding is that the model weights are included in the repository twice: once as the single file consolidated.safetensors, and a second time as the sharded weight files model-....safetensors.

This doubles the Hugging Face cache size in all our pods (94.3 GB instead of 47.2 GB).

Is this the expected behavior?
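For illustration, here is a minimal sketch of one possible workaround: filtering out the single-file copy before downloading. It assumes the duplicate really is the one file named consolidated.safetensors and that the sharded model-....safetensors files are the ones needed. The file listing and the helper name `files_to_download` are hypothetical, not taken from the repository.

```python
from fnmatch import fnmatch

# Assumption: the duplicate copy of the weights is this single file.
IGNORE_PATTERNS = ["consolidated.safetensors"]


def files_to_download(repo_files, ignore_patterns=IGNORE_PATTERNS):
    """Return the files that do not match any of the ignore patterns."""
    return [
        name
        for name in repo_files
        if not any(fnmatch(name, pattern) for pattern in ignore_patterns)
    ]


# Hypothetical listing, only to show the effect of the filter.
example_files = [
    "config.json",
    "consolidated.safetensors",
    "model-00001-of-00002.safetensors",
    "model-00002-of-00002.safetensors",
    "model.safetensors.index.json",
]

kept = files_to_download(example_files)
print(kept)
```

If the download goes through huggingface_hub, the same pattern list could be passed as the `ignore_patterns` argument of `snapshot_download`, which should keep the consolidated file out of the cache. Whether that is safe depends on which copy of the weights the loading code actually reads.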
