Broken quants?

#1
by FlareRebellion - opened

Same Problem.
Just a bunch of "β–…β–…β–…β–…β–…β–…β–…β–…β–…β–…β–…β–…β–…β–…β–…β–…β–…β–…β–…β–…β–…β–…β–…β–…β–…β–…β–…β–…β–…β–…β–…β–…β–…β–…β–…β–…β–…β–…β–…β–…β–…β–…β–…β–…" must really be broken, darn.

Same.

Mixtral 8x7b and the like apparently have problems with K quants sometimes (or always, I didn't test). Did you try if it works with Q5_0?

deleted

Mixtral 8x7b and the like apparently have problems with K quants sometimes (or always, I didn't test). Did you try if it works with Q5_0?

Odd, im using mix8 instruct @ Q6_K ( gguf ) and its doing great for me.

That's intestesting. I use to download Q6K too. Perhaps it depends on the program you use.

deleted
β€’
edited Jan 27

im mostly using ooba's text gen for the gui, and llama.ccp for the engine for GGUFs. For 'raw' models, mostly transformers engine, but i dont have a big enough GPU to do that for large models so gguf for me :)

I confirm 5_K_M in this repo is broken, while 6_K is working. NeverSleep's version for both quants is working ok.

I also can confirm that file for "5_K_M" in this specific repo is corrupted. Do not download. Wish I seen the discussion first. I have verified the checksums on my end, so the file uploaded itself is already corrupted.

Sign up or log in to comment