Working with llama.cpp?
#1 by ivanpzk - opened
Thank you! Is it working with llama.cpp? I don't have time to try it right now.
Yes, they’re made with llama.cpp.
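For anyone who wants a quick way to try them, here's a minimal sketch using the llama-cpp-python bindings; the model filename and parameters below are placeholders, not from this repo:

```python
# Minimal sketch with llama-cpp-python (pip install llama-cpp-python).
# model_path is a placeholder; point it at the quant you downloaded.
from llama_cpp import Llama

llm = Llama(
    model_path="model.Q4_K_M.gguf",  # placeholder filename
    n_ctx=4096,                      # context window; raise for the 128K variant
)

out = llm("Explain GGUF in one sentence.", max_tokens=64)
print(out["choices"][0]["text"])
```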
The Q4_K_M quant is not loading in koboldcpp or text-generation-webui, no idea why. Everything is up to date.
As far as I can tell, the 128K version is not supported by llama.cpp yet. See: https://github.com/ggerganov/llama.cpp/issues/6849#issuecomment-2074899603
Perhaps the quantization tool already supports producing the GGUF files, but you cannot run inference on this model yet. Please correct me if I'm wrong.
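If it helps, here's a hedged sketch for checking whether your build can actually load the file (again via the llama-cpp-python bindings, with a placeholder filename); an unsupported architecture usually fails at load time rather than during generation:

```python
# Quick load check: unsupported model architectures typically fail here.
from llama_cpp import Llama

try:
    llm = Llama(model_path="model-128k.Q4_K_M.gguf")  # placeholder filename
    print("Model loaded; inference should work.")
except Exception as err:
    print(f"Load failed (possibly unsupported by this build): {err}")
```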
The quants have been updated with the latest llama.cpp release.
munish0838 changed discussion status to closed