https://huggingface.co./mistralai/Mistral-Nemo-Instruct-2407

#148
by WOOSAH - opened

Please add GGUF for this.
Would also love to see imatrix quants too <3 <3 <3

How I love gated models. Anyway, it's queued and should be done in a few hours or so. Cheers!

mradermacher changed discussion status to closed

Thank you thank you thank you, much love!

Unfortunately, it turns out to not be supported by llama.cpp at this time (due to lack of pretokenizer support). Sorry :/

This should now be supported in latest llama.cpp (b3542). I ran convert_hf_to_gguf.py and performed Q5_K_M quantization myself and everything worked perfectly fine.
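For anyone wanting to reproduce this locally, the two steps above roughly look like the following. This is a sketch only: the model directory and output filenames are assumptions, and it presumes you have llama.cpp b3542 or newer checked out and built.

```shell
# Sketch of the steps described above; paths/filenames are assumptions.
# 1. Convert the HF checkpoint to an F16 GGUF using the script that ships
#    with llama.cpp:
python convert_hf_to_gguf.py ./Mistral-Nemo-Instruct-2407 \
    --outfile mistral-nemo-instruct-2407-f16.gguf --outtype f16

# 2. Quantize the F16 GGUF down to Q5_K_M with the llama.cpp quantize tool:
./llama-quantize mistral-nemo-instruct-2407-f16.gguf \
    mistral-nemo-instruct-2407-Q5_K_M.gguf Q5_K_M
```

Since the thread mentions the pretokenizer issue, note that conversion only succeeds once llama.cpp knows the model's pretokenizer; older builds abort at step 1.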

Let's give it another try then :)
