GGUF?
#1 opened by aaha
Hi @macadeliccc! Thanks for the model.
Could you also share GGUF quants for the model?
Thanks again!
Yeah I can upload them for you tomorrow!
Thanks! What prompt format would get the best output, especially for chat use?
https://huggingface.co./macadeliccc/MonarchLake-7B-GGUF/tree/main
The prompt template should be ChatML. If you're using the GGUF version with ooba or llama.cpp, it should auto-populate from the config.
If ChatML doesn't work, try Alpaca. Sometimes the template doesn't carry over correctly when you merge LoRA weights instead of doing a full fine-tune.
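For anyone wiring the prompt up by hand instead of relying on the config, here's a rough sketch of both standard formats (nothing here is specific to this repo, just the usual ChatML and Alpaca layouts):

```python
def chatml_prompt(system: str, user: str) -> str:
    """Wrap a system + user turn in standard ChatML tags."""
    return (
        f"<|im_start|>system\n{system}<|im_end|>\n"
        f"<|im_start|>user\n{user}<|im_end|>\n"
        f"<|im_start|>assistant\n"
    )

def alpaca_prompt(instruction: str) -> str:
    """Alpaca fallback, in case the ChatML tags were lost in the merge."""
    return (
        "Below is an instruction that describes a task. "
        "Write a response that appropriately completes the request.\n\n"
        f"### Instruction:\n{instruction}\n\n"
        "### Response:\n"
    )

print(chatml_prompt("You are a helpful assistant.", "Hello!"))
```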
Thanks! I'll try it in LM Studio with ChatML. Is the context window 32k?
Yes, it's 32k.
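In case it helps anyone loading the GGUF from Python with the full window, something like this should work with llama-cpp-python (the quant filename below is a guess, check the repo for the actual file names):

```python
from llama_cpp import Llama

# Filename is illustrative; use whichever quant you downloaded from the repo.
llm = Llama(
    model_path="MonarchLake-7B.Q4_K_M.gguf",
    n_ctx=32768,  # full 32k context window
)

# llama-cpp-python applies the chat template (ChatML here) for you.
out = llm.create_chat_completion(
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Hello!"},
    ],
    max_tokens=256,
)
print(out["choices"][0]["message"]["content"])
```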