https://huggingface.co./nvidia/Llama-3_1-Nemotron-51B-Instruct

#306
by Pomni - opened

i've seen a benchmark of this on the lm studio server, and apparently it's comparable to a 70b model. i'd like to try it out (i started using the downstairs living room pc, which has WAY better specs and an AVX2 cpu, unlike my main AVX-only pc)

well, let's see if it is supported by llama.cpp. i am a bit skeptical...

yeah, unfortunately:

ERROR:hf-to-gguf:Model DeciLMForCausalLM is not supported
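for anyone curious: the converter takes that architecture name straight from the model's config.json, so you can check support without downloading any weights. a minimal sketch using huggingface_hub (the repo id comes from the link at the top; everything else is just illustration):

```python
import json

from huggingface_hub import hf_hub_download

# fetch only config.json, not the full multi-gigabyte weights
path = hf_hub_download(
    repo_id="nvidia/Llama-3_1-Nemotron-51B-Instruct",
    filename="config.json",
)
with open(path) as f:
    config = json.load(f)

# prints ['DeciLMForCausalLM'] -- the architecture the converter rejected above
print(config["architectures"])
```

as far as i can tell, convert_hf_to_gguf.py only handles architectures it has a registered model class for, and DeciLMForCausalLM isn't one of them.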

mradermacher changed discussion status to closed
