unsloth/DeepSeek-R1-Distill-Llama-8B-GGUF


danielhanchen committed 14 days ago

Commit 0e12f93 (verified) · 1 parent: 7916422

Update README.md

Files changed (1)

README.md +2 -1


@@ -17,7 +17,8 @@ tags:
 ### Instructions to run this model in llama.cpp:
 Or you can view more detailed instructions here: [unsloth.ai/blog/deepseek-r1](https://unsloth.ai/blog/deepseek-r1)
 1. Do not forget about `<｜User｜>` and `<｜Assistant｜>` tokens! - Or use a chat template formatter
-2. Example with Q8_0 K quantized cache **Notice -no-cnv disables auto conversation mode**
    ```bash
    ./llama.cpp/llama-cli \
        --model unsloth/DeepSeek-R1-Distill-Llama-8B-GGUF/DeepSeek-R1-Distill-Llama-8B-Q4_K_M.gguf \

 ### Instructions to run this model in llama.cpp:
 Or you can view more detailed instructions here: [unsloth.ai/blog/deepseek-r1](https://unsloth.ai/blog/deepseek-r1)
 1. Do not forget about `<｜User｜>` and `<｜Assistant｜>` tokens! - Or use a chat template formatter
+2. Obtain the latest `llama.cpp` at https://github.com/ggerganov/llama.cpp
+3. Example with Q8_0 K quantized cache **Notice -no-cnv disables auto conversation mode**
    ```bash
    ./llama.cpp/llama-cli \
        --model unsloth/DeepSeek-R1-Distill-Llama-8B-GGUF/DeepSeek-R1-Distill-Llama-8B-Q4_K_M.gguf \