morgendigital
/

Llama-2-13b-chat-german-GGUF

Text Generation

Model card Files Files and versions Community

freefallr commited on Sep 5, 2023

Commit

bca02b0

•

1 Parent(s): f7a2d95

Update README.md

Files changed (1) hide show

README.md +6 -1

README.md CHANGED Viewed

@@ -31,7 +31,12 @@ This repository contains the model [jphme/Llama-2-13b-chat-german](https://huggi
 ## Conversion Code
 ```
 ```
 ___

 ## Conversion Code
 ```
+# Convert original model to FP16 GGUF format
+python3 llama.cpp/convert.py ./original-models/Llama-2-13b-chat-german --outtype f16 --outfile ./converted_gguf/Llama-2-13b-chat-german-GGUF.fp16.bin
+# Quantize FP16 GGUF to 8, 5_K_M and 4_K_M bit
+./llama.cpp/quantize Llama-2-13b-chat-german-GGUF.fp16.bin Llama-2-13b-chat-german-GGUF.q8_0.bin q8_0
+./llama.cpp/quantize Llama-2-13b-chat-german-GGUF.fp16.bin Llama-2-13b-chat-german-GGUF.q8_0.bin q5_K_M
+./llama.cpp/quantize Llama-2-13b-chat-german-GGUF.fp16.bin Llama-2-13b-chat-german-GGUF.q8_0.bin q4_K_M
 ```
 ___