pipihand01
/

gemma-2-9b-it-SimPO-GGUF

alignment-handbook

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Community

pipihand01 commited on 24 days ago

Commit

b04f585

·

verified ·

1 Parent(s): ab936b4

Update README.md

Files changed (1) hide show

README.md +2 -6

README.md CHANGED Viewed

@@ -18,10 +18,6 @@ GGUF files of [princeton-nlp/gemma-2-9b-it-SimPO](https://huggingface.co/princet
 ---
 ## Files quantized with larger embed and output weights than normal GGUF setting
-- Q8_0 embed and output weights:
-Q6_K_L, Q5_K_L, Q4_K_L
-- bf16 embed and output weights: (maybe slower inference)
-Q8_0_L, Q6_K_XL, Q5_K_XL, Q4_K_XL

 ---
 ## Files quantized with larger embed and output weights than normal GGUF setting
+- Q8_0 embed and output weights: Q6_K_L, Q5_K_L, Q4_K_L
+- bf16 embed and output weights (maybe slower inference): Q8_0_L, Q6_K_XL, Q5_K_XL, Q4_K_XL