ThomasBaruzier commited on
Commit
b9301db
1 Parent(s): d87465d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +35 -0
README.md CHANGED
@@ -198,6 +198,41 @@ Original model: [https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct](htt
198
 
199
  All quants were made using the imatrix option and Bartowski's [calibration file](https://gist.github.com/bartowski1182/eb213dccb3571f863da82e99418f81e8).
200
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
201
  <hr><br>
202
 
203
  ## Model Details
 
198
 
199
  All quants were made using the imatrix option and Bartowski's [calibration file](https://gist.github.com/bartowski1182/eb213dccb3571f863da82e99418f81e8).
200
 
201
+ <hr>
202
+
203
+ # Perplexity table (the lower the better)
204
+
205
+ | Quant | Size (MB) | PPL | Size (%) | Accuracy (%) | PPL error rate |
206
+ | ------- | --------- | ------- | -------- | ------------ | -------------- |
207
+ | IQ1_S | 1927 | 60.6812 | 12.57 | 13.79 | 0.45829 |
208
+ | IQ1_M | 2062 | 29.1481 | 13.46 | 28.7 | 0.21385 |
209
+ | IQ2_XXS | 2289 | 16.2991 | 14.94 | 51.32 | 0.12078 |
210
+ | IQ2_XS | 2486 | 12.5744 | 16.22 | 66.52 | 0.09212 |
211
+ | IQ2_S | 2631 | 11.5222 | 17.17 | 72.6 | 0.08563 |
212
+ | IQ2_M | 2812 | 10.3045 | 18.35 | 81.18 | 0.07545 |
213
+ | Q2_K_S | 2851 | 11.3784 | 18.6 | 73.52 | 0.0842 |
214
+ | Q2_K | 3032 | 10.572 | 19.78 | 79.12 | 0.07761 |
215
+ | IQ3_XXS | 3124 | 9.4049 | 20.38 | 88.94 | 0.06908 |
216
+ | IQ3_XS | 3356 | 9.017 | 21.9 | 92.77 | 0.06548 |
217
+ | Q3_K_S | 3495 | 9.4611 | 22.81 | 88.41 | 0.06768 |
218
+ | IQ3_S | 3512 | 8.944 | 22.92 | 93.53 | 0.06596 |
219
+ | IQ3_M | 3610 | 8.8635 | 23.56 | 94.38 | 0.06505 |
220
+ | Q3_K_M | 3833 | 8.8718 | 25.01 | 94.29 | 0.0665 |
221
+ | Q3_K_L | 4122 | 8.8139 | 26.9 | 94.91 | 0.06611 |
222
+ | IQ4_XS | 4242 | 8.5606 | 27.68 | 97.72 | 0.06361 |
223
+ | Q4_0 | 4460 | 8.508 | 29.1 | 98.32 | 0.06214 |
224
+ | IQ4_NL | 4462 | 8.5774 | 29.12 | 97.52 | 0.06391 |
225
+ | Q4_K_S | 4476 | 8.5273 | 29.21 | 98.1 | 0.06333 |
226
+ | Q4_K_M | 4693 | 8.5215 | 30.62 | 98.16 | 0.06327 |
227
+ | Q4_1 | 4893 | 8.5255 | 31.93 | 98.12 | 0.06337 |
228
+ | Q5_K_S | 5340 | 8.4169 | 34.85 | 99.38 | 0.06261 |
229
+ | Q5_0 | 5354 | 8.437 | 34.94 | 99.15 | 0.06281 |
230
+ | Q5_K_M | 5468 | 8.4087 | 35.68 | 99.48 | 0.06245 |
231
+ | Q5_1 | 5788 | 8.3893 | 37.77 | 99.71 | 0.06229 |
232
+ | Q6_K | 6291 | 8.3894 | 41.05 | 99.71 | 0.06231 |
233
+ | Q8_0 | 8146 | 8.3634 | 53.15 | 100.02 | 0.0621 |
234
+ | F16 | 15325 | 8.365 | 100 | 100 | 0.06216 |
235
+
236
  <hr><br>
237
 
238
  ## Model Details