Pinkstack commited on
Commit
8ec3b49
·
verified ·
1 Parent(s): 1a5a110

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -22,7 +22,7 @@ Based on Smollm2 by HuggingfaceTB, Tinyparm v2 (this version) has very strong pe
22
  # Which quant is right for you?
23
 
24
  - ***F16:*** Highest quality answers, Not needed for most use cases due to its heavier size.
25
- - ***Q8:*** Very high-quality answers, highly recommended! easy to run on modern mobile devices such as phones (tested on Z fold 5)
26
  - ***Q5:*** High quality answers, recommended but Q4 is very similar.
27
  - ***Q4:*** Good quality answers, highly recommended! very easy to run on most devices due to its smaller size, although quality is lower. can be run on ~1 - 2GB vram
28
 
 
22
  # Which quant is right for you?
23
 
24
  - ***F16:*** Highest quality answers, Not needed for most use cases due to its heavier size.
25
+ - ***Q8:*** Very high-quality answers, highly recommended! easy to run on modern mobile devices such as phones (test on Z fold 5 via llama.cpp: 14.03 t/s for token generation.)
26
  - ***Q5:*** High quality answers, recommended but Q4 is very similar.
27
  - ***Q4:*** Good quality answers, highly recommended! very easy to run on most devices due to its smaller size, although quality is lower. can be run on ~1 - 2GB vram
28