Update README.md
Browse files
README.md
CHANGED
@@ -2,7 +2,7 @@
|
|
2 |
license: cc-by-nc-4.0
|
3 |
---
|
4 |
## There's some kind of gguf tokenization issue where its not properly handling the <|im_end|> EOS token and instead breaks it into multiple smaller ones which degrades performance for gguf users. Need to submit an issue ticket. Unsure if this impacts other quantization formats.
|
5 |
-
## If possible, please use the safetensors
|
6 |
|
7 |
This is the first public release of the Humanize model I've been working on. That way folks have a stable repo to download from and can expect some consistient level of performance when downloading.
|
8 |
## Goals
|
|
|
2 |
license: cc-by-nc-4.0
|
3 |
---
|
4 |
## There's some kind of gguf tokenization issue where its not properly handling the <|im_end|> EOS token and instead breaks it into multiple smaller ones which degrades performance for gguf users. Need to submit an issue ticket. Unsure if this impacts other quantization formats.
|
5 |
+
## If possible, please use the safetensors or Exllama2 for best performance. Exllama2 tokenizes correctly.
|
6 |
|
7 |
This is the first public release of the Humanize model I've been working on. That way folks have a stable repo to download from and can expect some consistient level of performance when downloading.
|
8 |
## Goals
|