eachadea committed on
Commit
17ebbf7
1 Parent(s): df20cac

Update README.md

Files changed (1):
  1. README.md +2 -10
README.md CHANGED
@@ -4,19 +4,11 @@ tags:
 - vicuna
 - llama
 - text-generation-inference
----
-Converted for use with [llama.cpp](https://github.com/ggerganov/llama.cpp)
----
-- 4-bit quantized
-- Needs ~10GB of CPU RAM
-- Won't work with alpaca.cpp or old llama.cpp (new ggml format requires latest llama.cpp)
-- EOS token fix added (download rev1)
-
 ---
 
-If you only have 8GB RAM, a smaller 7B version of this can be found here: https://huggingface.co/eachadea/ggml-vicuna-7b-4bit.
+**NOTE: Download new version here: https://huggingface.co/eachadea/ggml-vicuna-13b-1.1**
+**NOTE: Download new version (7B): https://huggingface.co/eachadea/ggml-vicuna-7b-1.1**
 
-7B is over 2x faster and is also uncensored, while 13B isn't.
 
 ---
 tags: