Update README.md
Browse files
README.md
CHANGED
@@ -7,7 +7,7 @@ tags:
|
|
7 |
---
|
8 |
4.25 bpw/bits exl2 quantization of [Venus-120b](https://huggingface.co/nsfwthrowitaway69/Venus-120b-v1.0), using the measurement.json and dataset posted in the model page.
|
9 |
|
10 |
-
This size lets it being able to use CFG on 72GB VRAM. (use
|
11 |
|
12 |
# Original model card
|
13 |
|
|
|
7 |
---
|
8 |
4.25 bpw/bits exl2 quantization of [Venus-120b](https://huggingface.co/nsfwthrowitaway69/Venus-120b-v1.0), using the measurement.json and dataset posted in the model page.
|
9 |
|
10 |
+
This size lets it being able to use CFG on 72GB VRAM. (use 20,21.5,21.5 for gpu split)
|
11 |
|
12 |
# Original model card
|
13 |
|