Update README.md
README.md
CHANGED
@@ -17,7 +17,7 @@ Using <a href="https://github.com/turboderp/exllamav2/releases/tag/v0.0.16">turb
 
 Each branch contains an individual bits per weight, with the main one containing only the meaurement.json for further conversions.
 
-Original model: https://huggingface.co/cognitivecomputations/dolphin-2.8-mistral-7b-v02
+Original model (currently private): https://huggingface.co/cognitivecomputations/dolphin-2.8-mistral-7b-v02
 
 | Branch | Bits | lm_head bits | VRAM (4k) | VRAM (16k) | VRAM (32k) | Description |
 | ----- | ---- | ------- | ------ | ------ | ------ | ------------ |