Commit 435bdbf
Author: NeoChen1024
Parent(s): 57e9011
Update README.md
README.md CHANGED
@@ -1,3 +1,8 @@
----
-license: apache-2.0
-
+---
+license: apache-2.0
+base_model:
+- cognitivecomputations/dolphin-2.7-mixtral-8x7b
+---
+
+GGUF IQ3_M quant of cognitivecomputations/dolphin-2.7-mixtral-8x7b
+It fits into 24GiB VRAM with 32768 context (@ 8bit KV cache quantization).
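For context on the 32768-context claim in the added README text, the KV cache share of the memory budget can be estimated with a short calculation. This is a rough sketch, not something taken from the commit: the layer count, KV head count, and head dimension are assumed from the published Mixtral 8x7B configuration, and "8bit" is assumed to mean a llama.cpp-style `q8_0` layout (34 bytes per block of 32 values). It covers only the KV cache, not the model weights or compute buffers.

```python
# Rough KV-cache size estimate for a 32768-token context.
# ASSUMPTIONS (not stated in the commit): architecture values follow the
# published Mixtral 8x7B config; "8bit" is taken to mean a q8_0-style
# layout that stores 32 int8 values plus one fp16 scale per block.
N_LAYERS = 32      # assumed number of transformer layers
N_KV_HEADS = 8     # assumed number of key/value heads (grouped-query attention)
HEAD_DIM = 128     # assumed per-head dimension
N_CTX = 32768      # context length quoted in the README

GIB = 1024 ** 3


def kv_cache_bytes(n_ctx: int, bytes_per_element: float) -> float:
    """Bytes needed to hold keys and values for n_ctx tokens."""
    # Factor 2 accounts for storing both K and V in every layer.
    elements = 2 * N_LAYERS * N_KV_HEADS * HEAD_DIM * n_ctx
    return elements * bytes_per_element


f16_gib = kv_cache_bytes(N_CTX, 2.0) / GIB        # 16-bit cache
q8_0_gib = kv_cache_bytes(N_CTX, 34 / 32) / GIB   # assumed q8_0 block layout

print(f"f16 KV cache:  {f16_gib:.3f} GiB")   # 4.000 GiB
print(f"q8_0 KV cache: {q8_0_gib:.3f} GiB")  # 2.125 GiB
```

Under these assumptions, quantizing the cache to 8 bits saves a little under 2 GiB compared with a 16-bit cache at the same context length.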