Update README.md
README.md
CHANGED
@@ -6,7 +6,7 @@ license: apache-2.0
 
 **Goal is to have the best performing MoE < 10gb**
 
-Experimental q8 and q4 files for training/finetuning too.
+>Experimental q8 and q4 files for training/finetuning too.
 
 * *No sparsity tricks yet.*
 
@@ -24,7 +24,7 @@ make -j
 
 wget https://huggingface.co/nisten/quad-mixtrals-gguf/resolve/main/4mixq2.gguf
 
-./server -m 4mixq2.gguf --host "
+./server -m 4mixq2.gguf --host "my.internal.ip.or.my.cloud.host.name.goes.here.com" -c 512
 ```
 
 
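
Once the server from the updated command is running, it can be queried over HTTP. The following is a minimal sketch, not part of the README: it assumes the llama.cpp server's `/completion` endpoint and its default port 8080, and the `HOST`, `PORT`, `completion_url`, and `ask` names are hypothetical helpers introduced here for illustration.

```shell
#!/bin/sh
# Sketch only. HOST and PORT are assumed values; replace HOST with the
# address passed to --host. 8080 is assumed to be the server's default port.
HOST="${HOST:-127.0.0.1}"
PORT="${PORT:-8080}"

# Build the URL of the completion endpoint from a host and a port.
completion_url() {
  printf 'http://%s:%s/completion' "$1" "$2"
}

# Send a prompt ($1) and print the raw JSON reply.
# $2 is the number of tokens to generate (defaults to 64).
# Requires curl and a running server; the prompt must not contain
# double quotes or backslashes, since it is pasted into JSON unescaped.
ask() {
  curl -s "$(completion_url "$HOST" "$PORT")" \
    -H 'Content-Type: application/json' \
    -d "{\"prompt\": \"$1\", \"n_predict\": ${2:-64}}"
}
```

With the server up, `ask "Hello, my name is" 32` should print a JSON object containing the generated text. Keeping the prompt plus `n_predict` well under the `-c 512` context size avoids truncation.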