nisten commited on
Commit
e065fd7
1 Parent(s): 0e9fc27

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -4
README.md CHANGED
@@ -2,17 +2,17 @@
2
  license: apache-2.0
3
  ---
4
 
5
- ** Experimental quants of 4 expert MoE mixtrals in various GGUF formats. **
6
 
7
- ** Goal is to have the best performing MoE < 10gb **
8
 
9
  They still need training/finetuning.
10
 
11
- * * No sparsity tricks yet. * *
12
 
13
  8.4gb custom 2bit quant works ok up until 512 token length then starts looping.
14
 
15
- Install llama.cpp from github and run it
16
 
17
 
18
  ```bash
 
2
  license: apache-2.0
3
  ---
4
 
5
+ **Experimental quants of 4 expert MoE mixtrals in various GGUF formats.**
6
 
7
+ **Goal is to have the best performing MoE < 10gb**
8
 
9
  They still need training/finetuning.
10
 
11
+ * *No sparsity tricks yet.* *
12
 
13
  8.4gb custom 2bit quant works ok up until 512 token length then starts looping.
14
 
15
+ Install llama.cpp from github and run it:
16
 
17
 
18
  ```bash