Update README.md
Browse files
README.md
CHANGED
@@ -10,9 +10,9 @@ This repository contains CPU-optimized GGUF quantizations of the Meta-Llama-3.1-
|
|
10 |
## Available Quantizations
|
11 |
|
12 |
1. Q4_0_4_8 (CPU FMA-Optimized): ~246 GB
|
13 |
-
2. BF16: ~
|
14 |
-
3. Q8_0: ~
|
15 |
-
4.
|
16 |
|
17 |
## Use Aria2 for parallelized downloads, links will download 9x faster
|
18 |
|
@@ -33,6 +33,15 @@ aria2c -x 16 -s 16 -k 1M -o meta-405b-inst-cpu-optimized-q4048-00005-of-00006.gg
|
|
33 |
aria2c -x 16 -s 16 -k 1M -o meta-405b-inst-cpu-optimized-q4048-00006-of-00006.gguf https://huggingface.co/nisten/meta-405b-instruct-cpu-optimized-gguf/resolve/main/meta-405b-inst-cpu-optimized-q4048-00006-of-00006.gguf
|
34 |
```
|
35 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
36 |
### BF16 Version
|
37 |
|
38 |
```bash
|
|
|
10 |
## Available Quantizations
|
11 |
|
12 |
1. Q4_0_4_8 (CPU FMA-Optimized): ~246 GB
|
13 |
+
2. BF16: ~811 GB
|
14 |
+
3. Q8_0: ~406 GB
|
15 |
+
4. Q2-Q8mix ~ 165Gb
|
16 |
|
17 |
## Use Aria2 for parallelized downloads, links will download 9x faster
|
18 |
|
|
|
33 |
aria2c -x 16 -s 16 -k 1M -o meta-405b-inst-cpu-optimized-q4048-00006-of-00006.gguf https://huggingface.co/nisten/meta-405b-instruct-cpu-optimized-gguf/resolve/main/meta-405b-inst-cpu-optimized-q4048-00006-of-00006.gguf
|
34 |
```
|
35 |
|
36 |
+
### Q2K-Q8 Mixed 2bit 8bit I wrote myself. This is the smallest coherent one I could make without yet doing imatrix
|
37 |
+
|
38 |
+
```verilog
|
39 |
+
aria2c -x 16 -s 16 -k 1M -o meta-405b-inst-cpu-2kmix8k-00001-of-00004.gguf https://huggingface.co/nisten/meta-405b-instruct-cpu-optimized-gguf/resolve/main/meta-405b-inst-cpu-2kmix8k-00001-of-00004.gguf?download=true
|
40 |
+
aria2c -x 16 -s 16 -k 1M -o meta-405b-inst-cpu-2kmix8k-00002-of-00004.gguf https://huggingface.co/nisten/meta-405b-instruct-cpu-optimized-gguf/resolve/main/meta-405b-inst-cpu-2kmix8k-00002-of-00004.gguf?download=true
|
41 |
+
aria2c -x 16 -s 16 -k 1M -o meta-405b-inst-cpu-2kmix8k-00003-of-00004.gguf https://huggingface.co/nisten/meta-405b-instruct-cpu-optimized-gguf/resolve/main/meta-405b-inst-cpu-2kmix8k-00003-of-00004.gguf?download=true
|
42 |
+
aria2c -x 16 -s 16 -k 1M -o meta-405b-inst-cpu-2kmix8k-00004-of-00004.gguf https://huggingface.co/nisten/meta-405b-instruct-cpu-optimized-gguf/resolve/main/meta-405b-inst-cpu-2kmix8k-00004-of-00004.gguf?download=true
|
43 |
+
```
|
44 |
+
|
45 |
### BF16 Version
|
46 |
|
47 |
```bash
|