|
--- |
|
language: |
|
- en |
|
--- |
|
|
|
--- |
|
|
|
# To request a quant, open an new discussion in the Community tab (preferably with the url in the title) |
|
|
|
--- |
|
|
|
# Mini-FAQ |
|
|
|
## What does the "-i1" mean in "-i1-GGUF"? |
|
|
|
tbd |
|
|
|
## Why are you doing this? |
|
|
|
tbd |
|
|
|
## You have amazing hardware!?!?! |
|
|
|
tbd |
|
|
|
## Why don't you use gguf-split? |
|
|
|
TL;DR: I don't have the hardware/resources for that. |
|
|
|
Long answer: gguf-split requires a full copy for every quant. |
|
Unlike what many people think, my hardware is rather outdated and not very fast. The extra processing that gguf-split requires |
|
either runs out of space on my systems with fast disk, or takes a very long time and a lot of I/O bandwidth on the slower |
|
disks, all of which already run at their limits. Supporting gguf-split would mean |
|
|
|
While this is the blocking reason, I also find it less than ideal that yet another incompatible file format was created that |
|
requires special tools to manage, instead of supporting the tens of thousands of existing quants, of which the vast majority |
|
could just be mmapped together into memory from split files. That doesn't keep me from supporting it, but it would have |
|
been nice to look at the existing reality and/or consult the community before throwing yet another hard to support format out |
|
there without thinking. |
|
|
|
There are some developments to make this less of a pain, and I will revisit this issue from time to time to see if it has become feasible. |
|
|