Add link to llama-imatrix
README.md
CHANGED
@@ -144,6 +144,8 @@ is unfortunately very frequent).
 
 ## What do I need to do to compute imatrix files for large models?
 
+Use [`llama-imatrix`](https://github.com/ggml-org/llama.cpp/blob/master/examples/imatrix/README.md) to compute imatrix files.
+
 ### Hardware
 
 * RAM: A lot of RAM is required to compute imatrix files. Example: 512 GB is just enough to compute 405B imatrix quants in Q8.
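For reference, a typical `llama-imatrix` invocation looks like the sketch below. The model and calibration-data file names are placeholders; check the linked llama.cpp imatrix README for the full set of options available in your build.

```shell
# Sketch of an imatrix computation run (file names are hypothetical):
#   -m  path to the GGUF model to analyze
#   -f  plain-text calibration data the model is run over
#   -o  where to write the resulting imatrix file
./llama-imatrix \
    -m model.gguf \
    -f calibration-data.txt \
    -o imatrix.dat
```

The calibration text should be diverse enough to exercise the model's weights; the resulting `imatrix.dat` is then passed to the quantization tool when producing imatrix quants.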