Commit
·
34cf98b
1
Parent(s):
3d5cc96
Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,5 @@
|
|
|
|
|
|
1 |
This quantization was made by using this repository: https://github.com/qwopqwop200/GPTQ-for-LLaMa/tree/triton
|
2 |
|
3 |
And I used the triton branch with all the gptq implementations available (true_sequential + act_order + groupsize 128)
|
|
|
1 |
+
This is the gptq 4bit quantization of this model: https://huggingface.co/openaccess-ai-collective/manticore-13b
|
2 |
+
|
3 |
This quantization was made by using this repository: https://github.com/qwopqwop200/GPTQ-for-LLaMa/tree/triton
|
4 |
|
5 |
And I used the triton branch with all the gptq implementations available (true_sequential + act_order + groupsize 128)
|