Commit fc2dc9c by JosephusCheung
1 Parent(s): 7e39dfd
Update README.md
README.md CHANGED
@@ -4,7 +4,8 @@ license: gpl-3.0
 
 Try Multimodal version with Colab Free T4 demo:
 
-![Open In Colab](
+[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://huggingface.co/JosephusCheung/GuanacoVQAOnConsumerHardware)
+
 
 This repository is for Guanaco model with 4-bit quantized weights. The model benefits from two novel techniques introduced by GPTQ: quantizing columns in order of decreasing activation size and performing sequential quantization within a single Transformer block. These innovations enable compact, consumer-level multilingual models to function effectively.
 
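
The README paragraph in this diff mentions quantizing columns in order of decreasing activation size. The following is a minimal sketch of that ordering step paired with plain round-to-nearest 4-bit quantization, not the GPTQ algorithm itself. GPTQ additionally propagates each column's rounding error to the columns not yet quantized, which is what makes the order matter; that step is omitted here. All function names and the toy data are hypothetical and are not taken from this repository.

```python
def activation_order(activations):
    """Return column indices sorted by decreasing sum of squared activations."""
    n_cols = len(activations[0])
    energy = [sum(row[j] ** 2 for row in activations) for j in range(n_cols)]
    return sorted(range(n_cols), key=lambda j: energy[j], reverse=True)


def quantize_4bit(values):
    """Round-to-nearest asymmetric 4-bit quantization.

    Returns (codes, scale, lo), where every code is an integer in 0..15.
    """
    lo, hi = min(values), max(values)
    scale = (hi - lo) / 15 or 1.0  # fall back to 1.0 when all values are equal
    codes = [min(15, max(0, round((v - lo) / scale))) for v in values]
    return codes, scale, lo


def dequantize(codes, scale, lo):
    """Map 4-bit codes back to approximate floating-point values."""
    return [c * scale + lo for c in codes]


# Hypothetical toy data: 3 calibration samples with 4 input features each.
activations = [
    [0.1, 2.0, -0.5, 0.0],
    [0.2, -1.5, 0.4, 0.1],
    [-0.1, 1.0, 0.6, 0.0],
]
order = activation_order(activations)

# Hypothetical weight row with one weight per input feature.
weights_row = [0.30, -0.75, 0.10, 0.55]
codes, scale, lo = quantize_4bit(weights_row)
restored = dequantize(codes, scale, lo)
```

In this sketch, `order` lists the columns that a GPTQ-style procedure would visit first, and `restored` differs from `weights_row` by at most half a quantization step per weight.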