Model Description

This is an 8-bit GPTQ quantization of Llasa-3B by the HKUSTAudio team. I tested it using a script written by GitHub user nivibilla, linked below. I was not able to run it on my RTX 3090, although the quantized Llasa-1B worked fine, and I have not identified the cause. Please let me know if you can get it working.
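For anyone who wants to try, here is a minimal loading sketch. It is not the test script mentioned above. It assumes the checkpoint loads through the standard `transformers` GPTQ path, which needs `optimum` and a GPTQ backend such as `auto-gptq` or `gptqmodel` installed. The function name `load_quantized_llasa` is hypothetical, and the sketch only loads the language model and tokenizer; it does not generate audio.

```python
MODEL_ID = "AgeOfAlgorithms/Llasa-3b-GPTQ-8bit"


def load_quantized_llasa(model_id: str = MODEL_ID):
    """Load the quantized checkpoint and its tokenizer (hypothetical helper).

    Assumption: transformers reads the GPTQ settings from the repository's
    config, so no explicit quantization config is passed here.
    """
    # Imported lazily so this module can be imported without the heavy deps.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        device_map="auto",  # let accelerate place the weights on the GPU
    )
    return model, tokenizer
```

Calling `model, tokenizer = load_quantized_llasa()` downloads the weights on first use. If loading fails, the full traceback is the most useful thing to include in a report.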

Model Sources

Safetensors: model size 1.33B params, tensor types I32 · FP16

Model tree for AgeOfAlgorithms/Llasa-3b-GPTQ-8bit: Quantized (6), this model