Model Description
This is a 4bit GPTQ quantization of Llasa-1B by the HKUSTAudio team. I tested using a script written by GitHub user nivibilla, linked below. The tests were successful, but the quality of the generated voice is often unusable. In case you don't believe me, I'll leave this model up here so you can test it yourself.
Model Sources
- Repository: HKUSTAudio/Llasa-1B
- Paper: LLaSA: Scaling Train-Time and Inference-Time Compute for LLaMA-based Speech Synthesis (Coming soon)
- Test Script: https://github.com/slives-lab/local-llasa-tts_voice/blob/main/llasa_vllm_longtext_inference.ipynb
- Downloads last month
- 32
Inference Providers
NEW
This model is not currently available via any of the supported third-party Inference Providers, and
the model is not deployed on the HF Inference API.