Model Description

This is a 4-bit GPTQ quantization of Llasa-3B, released by the HKUSTAudio team. I tested it using a script written by GitHub user nivibilla, linked below. For reasons I have not identified, I was unable to run this model on my RTX 3090, while the quantized Llasa-1B worked fine. Please let me know if you can get it working.
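
For reference, below is a minimal sketch of how a 4-bit GPTQ checkpoint like this one is typically loaded through transformers (with optimum and auto-gptq installed). This is an assumption about the standard loading path, not nivibilla's test script, and it only runs a plain text generation rather than Llasa's full speech-token pipeline; the prompt and generation settings are placeholders.

```python
# Minimal sketch (untested): load the 4-bit GPTQ checkpoint via transformers.
# Requires: pip install transformers optimum auto-gptq
# The quantization config is read from the repo; only the non-quantized
# tensors are kept in FP16. Prompt and generation settings are placeholders.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "AgeOfAlgorithms/Llasa-3b-GPTQ-4bit"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",          # place layers on the available GPU(s)
    torch_dtype=torch.float16,  # dtype for the non-quantized tensors
)

prompt = "Hello"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
with torch.no_grad():
    output = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```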

Model Sources
