keithito/lj_speech
Updated โข 1.32k โข 62
Fine-tuned from LibriTTS base (ckpt-15000) on LJSpeech.
| Architecture | CGN v2 (autoregressive) |
| Parameters | 206M (1024d / 16L) |
| Audio codec | SNAC @ 24kHz (3-level, 4096 codebook) |
| Text frontend | Flite G2P (ARPAbet) |
| Metric | Value |
|---|---|
| eval_loss | 2.727 |
| WER | 5.2% (Whisper base.en) |
| DNSMOS | 3.24 |