dac-44khz-GGUF

GGUF conversion of descript/dac_44khz โ€” the Descript Audio Codec (DAC) decoder at 44.1 kHz.

Used as the vocoder backend for Zonos-v0.1-transformer-GGUF and other DAC-based TTS models running in CrispASR.

Files

File Size Description
dac-44khz-f16.gguf 104 MB F16 weights, full precision

Architecture

The DAC decoder converts 9 residual VQ codebooks (1024 entries each, ~86 codes/s) to 44.1 kHz mono PCM:

  • RVQ quantizer: 9 codebooks ร— 1024 entries ร— 8-dim โ†’ projected to 1024-dim
  • 4-stage convolutional decoder with Snake1d activations and upsampling ratios [8, 8, 4, 2]
  • Total upsampling factor: 512 โ†’ 86 Hz codes ร— 512 = 44,032 Hz โ‰ˆ 44.1 kHz

Usage

Place alongside the Zonos transformer GGUF or pass via --codec-model:

# Auto-discovered if named dac-44khz-f16.gguf in the same directory as the AR model
crispasr tts --backend zonos-tts \
  --model /path/to/zonos-v0.1-transformer-q4_k.gguf \
  --codec-model /path/to/dac-44khz-f16.gguf \
  "Hello world."

Conversion

Converted from descript/dac_44khz using models/convert-dac-to-gguf.py in CrispASR. Source model is MIT licensed (descriptinc/descript-audio-codec).

Downloads last month
187
GGUF
Model size
54.3M params
Architecture
dac-44khz
Hardware compatibility
Log In to add your hardware

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for cstr/dac-44khz-GGUF

Quantized
(2)
this model