MOSS-TTS-PNY GGUF

Portable GGUF/ONNX artifacts for the MOSS-TTS-PNY C++ runtime.

This repository is intended for use with the model-free Windows Vulkan + DirectML bundle produced by the companion C++ project. Put these files in the bundle's models/ directory, or run download-models.bat from the bundle root.

Files

File Purpose
moss-tts-pny-f16.gguf Full precision main MOSS TTS GGUF.
moss-tts-pny-global-q8_0.gguf Main model with the global transformer quantized to Q8_0.
moss-tts-pny-global-q6_k.gguf Main model with the global transformer quantized to Q6_K.
moss-audio-decoder4-f16.gguf Full precision decoder4 codes-to-features model.
moss-audio-decoder4-q8_0.gguf Q8_0 decoder4 codes-to-features model.
moss-tts-qwen2-tokenizer.gguf Qwen2 tokenizer vocabulary GGUF used by the runtime.
istftnet2_decoder.onnx ONNX iSTFTNet2 vocoder used by the Windows DirectML path.
istftnet2-vocoder-f16.gguf Experimental GGUF vocoder artifact. The packaged demos currently use ONNX.

Windows Bundle Usage

From the extracted Windows bundle root:

download-models.bat
run-vulkan-directml-full.bat

download-models.bat uses public HTTPS downloads from this repository. It does not require the Hugging Face CLI, Python, Git LFS, or a Hugging Face account.

The quantized demo uses:

run-vulkan-directml-quant.bat

Expected model layout:

models/
  moss-tts-pny-f16.gguf
  moss-tts-pny-global-q8_0.gguf
  moss-tts-pny-global-q6_k.gguf
  moss-audio-decoder4-f16.gguf
  moss-audio-decoder4-q8_0.gguf
  moss-tts-qwen2-tokenizer.gguf
  istftnet2_decoder.onnx
  istftnet2-vocoder-f16.gguf

Notes

  • The model-free Windows zip intentionally excludes these large artifacts.
  • The full demo uses moss-tts-pny-f16.gguf and moss-audio-decoder4-f16.gguf.
  • The quant demo uses moss-tts-pny-global-q6_k.gguf and moss-audio-decoder4-q8_0.gguf.
  • The ONNX vocoder is the supported vocoder path for the DirectML Windows demo.
Downloads last month
-
GGUF
Model size
31.6M params
Architecture
moss_istftnet2
Hardware compatibility
Log In to add your hardware

6-bit

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support