Inflect-Nano banner

Inflect-Nano GGUF Quantized Weights

This repository contains quantized GGUF weights of Inflect-Nano-v1 model for the InflectNanoTTS C++/GGML runtime. The original model was published by its author, owensong under Apache-2.0 license. Inflect-Nano-v1 aims to deliver ultra-lightweight elglish text to speech to edge devices. I will continue experimeting with it, the final goal is to run it on an ESP32 device.

Static Badge

Repository Contents

acoustic/
  inflect_acoustic_f16.gguf
  inflect_acoustic_q2_k.gguf
  inflect_acoustic_q3_k.gguf
  inflect_acoustic_q3_k_dynamic.gguf
  inflect_acoustic_q4_0.gguf
  inflect_acoustic_q4_k.gguf
  inflect_acoustic_q5_0.gguf
  inflect_acoustic_q5_k.gguf
  inflect_acoustic_q6_k.gguf
  inflect_acoustic_q8_0.gguf
vocoder/
  inflect_vocoder_f16.gguf
  inflect_vocoder_q4_0.gguf
  inflect_vocoder_q5_k.gguf
  inflect_vocoder_q8_0.gguf

Quality drop becomes significant lower than q3_k, others work relatively well.

Usage

Build the runtime from the InflectNanoTTS.cpp project:

./tools/build.sh

Run inference:

build/linux-arm64/inflect-nano \
  -a acoustic/inflect_acoustic_q4_0.gguf \
  -v vocoder/inflect_vocoder_q4_0.gguf \
  -d cmudict.bin \
  -t "Hello, this is a test." \
  -o output.wav

Adjust the binary path for your platform, for example build/macos-arm64/inflect-nano on Apple Silicon.

cmudict.bin is a compact binary pronunciation dictionary used by the text frontend.

It is derived from CMU Pronouncing Dictionary: English words mapped to ARPAbet phoneme sequences. The project converts that text dictionary into a faster binary lookup format so the runtime can avoid parsing the full text dictionary at startup.

Its purpose is to turn English words into phoneme IDs before acoustic inference. In practice: input text -> normalize/tokenize -> dictionary lookup for pronunciations -> phoneme/tone/lang IDs -> acoustic model -> mel -> vocoder audio.

cmudict.bin was obtained by converting cmudict.rep from the original repo with this script:

python tools/compile_cmudict.py \
    third_party/tiny_tts_frontend/tiny_tts/text/cmudict.rep \
    cmudict.bin
Downloads last month
153
GGUF
Model size
3.47M params
Architecture
inflect-acoustic
Hardware compatibility
Log In to add your hardware

2-bit

4-bit

5-bit

6-bit

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for remixerdec/Inflect-Nano-v1-GGUF

Quantized
(2)
this model