GGUF + pure-C++ runtime in CrispASR — Moonshine base

by cstr - opened 16 days ago

We've added Moonshine-base to CrispASR. It runs on the same moonshine backend as tiny: the model size is dispatched on GGUF metadata, and moonshine-impl.h is shared between sizes.
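
For readers wondering what "dispatched on GGUF metadata" can look like, here is a minimal sketch, not the actual CrispASR code. It assumes the per-size hyperparameters are stored as integer key/value pairs in the GGUF header. The key names, the `MoonshineHParams` struct, and the `Metadata` map (a stand-in for a real GGUF reader) are all hypothetical.

```cpp
#include <cstdint>
#include <map>
#include <stdexcept>
#include <string>

// Hyperparameters a shared backend would need to tell model sizes apart.
struct MoonshineHParams {
    uint32_t n_enc_layers;
    uint32_t n_dec_layers;
    uint32_t d_model;
};

// Stand-in for GGUF key/value metadata; a real loader would read these
// values from the file header instead of an in-memory map.
using Metadata = std::map<std::string, uint32_t>;

// Build the hyperparameters from metadata so one code path serves all sizes.
// The key names below are hypothetical; the real GGUFs may use other keys.
MoonshineHParams hparams_from_metadata(const Metadata& kv) {
    auto get = [&](const std::string& key) {
        auto it = kv.find(key);
        if (it == kv.end()) throw std::runtime_error("missing GGUF key: " + key);
        return it->second;
    };
    return {get("moonshine.encoder.block_count"),
            get("moonshine.decoder.block_count"),
            get("moonshine.embedding_length")};
}
```

With the base figures from this post (8 encoder layers, 8 decoder layers, 416 dimensions) in the map, the same function would serve tiny by reading different values, which is the point of sharing one implementation header.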

Runtime details:

Conv stem + 8-layer transformer encoder + 8-layer decoder (model dimension 416, partial RoPE, SiLU).
KV-cached autoregressive decode with flash attention.
Companion-file mechanism for tokenizer.bin on auto-download.
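
As an illustration of the "partial RoPE" item above, here is a minimal sketch of rotating only the leading part of one attention-head vector and passing the rest through unchanged. It is not taken from the CrispASR source. The interleaved pairing of elements, the frequency base of 10000, and the choice of `rot_dim` are assumptions, and `partial_rope` is a hypothetical helper name.

```cpp
#include <cmath>
#include <cstddef>
#include <vector>

// Apply rotary position embedding to the first `rot_dim` entries of one
// attention-head vector; entries at index >= rot_dim are left untouched.
// Assumptions (not from the post): interleaved pairs (x[i], x[i+1]),
// frequency base 10000, and a caller-chosen rot_dim.
void partial_rope(std::vector<float>& head, std::size_t rot_dim, int pos,
                  float base = 10000.0f) {
    if (rot_dim > head.size()) rot_dim = head.size();
    rot_dim -= rot_dim % 2;  // rotate whole pairs only
    for (std::size_t i = 0; i < rot_dim; i += 2) {
        // Rotation angle for this pair: pos * base^(-i / rot_dim).
        const float theta =
            pos * std::pow(base, -static_cast<float>(i) / rot_dim);
        const float c = std::cos(theta);
        const float s = std::sin(theta);
        const float x0 = head[i];
        const float x1 = head[i + 1];
        head[i]     = x0 * c - x1 * s;
        head[i + 1] = x0 * s + x1 * c;
    }
}
```

In a decoder this would be applied to each query and key head at its token position before attention; the untouched tail of each head carries position-independent features.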

Pre-quantised GGUFs (MIT): cstr/moonshine-base-GGUF — plus the per-language variants we converted: cstr/moonshine-base-{ja,ko,zh,ar,vi,uk}-GGUF.

./build/bin/crispasr --backend moonshine -m moonshine-base-q4_k.gguf -f audio.wav -osrt

Companion tiny size: cstr/moonshine-tiny-GGUF. Streaming variants: cstr/moonshine-streaming-{tiny,small,medium}-GGUF (separate moonshine-streaming backend).
