Zen3 Audio
Collection
Speech recognition + text-to-speech. • 7 items • Updated
How to use zenlm/zen-3-tts-voice-design with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-to-speech", model="zenlm/zen-3-tts-voice-design") # Load model directly
from transformers import AutoModelForSeq2SeqLM
model = AutoModelForSeq2SeqLM.from_pretrained("zenlm/zen-3-tts-voice-design", dtype="auto")Zen3 TTS variant specialized for voice design: synthesize new speakers from natural-language descriptors (timbre, age, accent, affect) without reference audio. Part of the Zen3 audio family.
Served at api.hanzo.ai as zen3-tts-voice-design.
Mirror staging in progress. Use the canonical base via the base_model link in frontmatter until weights land here.
Built on Qwen/Qwen3-TTS-12Hz-1.7B-VoiceDesign (Apache-2.0). Mirrored here for the Zen3 audio family.
Base model
Qwen/Qwen3-TTS-12Hz-1.7B-VoiceDesign