Step Audio 2 Mini
Generate audio responses from text or voice input
Generate audio responses from text or voice input
An end-to-end speech large language model.
Launch VibeVoice demo for text-to-speech conversion
Identify speakers in an audio file
Generate speech from text in Japanese
Generate anime character voice
Text-to-speech (TTS) with Next-gen Kaldi
Efficient, fast, and natural text to speech with StyleTTS 2!
Turn Any Article to Podcast
Separate speakers in audio recordings
Realtime implementation of Whisper large turbo
Spanish finetune for the original F5 model.
Transcribe audio to text
Conversational speech generation
Generate text and speech from audio, video, and text inputs
ASR demo using onnx-asr
Higgs Audio Demo
State-of-the-art TTS model under 25MB
An Open-Source Omni-Model for Long Audio and Voice Clone
This is a demo for OWSM-V4 CTC and medium model.
Chatterbox TTS supporting 23 languages
Turn any ebook into audiobook, 1107+ languages supported!
Generate audio from text with AI voices
converts text into natural-sounding speech using AI.