Audio Spaces
- Runtime error70π
- Running on T4948π
Seamless M4T
- Running on A10G4.49kπ΅
MusicGen
- Running on A10G772π
Audioldm Text To Audio Generation
- Running on A10G284π
AudioLDM2 Text2Audio Text2Music Generation
- Runtime error220π
AudioSep
- Running152π΅π΅π΅
Lp Music Caps
- Running on T4233π’
Tortoise Tts
ExpressivText-to-Speech
- Sleeping13π
All In One
- Running on T42.05kπΈ
XTTS
- Paused188πΈπΆ
Coqui Bark Voice Cloning
- Running on A10G342π
VALL E X
- Sleeping189π₯
WavJourney
- Paused266πΆπ
Music To Image
- Running on A10G260π
MMS
- Running534π£οΈ
ElevenLabs TTS
- Runtime error287π
AudioGPT
- Running on T42.04kπΆ
Bark
- Runtime error36π©βπ€
SpeechT5 Speech Recognition Demo
- Runtime error172πΈ
CoquiTTS (Official)
- Running on L41.8kπ
Whisper
- Running on CPU Upgrade599πποΈ
Moe TTS
- Build error17π₯
YourTTS
- Running538π
Talking Face Generation with Multilingual TTS
- Running563π
OpenAI TTS New
- Running on A10G162π’
Mustango
- Sleeping55π
OWSM Demo
- Running on T4580π£οΈ
StyleTTS 2
Efficient, fast, and natural text to speech with StyleTTS 2!
- Running on T4357β‘
HierSpeech++ (Zero-shot TTS)
- Running on T418π
Video2music
- Running on T4182π€«
Whisper Large V2
- Running on T456π
Musicgen Prompt Upsampling
- Runtime error47π€
Qwen-Audio
- Runtime error514π
Seamless M4T v2
- Running on T4236π
Seamless Streaming
- Runtime error47π΅
Matcha TTS
- Running on Zero233π₯
MusicGen Streaming
- Running on T4278π
Resemble Enhance
- Running on A10G225πΌ
Singing Voice Conversion
- Sleeping50π§
NaturalSpeech2
- Paused21π₯
Create Your Own TTS Dataset
- Sleepingπ’
Podcast Transcription
- Running975π€
OpenVoice
- Running on L40S126βπ»
NeMo Speech-to-Text
- Runtime error93π»
M2UGen Demo
- Runtime error70π
Pheme
- Sleeping5π
ESPnet2 TTS
- Running13π
Whisper-WebUI
- Running172π
Image2SFX Comparison
Generates audio environment from an image
- Running on T4378π¬οΈπ¬π
WhisperSpeech
- Build error147π£οΈ
MetaVoice 1B
A demo of MetaVoice 1B, a new TTS model by MetaVoice.
- Running on CPU Upgrade490π
TTS Arena
Vote on the latest TTS models!
- Runtime error167π½
Whisper Speech X DreamTalk
Combine voice cloning and portrait lipsync animation
- Running on T4172π€
Canary 1b
- Sleeping75β‘
SALMONN Audio Questioning
Deeply interrogate audio file content
- Running on T4382π£οΈ
MeloTTS
Fast, efficient, & multilingual text-to-speech
- Running on Zero262π§
Audio Editing
Edit audios with text prompts
- Runtime error18π»
ChatMusician
- Running on CPU Upgrade60π§ββοΈπ§ββοΈπ§ββοΈ
xVASynth TTS
CPU powered, low RTF, emotional, multilingual TTS
- Running on Zero164π
NaturalSpeech3 FACodec
- Sleeping22βοΈ
Hey Gemma
- Configuration error68π£οΈποΈ
Ratchet + Whisper
- Paused3π
AutoSubs
Automatically add on-screen subs to your videos
- Build error162π
VoiceCraft
- Running on Zero119π
Tango2
Fast Text to Audio Generator
- Running on Zero730π₯
Parler-TTS
High-fidelity Text-To-Speech
- Sleeping178π₯
Sing an idea β‘οΈ Music
Bring song ideas to life
- Running on Zero55π
Musicgen Songstarter Demo
- Paused90π
Whisper JAX
- Running on Zero15π’
AudioLCM
- Running on Zero155π»
Stable Audio Live Multiplayer
- Running on Zero345π₯
Stable Audio Open Zero
- Running on Zero12π
Make An Audio 3
- Sleeping60π
Mars5 Space
- Runtime error5π΅
Tango Music AF
Text to Music Generator
- Runtime error6π₯
Tango AF
Text to Audio Generator
- Running on Zero90π
BigVGAN
- Running on Zero64π
SenseVoice
- Running on Zero50π
CosyVoice 300M
- Running on Zero21π
PicoAudio
- Running on A10G29πͺ©
MusiConGen
- Running14π
Mms Zeroshot
- Running138π
Qwen2 Audio Instruct Demo
- Running on Zero66π€
GPT SoVITS V2
- Running on Zero244π£
EzAudio
- Running on Zero205πΆ
OpenMusic
- Running on Zero435πΌπΆ
Midi Music Generator
- Running on Zero561π€―
Whisper Turbo
- Running on Zero251π€―
Realtime Whisper Turbo
Realtime implementation of Whisper large turbo
- Running109π
Whisper Large V3 Turbo WebGPU
ML-powered speech recognition directly in your browser
- Running on Zero4π
Text2midi
- Running on A10G279π
Fish Speech 1
- Running89π€π
TTS Spaces Arena
Vote on the top HF TTS models!
- Running on Zero14π£οΈ
Diva Realtime Chat
- Running on Zero1.06kπ£οΈ
F5-TTS
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
- Running on Zero183π»
MaskGCT TTS Demo
MaskGCT TTS Demo
- Running on L40S10π¬
Fish Agent
An end-to-end (e2e) Voice Language Model by Fish Audio.