TTS / Audio

Nymbo 's Collections

Hub Models

Hub Datasets

API LMs

Small LMs

Photo / Video

TTS / Audio

Utility

Data Utilities

Zero GPU Spaces

Games / Fun

Local & GGUF

Leaderboards

Templates

Gradio Themes

updated 4 days ago

Upvote

Running

alytts

📊

Generate speech from text using OpenAI API
Running

4

4

Voice Cloning

🎤
Running

10

10

TTS for 1,100+ Languages

🌍

Text-to-Speech, Speech-to-Text, and Language Recognition
Sleeping

H2O Wave Whisper

🎙

Display interactive web applications using H2O Wave
Paused

XTTS

🐸
Paused

MusicGen

🎵
Paused

Seamless M4T v2

📞
Running

16

16

Voice Clone Simple

🏃

Clone a voice using a text and audio sample
Running

1

1

CoquiTTS (Official)

🐸

Generate audio from text using pre-trained models
Paused

Parakeet TDT 1.1b
Paused

Image to Music v2

🎺
Running

Whisper Speech X DreamTalk

😽
Paused

Canary 1b

🐤
Paused

Audiogen

💻
Running

2

2

🎤🗣️EZVoiceCloner

🎤

Create custom voice clones using text input
Running

2

2

Music Playground

💻

Create interactive music playlists with AI assistance
Running

1

1

Whisper.cpp WASM

📉

Convert audio to text using a model
Running

4

4

Video SoundFX

👂

Generate audio effects from video using image caption
Running

4

4

EZ Voice Clone

⚡

Generate voice from text with customizable audio source
Paused

Whisper

📉
Sleeping

Faster Whisper Webui

🚀
Paused

MetaVoice 1B

🗣

A demo of MetaVoice 1B, a new TTS model by MetaVoice.
Sleeping

OpenVoice

🤗
Running

Speech Recognition Vue

👀

Transcribe audio to text using selected models
Sleeping

1

1

SeamlessOnDevice

👩
Running

Text To Speech Client

👀

Convert text to speech
Sleeping

Musiclang

⚡
Running

Ultimate Vocal Remover WebUI

🎵

Run a web-based application
Sleeping

RVC Inference HF

👀
Running

Ratchet + Whisper (Next.js)

🗣

Convert audio to text
Sleeping

Bark Simple

🐕
Running

3

3

Easy GUI (English)

😻

Convert voice to another voice
Runtime error

1

1

Video Dubbing

🚀
Paused

Create Your Own TTS Dataset

🔥
Sleeping

3

3

VoiceCraft

📈

Generate or edit spoken audio from text
Sleeping

2

2

Faster Whisper Webui with translate

✨
Sleeping

Aesthetic RVC Inference HF

🍏
Running on Zero

811

811

Parler-TTS

🥖

High-fidelity Text-To-Speech
Running

1

1

MusicGen Web

🎵

In-browser text-to-music w/ Transformers.js!
Running

1

1

Text To Speech Client

👀

Convert text to speech
Running

Semantic Audio Search w/ Transformers.js

🎵

Search... music by typing a description
Sleeping

1

1

seewav-gui

🔊
Running

246

246

Voice Clone Multilingual

🏃

Languages ru,en,zh-cn,ja,de,fr,it,pt,pl,tr,ko,nl,cs,ar,es,hu
Running

311

311

— AI Jukebox —

🎶

Generate music powered by AI
Sleeping

1

1

Edge TTS

📝
Paused

Hum an idea ➡️ Music

🔥
Sleeping

5

5

JARVIS

🔥

Voice Chat with JARVIS
Configuration error

Ratchet + Whisper Locally

🗣

Run Whisper in Browser
Runtime error

1

1

Clone Your Voice

📚
Running

2

2

Real-time Whisper WebGPU

🎤

Convert voice to text
Sleeping

1

1

Whisper-Auto-Subtitled-Video-Generator

🎥
Running

43

43

XTTS Voice Clone on CPU

🚀

Generate audio by cloning a voice
Running

1

1

ElevenLabs TTS

🗣

Generate speech from text using ElevenLabs voices
Sleeping

Rabbit TTS

🐰

TTS, STT
Sleeping

Voice Chat AI

📊

Voice chat with AI that has web access
Paused

MassivelyMultilingualTTS

🌍
Paused

Mars5 Space

📉
Running on Zero

1.83k

1.83k

Voice Clone

🗣

Clone voice to say text
Running on Zero

403

403

Stable Audio Open Zero

🔥

Generate audio from text prompts
Paused

Audio WebUI

🐨
Sleeping

Transcribe Anything 2

👁
Running

18

18

Fastwhisper

🦀

Transcribe or translate audio files
Running

Local Text To Speech

🚀

Generate speech from text
Running

2

2

BeatManipulator

🥁

Generate a modified audio track and beat image from an uploaded song
Running

Candle Whisper

👀

Transcribe audio to text in the browser
Running

Whisper Timestamped

🕒

In-browser speech recognition w/ word-level timestamps
Running

Whisper Speaker Diarization

🗣

Separate different speakers in an audio conversation
Runtime error

363

363

Whisper Webui

⚡
Running

249

249

Faster Whisper Webui

🚀

Transcribe audio to text with speaker diarization
Running

Openai Whisper Live Transcribe

🎙
Sleeping

Whisper Transcribe

💻
Sleeping

1

1

Efficient Audio Captioning

🔊
Sleeping

1

1

Huggingartists

🐠
Runtime error

13

13

Edge TTS w/ More Options

👁

Generate speech from text using various voices
Sleeping

Video To MP3

🏢
Running

4

4

Media Downloader

💻

easy download youtube audios with gradio
Paused

MusiConGen

🪩
Sleeping

Bark Voice Cloning

🐶
Sleeping

1

1

NeonAI Coqui AI TTS Plugin

🐸
Running

169

169

Qwen2 Audio Instruct Demo

🌍

Interact with a multimodal chatbot using text and audio
Running

41

41

Doc To Dialogue

👀

Transform a report or document into an interview/discussion
Running on Zero

113

113

Llama3.1 S V0.2 Checkpoint 2024 08 20

😻

Convert text to audio and vice versa
Sleeping

Translate 100

👭
Sleeping

5

5

Mini Omni

⚡
Sleeping

1

1

Groq Gradio Voice Assistant

👁
Paused

1

1

FLUX GIFs

📽
Running on Zero

217

217

OpenMusic

🎶

Generate high-quality music from text descriptions
Running

Whisper Large V3 Turbo WebGPU

🚀

ML-powered speech recognition directly in your browser
Sleeping

1

1

FreeTranscriptMaker

📚

Convert audio to text with ease and accuracy.
Running on Zero

61

61

VoiceRestore

🏢

Restore degraded audio using a Transformer-based model
Running

37

37

GPT-SoVITS-3s-cloning-free-TTS

🎙

Generate audio from text using selected character voices
Paused

EzAudio

🟣
Paused

EzAudio ControlNet

🟣
Sleeping

Reverb ASR Demo

🌍
Running on Zero

2.01k

2.01k

F5-TTS

🗣

F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Paused

Midi Music Generator

🎼
Running

5

5

MP3 Transcribe

💻

Whisper Transcribe MP3 files, use a GPU to convert faster!
Paused

VoiceRestore

🔊
Paused

Fish Agent

💬

An end-to-end (e2e) Voice Language Model by Fish Audio.
Paused

Audio🔹Separator

🏃

Vocal and background audio separator
Paused

EchoMimic

🐨

Audio-Driven Portrait Animations
Paused

2

2

Audio SR

🔊

Fixed fork of the original audio sr!
Paused

Hertz Dev

🌍

base model for mono-channel completion
Sleeping

OuteTTS 0.1 350M Demo

📉

Generate speech from text with or without voice cloning
Sleeping

Audio Lyrics Extractor

🎵
Sleeping

OuteTTS 0.2 500M Demo

🐠
Running on L4

455

455

Fish Speech 1

🏆

Generate speech from text
Running

Text-to-Speech WebGPU

🗣

WebGPU text-to-Speech powered by OuteTTS and Transformers.js
Running

Nexa Omni Demo

🎧

Transform audio into text using a web-based model
Running

Moonshine Web

🌙

Real-time in-browser speech recognition
Paused

4

4

MMAudio

🔊

Video to Audio
Running

Kokoro Text-to-Speech

🗣

High-quality speech synthesis powered by Kokoro TTS
Paused

Speech To Speech Translation

🏆

Translate and synthesize speech to English
Paused

1

1

Make Custom Voices With KokoroTTS

⚡

Make Custom Voices With KokoroTTS
Runtime error

278

278

Llasa 3b Tts

🔥

Zero Shot voice cloning with llasa 3b (Unofficial Demo)
Running

1

1

Music Descriptor

🚀

Analyze music to identify genre, instrument, mood, and more
Paused

YuE

👩
Sleeping

SoundwaveDemo

📉

Interpret audio based on text instructions
Configuration error

197

197

AI Podcast Generator

🎙

Generate Podcast using Kokoro-TTS!
Paused

Spark TTS

🌖

A text-to-speech model powered by SparkAudio and Mobvoi.
Running on Zero

321

321

Di♪♪Rhythm

🎶

Blazingly Fast and Embarrassingly Simple Song Generation
Sleeping

EDGE TTS

🚅

Answer in speech

Upvote

Collection guide
Browse collections

TTS / Audio

alytts

Voice Cloning

TTS for 1,100+ Languages

H2O Wave Whisper

XTTS

MusicGen

Seamless M4T v2

Voice Clone Simple

CoquiTTS (Official)

Parakeet TDT 1.1b

Image to Music v2

Whisper Speech X DreamTalk

Canary 1b

Audiogen

🎤🗣️EZVoiceCloner

Music Playground

Whisper.cpp WASM

Video SoundFX

EZ Voice Clone

Whisper

Faster Whisper Webui

MetaVoice 1B

OpenVoice

Speech Recognition Vue

SeamlessOnDevice

Text To Speech Client

Musiclang

Ultimate Vocal Remover WebUI

RVC Inference HF

Ratchet + Whisper (Next.js)

Bark Simple

Easy GUI (English)

Video Dubbing

Create Your Own TTS Dataset

VoiceCraft

Faster Whisper Webui with translate

Aesthetic RVC Inference HF

Parler-TTS

MusicGen Web

Text To Speech Client

Semantic Audio Search w/ Transformers.js

seewav-gui

Voice Clone Multilingual

— AI Jukebox —

Edge TTS

Hum an idea ➡️ Music

JARVIS

Ratchet + Whisper Locally

Clone Your Voice

Real-time Whisper WebGPU

Whisper-Auto-Subtitled-Video-Generator

XTTS Voice Clone on CPU

ElevenLabs TTS

Rabbit TTS

Voice Chat AI

MassivelyMultilingualTTS

Mars5 Space

Voice Clone

Stable Audio Open Zero

Audio WebUI

Transcribe Anything 2

Fastwhisper

Local Text To Speech

BeatManipulator

Candle Whisper

Whisper Timestamped

Whisper Speaker Diarization

Whisper Webui

Faster Whisper Webui

Openai Whisper Live Transcribe

Whisper Transcribe

Efficient Audio Captioning

Huggingartists

Edge TTS w/ More Options

Video To MP3

Media Downloader

MusiConGen

Bark Voice Cloning

NeonAI Coqui AI TTS Plugin