Joshua

Xenova

Organizations

Hugging Face, Google, Gradio, Hugging Face Internal Testing Organization, onnx, Huggingface.js, Huggingface Projects, Whisper Distillation, Hugging Face OSS Metrics, Core ML Projects, Blog-explorers, Useful Sensors Inc., Terrain Diffusion, Hugging Face TB Research, kotol, Static Templates, gg-hf, Nerfies, WebML Community, Llava Hugging Face, mozilla, Hugging Face - Visual Blocks, Social Post Explorers, hsramall, gg-tt, ONNX Community, Hugging Face Discord Community, LLHF, SLLHF, Hugging Quants, nltpt, Hugging Face Party @ PyTorch Conference, rmbg, ExecuTorch Community, Hello Games, G2P (Grapheme To Phoneme) Exploration

Posts 20

We did it. Kokoro TTS (v1.0) can now run 100% locally in your browser w/ WebGPU acceleration. Real-time text-to-speech without a server. ⚡️

Generate 10 seconds of speech in ~1 second for $0.

What will you build? 🔥
webml-community/kokoro-webgpu

The most difficult part was getting the model running in the first place, but the next steps are simple:
✂️ Implement sentence splitting, allowing for streamed responses
🌍 Multilingual support (only phonemization left)

Who wants to help?
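
For anyone picking up the sentence-splitting item, here is a rough sketch of the idea in TypeScript. It is not code from the Kokoro demo: `splitSentences` and `streamSentences` are hypothetical names, and the splitter is a naive regex that will mis-split abbreviations and decimals such as "3.14". The point is only that each sentence can be handed off as soon as it is isolated, instead of waiting on the whole text.

```typescript
// Hypothetical helper (not part of any Kokoro API): naive sentence splitter.
// A "sentence" is a run of non-terminator characters followed by one or more
// of . ! ? or, for trailing text, a run that reaches the end of the input.
function splitSentences(text: string): string[] {
  const matches = text.match(/[^.!?]+[.!?]+|[^.!?]+$/g) || [];
  return matches
    .map((sentence) => sentence.trim())
    .filter((sentence) => sentence.length > 0);
}

// Hypothetical helper: pass each sentence to a callback in order, so a caller
// could start synthesizing the first chunk before the rest is processed.
function streamSentences(
  text: string,
  onSentence: (sentence: string, index: number) => void
): void {
  splitSentences(text).forEach((sentence, index) => onSentence(sentence, index));
}

// Usage: collect the chunks (a real caller would synthesize audio here).
const chunks: string[] = [];
streamSentences("We did it. What will you build? Who wants to help", (s) => {
  chunks.push(s);
});
```

A production version would likely need locale-aware segmentation and a cap on chunk length, which this sketch leaves out.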

Articles 3

Google releases Gemma 2 2B, ShieldGemma and Gemma Scope