Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
mrfakename 
posted an update Feb 24
Post
Today, I’m thrilled to release a project I’ve been working on for the past couple weeks in collaboration with Hugging Face: the TTS Arena.

The TTS Arena, inspired by LMSys's Chatbot Arena, allows you to enter text which will be synthesized by two SOTA models. You can then vote on which model generated a better sample. The results will be published on a publicly-accessible leaderboard.

We’ve added several open access models, including Pheme, MetaVoice, XTTS, OpenVoice, & WhisperSpeech. It also includes the proprietary ElevenLabs model.

If you have any questions, suggestions, or feedback, please don’t hesitate to DM me on X (https://twitter.com/realmrfakename) or open a discussion in the Space. More details coming soon!

Try it out: TTS-AGI/TTS-Arena

There is some sort of toxicity test that prevents natural input.

·

The filter should be more relaxed now, please let me know if it’s working better!

maybe this will promote more models to be put on the hub from large companies.

kudos for your awesome work on this @mrfakename