Hi, I'm looking for an open-source (permissively licensed) audio/music captioning model. Does anyone have any suggestions? Thanks!
Introducing StyleTTS 2 detector, an audio classification model that detects StyleTTS 2-generated vs. human-generated content! Dual-licensed under MIT/Apache 2.0.
Model Weights: mrfakename/styletts2-detector
Spaces: mrfakename/styletts2-detector
My Projects
Projects I've worked on (includes collabs)
- 🏆 TTS Arena: Vote on the top TTS models! (Running on CPU Upgrade, 416)
- 🖼️ OpenDalle V1.1 GPU Demo: A demo of OpenDalle V1.1 on a ZERO GPU. (Running on Zero, 374)
- 🤔 Did StyleTTS 2 Generate It?: Did StyleTTS 2 generate that audio?!? (Running, 7)
- 🗣️ MeloTTS: Fast, efficient, & multilingual text-to-speech (Running on T4, 296)
Spaces of the Week
My Spaces, or Spaces I worked on, that were featured on Spaces of the Week! Oldest at the top, newest at the bottom 🤗
- 🗣️ StyleTTS 2: Efficient, fast, and natural text to speech with StyleTTS 2! (Running on T4, 509)
- 🖼️ OpenDalle V1.1 GPU Demo: A demo of OpenDalle V1.1 on a ZERO GPU. (Running on Zero, 374)
- 🎵 RWKV Music: Generate MIDI music using RWKV v4! (Running on Zero, 65)
- 🏆 TTS Arena: Vote on the top TTS models! (Running on CPU Upgrade, 416)