A text-to-speech model powered by SparkAudio and Mobvoi.
Demo of the OSUM project from the ASLP Lab at Northwestern Polytechnical University
Transcribe audio from a microphone, a file, or a YouTube link