Text-handling issues?

#24

by creativeedge - opened 8 days ago

I cannot get any of the examples in the text-handling section to sound as natural as the demo audio there.
https://huggingface.co/spaces/Supertone/supertonic#text-handling
How do I get audio like that?

Here is the command I used to generate the attached audio. The output failed on every sample from that section. For example, it says "5.2M" instead of "5.2 million dollars", and it cannot read the phone number correctly.

supertonic tts -o stonic-text-handling.wav 'The startup secured $5.2M in venture capital, a huge leap from their initial $450K seed round. The train delay was announced at 4:45 PM on Wed, Apr 3, 2024 due to track maintenance. You can reach the hotel front desk at (212) 555-0142 ext. 402 anytime. Our drone battery lasts 2.3h when flying at 30kph with full camera payload.' --voice F1 --steps 12
