Convert spoken text to another language with original voice
Transcribe audio into text from file or URL
Transform and identify speech with MMS