Instructions to use Supertone/supertonic-3 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Supertonic
How to use Supertone/supertonic-3 with Supertonic:
from supertonic import TTS tts = TTS(auto_download=True) style = tts.get_voice_style(voice_name="M1") text = "The train delay was announced at 4:45 PM on Wed, Apr 3, 2024 due to track maintenance." wav, duration = tts.synthesize(text, voice_style=style) tts.save_audio(wav, "output.wav")
- Notebooks
- Google Colab
- Kaggle
Russian language TTS model misplaces word stress on certain words, affecting pronunciation accuracy
The Russian-language TTS model occasionally places lexical stress on the wrong syllable in certain words, resulting in unnatural or incorrect pronunciation. This issue is most noticeable in:
Multi-syllabic words with non-default stress patterns
Homographs where stress changes meaning (e.g., зАмок / замОк)
Loanwords and proper nouns
While the model generally produces intelligible speech, incorrect stress reduces naturalness and may cause confusion in professional or educational contexts.
Words should be pronounced with correct lexical stress according to standard Russian orthoepy:
дОговор (not договОр)
звОнит (not звонИт)
кУхонный (not кухОнный)
тОрты (not тортЫ)
Russian is a stress-accent language where stress is phonemic (changes meaning) and unpredictable from spelling alone.
The model does not appear to leverage external stress dictionaries (e.g., OpenCorpora, Zaliznyak's grammar) or context-aware disambiguation.