Polish numeral inflection

#13
by szokun - opened

The TTS model incorrectly reads Polish hour expressions using cardinal numbers instead of ordinal numbers.
For example, the sentence:

“Wybiła godzina 13”

is synthesized as:

“wybiła godzina trzynaście”

instead of the grammatically correct:

“wybiła godzina trzynasta”.

In Polish, hour expressions require ordinal numeral inflection agreeing with the noun “godzina”. The model currently fails to apply the correct morphological transformation.

Supertone org

Hi,

Thank you for the clear example and explanation.

For Supertonic 3, we tried to rely less on language-specific text normalization rules and instead let the model learn these reading patterns from diverse training data. However, the amount and coverage of data differs by language, and cases like Polish hour expressions can still fail when the model has not seen enough of the relevant grammatical patterns.

Your example is very helpful:

Wybiła godzina 13 should be read as wybiła godzina trzynasta, not trzynaście.

We’ll document this as a Polish text normalization / morphology issue and use it as a reference case for future updates. In the next versions, we’ll try to improve this kind of language-specific reading behavior so Polish time expressions and similar constructions are handled more naturally.

Thank you again for reporting it.

Sign up or log in to comment