metadata

language:
  - fr
thumbnail: url to a thumbnail used in social sharing
tags:
  - text-generation
datasets:
  - Marxav/frpron
metrics:
  - loss/eval
  - perplexity
widget:
  - text: 'bonjour:'
  - text: 'salut, comment ça va:'
  - text: 'Louis XIII:'
  - text: 'anticonstitutionnellement:'
  - text: 'les animaux:'
inference:
  parameters:
    temperature: 0.01
    return_full_text: true

Fr-word to phonemic pronunciation

This model aims to predict the syllabized phonemic pronunciation of French words.

The generated pronunciation is:

A text string made of International Phonetic Alphabet (IPA) characters;
Phonemic (i.e. remains at the phoneme-level, not deeper);
Syllabized (i.e. characters '.' and '‿' are used to identify syllables).
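
As a rough illustration of the syllabized format, the sketch below splits a pronunciation string on the two separator characters named above. The function name `split_syllables` is hypothetical and is not part of the model or of Wiktionary tooling; the second example string is only an illustrative input, not a verified model output.

```python
import re

# The two separator characters named in this model card.
SYLLABLE_SEPARATORS = ".‿"


def split_syllables(pronunciation: str) -> list[str]:
    """Split a syllabized IPA string into its syllables.

    Both '.' and '‿' are treated as boundaries; empty pieces are dropped.
    """
    pattern = "[" + re.escape(SYLLABLE_SEPARATORS) + "]"
    return [part for part in re.split(pattern, pronunciation) if part]


if __name__ == "__main__":
    print(split_syllables("bɔ̃.ʒuʁ"))
    # Illustrative input only (assumed, not produced by the model):
    print(split_syllables("le.z‿a.ni.mo"))
```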

Such pronunciation is used in the French Wiktionary in the {{pron|...|fr}} tag.

To use this model, give an input containing the word that you want to transcribe, followed by ":", for example "bonjour:". The model then generates its predicted pronunciation, for example "bɔ̃.ʒuʁ".
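
The usage described above could be sketched in Python as follows. This is a minimal sketch, not the author's code: `build_prompt`, `extract_pronunciation` and `predict` are hypothetical helpers, `model_id` is a placeholder for this model's Hub identifier (not stated here), and `max_new_tokens=64` and `do_sample=True` are assumptions. The temperature and `return_full_text` values mirror the inference parameters in the metadata, and the length check reflects the 60-letter limit stated in this card.

```python
MAX_INPUT_LETTERS = 60  # input limit stated in this model card


def build_prompt(word: str) -> str:
    """Return the prompt format described above: the word followed by ':'."""
    word = word.strip()
    if not word:
        raise ValueError("word must not be empty")
    if len(word) > MAX_INPUT_LETTERS:
        raise ValueError(f"input is limited to {MAX_INPUT_LETTERS} letters")
    return word + ":"


def extract_pronunciation(generated_text: str, prompt: str) -> str:
    """Remove the echoed prompt (return_full_text is true) and keep the first line."""
    if generated_text.startswith(prompt):
        generated_text = generated_text[len(prompt):]
    lines = generated_text.strip().splitlines()
    return lines[0].strip() if lines else ""


def predict(word: str, model_id: str) -> str:
    """Run the model through the transformers text-generation pipeline.

    model_id is a placeholder: pass the Hub identifier of this model.
    """
    from transformers import pipeline  # imported lazily; requires transformers

    generator = pipeline("text-generation", model=model_id)
    prompt = build_prompt(word)
    outputs = generator(
        prompt,
        do_sample=True,        # assumption: sampling on so temperature applies
        temperature=0.01,      # value from the inference parameters above
        return_full_text=True, # value from the inference parameters above
        max_new_tokens=64,     # assumption, not taken from the model card
    )
    return extract_pronunciation(outputs[0]["generated_text"], prompt)
```

Calling `predict("bonjour", model_id)` would then return only the text generated after the colon, provided the model stops or starts a new line after the pronunciation.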

This model remains experimental. Additional finetuning is needed for:

The input length is currently limited to a maximum of 60 letters.

This work is derived from the OTEANN paper and code, which used minGPT.

More information on the model, dataset, hardware, and environmental considerations:

The training data

The dataset used to train this model comes from the French Wiktionary.

The model

The model is built on gpt2.