zinc75 commited on
Commit
4129c9e
1 Parent(s): 962644d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -0
README.md CHANGED
@@ -40,6 +40,11 @@ Fine-tuned [facebook/wav2vec2-base-fr-voxpopuli-v2](https://huggingface.co/faceb
40
 
41
  When using this model, make sure that your speech input is **sampled at 16kHz**.
42
 
 
 
 
 
 
43
  ## Training procedure
44
 
45
  The model has been finetuned on Coommonvoice-v13 (FR) for 14 epochs on 4x2080 Ti GPUs using a ddp strategy and gradient-accumulation procedure (256 audios per update, corresponding roughly to 25 minutes of speech per update -> 2k updates per epoch)
 
40
 
41
  When using this model, make sure that your speech input is **sampled at 16kHz**.
42
 
43
+ ## Output
44
+
45
+ As this model is specifically trained for a speech-to-phoneme task, the output is sequence of [IPA-encoded](https://en.wikipedia.org/wiki/International_Phonetic_Alphabet) words, without punctuation.
46
+ If you don't read the phonetic alphabet fluently, you can use this excellent [IPA reader website](http://ipa-reader.xyz) to convert the transcript back to audio synthetic speech in order to check the quality of the phonetic transcription.
47
+
48
  ## Training procedure
49
 
50
  The model has been finetuned on Coommonvoice-v13 (FR) for 14 epochs on 4x2080 Ti GPUs using a ddp strategy and gradient-accumulation procedure (256 audios per update, corresponding roughly to 25 minutes of speech per update -> 2k updates per epoch)