Ilyes
/

wav2vec2-large-xlsr-53-french

Automatic Speech Recognition

xlsr-fine-tuning-week

Inference Endpoints

Model card Files Files and versions Community

Ilyes commited on Apr 3, 2021

Commit

252499b

•

1 Parent(s): d47dcf9

Update README.md

Files changed (1) hide show

README.md +3 -1

README.md CHANGED Viewed

@@ -45,7 +45,7 @@ ds = load_dataset("common_voice", "fr", split="test", cache_dir="./data/fr")
-chars_to_ignore_regex = '[\,\?\.\!\-\;\:\"\“\%\‘\”\�\‘\’\’\’\‘\…\·\!\ǃ\?\«\‹\»\›“\”\\ʿ\ʾ\„\∞\\|\.\,\;\:\*\—\–\─\―\_\/\:\ː\;\,\=\«\»\→]'
 def map_to_array(batch):
     speech, _ = torchaudio.load(batch["path"])
     batch["speech"] = resampler.forward(speech.squeeze(0)).numpy()
@@ -79,7 +79,9 @@ print(wer.compute(predictions=result["predicted"], references=result["target"]))
 ## Testing
 All the Common Voice `Test` dataset (15763 files) were used for testing.
 Results:
 WER=20.89%
 SER=77.56%

+chars_to_ignore_regex = '[\\,\\?\\.\\!\\-\\;\\:\\"\\“\\%\\‘\\”\\�\\‘\\’\\’\\’\\‘\\…\\·\\!\\ǃ\\?\\«\\‹\\»\\›“\\”\\\\ʿ\\ʾ\\„\\∞\\\\|\\.\\,\\;\\:\\*\\—\\–\\─\\―\\_\\/\\:\\ː\\;\\,\\=\\«\\»\\→]'
 def map_to_array(batch):
     speech, _ = torchaudio.load(batch["path"])
     batch["speech"] = resampler.forward(speech.squeeze(0)).numpy()
 ## Testing
 All the Common Voice `Test` dataset (15763 files) were used for testing.
 Results:
 WER=20.89%
 SER=77.56%