tanmaylaud
/

wav2vec2-large-xlsr-hindi-marathi

Automatic Speech Recognition

xlsr-fine-tuning-week

Inference Endpoints

Model card Files Files and versions Community

tanmaylaud commited on Mar 30, 2021

Commit

6750fe1

•

1 Parent(s): 0c60532

Update README.md

Files changed (1) hide show

README.md +4 -2

README.md CHANGED Viewed

@@ -32,7 +32,9 @@ model-index:
 Fine-tuned facebook/wav2vec2-large-xlsr-53 on Hindi and Marathi using the OpenSLR SLR64 datasets. When using this model, make sure that your speech input is sampled at 16kHz.
 ## Installation
 pip install git+https://github.com/huggingface/transformers.git datasets librosa torch==1.7.0 torchaudio==0.7.0 jiwer
 ## Eval dataset:
 ```bash
@@ -99,7 +101,7 @@ import re
 test = Dataset.from_csv('test.csv')
-chars_to_ignore_regex = '[\\,\\?\\.\\!\\-\\;\\:\\"\\“\\%\\‘\\”\\�\\।]'
 # Preprocessing the datasets.
 # We need to read the audio files as arrays
@@ -139,7 +141,7 @@ import numpy as np
 import re
 from datasets import load_dataset
-chars_to_ignore_regex = '[\\,\\?\\.\\!\\-\\;\\:\\"\\“\\%\\‘\\”\\�\\।]'
 # Preprocessing the datasets.
 # We need to read the audio files as arrays

 Fine-tuned facebook/wav2vec2-large-xlsr-53 on Hindi and Marathi using the OpenSLR SLR64 datasets. When using this model, make sure that your speech input is sampled at 16kHz.
 ## Installation
+```bash
 pip install git+https://github.com/huggingface/transformers.git datasets librosa torch==1.7.0 torchaudio==0.7.0 jiwer
+```
 ## Eval dataset:
 ```bash
 test = Dataset.from_csv('test.csv')
+chars_to_ignore_regex = '[\\\\,\\\\?\\\\.\\\\!\\\\-\\\\;\\\\:\\\\"\\\\“\\\\%\\\\‘\\\\”\\\\�\\\\।]'
 # Preprocessing the datasets.
 # We need to read the audio files as arrays
 import re
 from datasets import load_dataset
+chars_to_ignore_regex = '[\\\\,\\\\?\\\\.\\\\!\\\\-\\\\;\\\\:\\\\"\\\\“\\\\%\\\\‘\\\\”\\\\�\\\\।]'
 # Preprocessing the datasets.
 # We need to read the audio files as arrays