dragonSwing committed on
Commit e0ac1cb
1 Parent(s): 1c0c6e6

Update README.md

Files changed (1)
  1. README.md +9 -2
README.md CHANGED
@@ -46,8 +46,8 @@ model-index:
  value: 4.04
  ---
  # Wav2Vec2-Base-Vietnamese-270h
- Fine-tuned Wav2Vec2 model on Vietnamese Speech Recognition task using about 270h labelled data combined from multiple datasets including [Common Voice](https://huggingface.co/datasets/common_voice), [VIVOS](https://huggingface.co/datasets/vivos), [VLSP2020](https://vlsp.org.vn/vlsp2020/eval/asr). The model was fine-tuned using SpeechBrain toolkit with a custom tokenizer. For a better experience, we encourage you to learn more about [SpeechBrain](https://speechbrain.github.io/).
- When using this model, make sure that your speech input is sampled at 16kHz.
+ Fine-tuned Wav2Vec2 model on Vietnamese Speech Recognition task using about 270h labelled data combined from multiple datasets including [Common Voice](https://huggingface.co/datasets/common_voice), [VIVOS](https://huggingface.co/datasets/vivos), [VLSP2020](https://vlsp.org.vn/vlsp2020/eval/asr). The model was fine-tuned using SpeechBrain toolkit with a custom tokenizer. For a better experience, we encourage you to learn more about [SpeechBrain](https://speechbrain.github.io/).
+ When using this model, make sure that your speech input is sampled at 16kHz.
  Please refer to [huggingface blog](https://huggingface.co/blog/fine-tune-wav2vec2-english) on how to fine-tune this model on a specific language.

  ### Benchmark WER result:
@@ -58,6 +58,13 @@ Please refer to [huggingface blog](https://huggingface.co/blog/fine-tune-wav2vec

  The language model was trained using [Oscar](https://huggingface.co/datasets/oscar-corpus/OSCAR-2109) dataset on about 32GB of written text.

+ ### Install SpeechBrain
+ To use this model, you should install speechbrain from source. This is not required for speechbrain version > 0.5.10
+
+ ```bash
+ pip install git+https://github.com/speechbrain/speechbrain.git@develop
+ ```
+
  ### Usage
  The model can be used directly (without a language model) as follows:
  ```python