dragonSwing
commited on
Commit
·
e0ac1cb
1
Parent(s):
1c0c6e6
Update README.md
Browse files
README.md
CHANGED
@@ -46,8 +46,8 @@ model-index:
|
|
46 |
value: 4.04
|
47 |
---
|
48 |
# Wav2Vec2-Base-Vietnamese-270h
|
49 |
-
Fine-tuned Wav2Vec2 model on Vietnamese Speech Recognition task using about 270h labelled data combined from multiple datasets including [Common Voice](https://huggingface.co/datasets/common_voice), [VIVOS](https://huggingface.co/datasets/vivos), [VLSP2020](https://vlsp.org.vn/vlsp2020/eval/asr). The model was fine-tuned using SpeechBrain toolkit with a custom tokenizer. For a better experience, we encourage you to learn more about [SpeechBrain](https://speechbrain.github.io/).
|
50 |
-
When using this model, make sure that your speech input is sampled at 16kHz.
|
51 |
Please refer to [huggingface blog](https://huggingface.co/blog/fine-tune-wav2vec2-english) on how to fine-tune this model on a specific language.
|
52 |
|
53 |
### Benchmark WER result:
|
@@ -58,6 +58,13 @@ Please refer to [huggingface blog](https://huggingface.co/blog/fine-tune-wav2vec
|
|
58 |
|
59 |
The language model was trained using [Oscar](https://huggingface.co/datasets/oscar-corpus/OSCAR-2109) dataset on about 32GB of written text.
|
60 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
61 |
### Usage
|
62 |
The model can be used directly (without a language model) as follows:
|
63 |
```python
|
|
|
46 |
value: 4.04
|
47 |
---
|
48 |
# Wav2Vec2-Base-Vietnamese-270h
|
49 |
+
Fine-tuned Wav2Vec2 model on Vietnamese Speech Recognition task using about 270h labelled data combined from multiple datasets including [Common Voice](https://huggingface.co/datasets/common_voice), [VIVOS](https://huggingface.co/datasets/vivos), [VLSP2020](https://vlsp.org.vn/vlsp2020/eval/asr). The model was fine-tuned using SpeechBrain toolkit with a custom tokenizer. For a better experience, we encourage you to learn more about [SpeechBrain](https://speechbrain.github.io/).
|
50 |
+
When using this model, make sure that your speech input is sampled at 16kHz.
|
51 |
Please refer to [huggingface blog](https://huggingface.co/blog/fine-tune-wav2vec2-english) on how to fine-tune this model on a specific language.
|
52 |
|
53 |
### Benchmark WER result:
|
|
|
58 |
|
59 |
The language model was trained using [Oscar](https://huggingface.co/datasets/oscar-corpus/OSCAR-2109) dataset on about 32GB of written text.
|
60 |
|
61 |
+
### Install SpeechBrain
|
62 |
+
To use this model, you should install speechbrain from source. This is not required for speechbrain version > 0.5.10
|
63 |
+
|
64 |
+
```bash
|
65 |
+
pip install git+https://github.com/speechbrain/speechbrain.git@develop
|
66 |
+
```
|
67 |
+
|
68 |
### Usage
|
69 |
The model can be used directly (without a language model) as follows:
|
70 |
```python
|