Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,24 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
# Fine-tuned Whisper-V3-Turbo for Vietnamese ASR
|
2 |
|
3 |
This project involves fine-tuning the Whisper-V3-Turbo model to improve its performance for Automatic Speech Recognition (ASR) in the Vietnamese language. The model was trained for 240 hours using a single Nvidia A6000 GPU.
|
@@ -66,17 +87,17 @@ To use the fine-tuned model, follow the steps below:
|
|
66 |
|
67 |
This project would not be possible without the following datasets:
|
68 |
|
69 |
-
- [capleaf/viVoice](https://
|
70 |
-
- [NhutP/VSV-1100](https://
|
71 |
-
- [doof-ferb/fpt_fosd](https://
|
72 |
-
- [doof-ferb/infore1_25hours](https://
|
73 |
-
- [google/fleurs](https://
|
74 |
-
- [doof-ferb/LSVSC](https://
|
75 |
-
- [quocanh34/viet_vlsp](https://
|
76 |
-
- [linhtran92/viet_youtube_asr_corpus_v2](https://
|
77 |
-
- [doof-ferb/infore2_audiobooks](https://
|
78 |
-
- [linhtran92/viet_bud500](https://
|
79 |
|
80 |
## License
|
81 |
|
82 |
-
This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.
|
|
|
1 |
+
---
|
2 |
+
license: mit
|
3 |
+
datasets:
|
4 |
+
- capleaf/viVoice
|
5 |
+
- NhutP/VSV-1100
|
6 |
+
- doof-ferb/fpt_fosd
|
7 |
+
- doof-ferb/infore1_25hours
|
8 |
+
- google/fleurs
|
9 |
+
- doof-ferb/LSVSC
|
10 |
+
- quocanh34/viet_vlsp
|
11 |
+
- linhtran92/viet_youtube_asr_corpus_v2
|
12 |
+
- doof-ferb/infore2_audiobooks
|
13 |
+
- linhtran92/viet_bud500
|
14 |
+
language:
|
15 |
+
- vi
|
16 |
+
metrics:
|
17 |
+
- wer
|
18 |
+
base_model:
|
19 |
+
- openai/whisper-large-v3-turbo
|
20 |
+
library_name: transformers
|
21 |
+
---
|
22 |
# Fine-tuned Whisper-V3-Turbo for Vietnamese ASR
|
23 |
|
24 |
This project involves fine-tuning the Whisper-V3-Turbo model to improve its performance for Automatic Speech Recognition (ASR) in the Vietnamese language. The model was trained for 240 hours using a single Nvidia A6000 GPU.
|
|
|
87 |
|
88 |
This project would not be possible without the following datasets:
|
89 |
|
90 |
+
- [capleaf/viVoice](https://huggingface.co/datasets/capleaf/viVoice)
|
91 |
+
- [NhutP/VSV-1100](https://huggingface.co/datasets/nhutp/vsv-1100)
|
92 |
+
- [doof-ferb/fpt_fosd](https://huggingface.co/datasets/doof-ferb/fpt_fosd)
|
93 |
+
- [doof-ferb/infore1_25hours](https://huggingface.co/datasets/doof-ferb/infore1_25hours)
|
94 |
+
- [google/fleurs](https://huggingface.co/datasets/google/fleurs)
|
95 |
+
- [doof-ferb/LSVSC](https://huggingface.co/datasets/doof-ferb/LSVSC)
|
96 |
+
- [quocanh34/viet_vlsp](https://huggingface.co/datasets/quocanh34/viet-vlsp)
|
97 |
+
- [linhtran92/viet_youtube_asr_corpus_v2](https://huggingface.co/datasets/linhtran92/viet_youtube_asr_corpus_v2)
|
98 |
+
- [doof-ferb/infore2_audiobooks](https://huggingface.co/datasets/doof-ferb/infore2_audiobooks/)
|
99 |
+
- [linhtran92/viet_bud500](https://huggingface.co/datasets/linhtran92/viet_bud500)
|
100 |
|
101 |
## License
|
102 |
|
103 |
+
This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.
|