suzii commited on
Commit
e05c588
·
verified ·
1 Parent(s): 1e9075e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +32 -11
README.md CHANGED
@@ -1,3 +1,24 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  # Fine-tuned Whisper-V3-Turbo for Vietnamese ASR
2
 
3
  This project involves fine-tuning the Whisper-V3-Turbo model to improve its performance for Automatic Speech Recognition (ASR) in the Vietnamese language. The model was trained for 240 hours using a single Nvidia A6000 GPU.
@@ -66,17 +87,17 @@ To use the fine-tuned model, follow the steps below:
66
 
67
  This project would not be possible without the following datasets:
68
 
69
- - [capleaf/viVoice](https://www.kaggle.com/datasets/capleaf/viVoice)
70
- - [NhutP/VSV-1100](https://www.kaggle.com/datasets/nhutp/vsv-1100)
71
- - [doof-ferb/fpt_fosd](https://www.kaggle.com/datasets/doof-ferb/fpt_fosd)
72
- - [doof-ferb/infore1_25hours](https://www.kaggle.com/datasets/doof-ferb/infore1-25hours)
73
- - [google/fleurs](https://www.kaggle.com/datasets/google/fleurs)
74
- - [doof-ferb/LSVSC](https://www.kaggle.com/datasets/doof-ferb/LSVSC)
75
- - [quocanh34/viet_vlsp](https://www.kaggle.com/datasets/quocanh34/viet-vlsp)
76
- - [linhtran92/viet_youtube_asr_corpus_v2](https://www.kaggle.com/datasets/linhtran92/viet-youtube-asr-corpus-v2)
77
- - [doof-ferb/infore2_audiobooks](https://www.kaggle.com/datasets/doof-ferb/infore2-audiobooks)
78
- - [linhtran92/viet_bud500](https://www.kaggle.com/datasets/linhtran92/viet-bud500)
79
 
80
  ## License
81
 
82
- This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.
 
1
+ ---
2
+ license: mit
3
+ datasets:
4
+ - capleaf/viVoice
5
+ - NhutP/VSV-1100
6
+ - doof-ferb/fpt_fosd
7
+ - doof-ferb/infore1_25hours
8
+ - google/fleurs
9
+ - doof-ferb/LSVSC
10
+ - quocanh34/viet_vlsp
11
+ - linhtran92/viet_youtube_asr_corpus_v2
12
+ - doof-ferb/infore2_audiobooks
13
+ - linhtran92/viet_bud500
14
+ language:
15
+ - vi
16
+ metrics:
17
+ - wer
18
+ base_model:
19
+ - openai/whisper-large-v3-turbo
20
+ library_name: transformers
21
+ ---
22
  # Fine-tuned Whisper-V3-Turbo for Vietnamese ASR
23
 
24
  This project involves fine-tuning the Whisper-V3-Turbo model to improve its performance for Automatic Speech Recognition (ASR) in the Vietnamese language. The model was trained for 240 hours using a single Nvidia A6000 GPU.
 
87
 
88
  This project would not be possible without the following datasets:
89
 
90
+ - [capleaf/viVoice](https://huggingface.co/datasets/capleaf/viVoice)
91
+ - [NhutP/VSV-1100](https://huggingface.co/datasets/nhutp/vsv-1100)
92
+ - [doof-ferb/fpt_fosd](https://huggingface.co/datasets/doof-ferb/fpt_fosd)
93
+ - [doof-ferb/infore1_25hours](https://huggingface.co/datasets/doof-ferb/infore1_25hours)
94
+ - [google/fleurs](https://huggingface.co/datasets/google/fleurs)
95
+ - [doof-ferb/LSVSC](https://huggingface.co/datasets/doof-ferb/LSVSC)
96
+ - [quocanh34/viet_vlsp](https://huggingface.co/datasets/quocanh34/viet-vlsp)
97
+ - [linhtran92/viet_youtube_asr_corpus_v2](https://huggingface.co/datasets/linhtran92/viet_youtube_asr_corpus_v2)
98
+ - [doof-ferb/infore2_audiobooks](https://huggingface.co/datasets/doof-ferb/infore2_audiobooks/)
99
+ - [linhtran92/viet_bud500](https://huggingface.co/datasets/linhtran92/viet_bud500)
100
 
101
  ## License
102
 
103
+ This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.