Sercan committed on
Commit
1e24d31
1 Parent(s): 0c9c0e9

Create README.md

Files changed (1)
  1. README.md +49 -0
README.md ADDED
@@ -0,0 +1,49 @@
+ # distil-whisper-large-v3-tr
+
+ ## Model Description
+
+ `distil-whisper-large-v3-tr` is a distilled version of the Whisper model, fine-tuned for Turkish speech recognition. It was trained with a combined cross-entropy and KL-divergence objective and reaches a word error rate of about 14.3% on the test split of the pseudo-labelled Common Voice 17.0 Turkish dataset.
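A minimal inference sketch, assuming the checkpoint is published on the Hugging Face Hub and loads through the `transformers` automatic-speech-recognition pipeline like other Whisper checkpoints. The repository namespace in `model_id`, the helper name `transcribe`, and the audio path are placeholders for illustration; none of them is given in this README.

```python
def transcribe(audio_path, model_id="<namespace>/distil-whisper-large-v3-tr"):
    """Transcribe one Turkish audio file and return the recognised text.

    `model_id` is a placeholder: replace <namespace> with the real Hub account.
    """
    # Imported lazily so this module can be loaded without transformers installed.
    from transformers import pipeline

    asr = pipeline("automatic-speech-recognition", model=model_id)
    # Assumption: the checkpoint keeps Whisper's multilingual language tokens,
    # so Turkish can be requested explicitly instead of being auto-detected.
    result = asr(
        audio_path,
        generate_kwargs={"language": "turkish", "task": "transcribe"},
    )
    return result["text"]
```

Calling `transcribe("sample.wav")` with a real repository id would download the model on first use; `sample.wav` is likewise a hypothetical file name.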
+
+ ## Training and Evaluation Metrics
+
+ Training and evaluation were logged with `wandb`. The logged results are:
+
+ ### Evaluation Metrics
+
+ - **Cross-Entropy Loss (eval/ce_loss):** 0.53218
+ - **Epoch (eval/epoch):** 28
+ - **KL Loss (eval/kl_loss):** 0.34883
+ - **Total Loss (eval/loss):** 0.77457
+ - **Evaluation Time (eval/time):** 397.1784 seconds
+ - **Word Error Rate (eval/wer):** 14.43288%
+ - **Orthographic Word Error Rate (eval/wer_ortho):** 21.55298%
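The two error rates above can be illustrated with a small sketch. It assumes the usual convention in Whisper-style evaluation: word error rate is the word-level edit distance divided by the number of reference words, the orthographic variant is computed on raw text, and the plain variant is computed after text normalisation. The `simple_normalize` function is a hypothetical stand-in (lowercasing and punctuation removal), not the normaliser used for the numbers in this README.

```python
import re


def edit_distance(ref_words, hyp_words):
    """Word-level Levenshtein distance (substitutions, insertions, deletions)."""
    prev = list(range(len(hyp_words) + 1))
    for i, ref_word in enumerate(ref_words, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp_words, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate in percent."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return 100.0 * edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def simple_normalize(text):
    """Hypothetical normaliser: lowercase and drop punctuation."""
    return re.sub(r"[^\w\s]", "", text.lower())


reference = "Bugün hava çok güzel."
hypothesis = "bugün hava güzel"

wer_ortho = wer(reference, hypothesis)  # raw text: casing and punctuation count
wer_norm = wer(simple_normalize(reference), simple_normalize(hypothesis))
print(f"wer_ortho={wer_ortho:.1f}%  wer={wer_norm:.1f}%")
```

In this toy pair the only real recognition error is the missing word, yet the casing and punctuation differences also count against the orthographic score. That pattern is in line with `eval/wer_ortho` being higher than `eval/wer`.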
+
+ ### Training Metrics
+
+ - **Cross-Entropy Loss (train/ce_loss):** 0.04695
+ - **Epoch (train/epoch):** 28
+ - **KL Loss (train/kl_loss):** 0.24143
+ - **Learning Rate (train/learning_rate):** 0.0001
+ - **Total Loss (train/loss):** 0.27899
+ - **Training Time (train/time):** 12426.92106 seconds
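The logged totals are not the plain sum of their components: for evaluation, 0.53218 + 0.34883 = 0.88101, not 0.77457. They are consistent with a weighted sum that uses 0.8 for the cross-entropy term and 1.0 for the KL term. The check below is a sketch under that assumption; the weights are inferred from the numbers in this README and are not stated in it.

```python
CE_WEIGHT = 0.8  # assumed weight, inferred from the logged values
KL_WEIGHT = 1.0  # assumed weight, inferred from the logged values


def total_loss(ce_loss, kl_loss):
    """Weighted distillation loss under the assumed weights."""
    return CE_WEIGHT * ce_loss + KL_WEIGHT * kl_loss


# Component losses as reported in the metrics lists.
eval_total = total_loss(0.53218, 0.34883)
train_total = total_loss(0.04695, 0.24143)

print(f"eval:  {eval_total:.5f} (logged 0.77457)")
print(f"train: {train_total:.5f} (logged 0.27899)")
```

Both reconstructed totals agree with the logged `eval/loss` and `train/loss` to within rounding.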
+
+ ## Run History
+
+ ### Overall Metrics
+
+ - **Real-Time Factor (all/rtf):** 392.23396
+ - **Word Error Rate (all/wer):** 14.33829%
+
+ ### Common Voice 17.0 Turkish Pseudo-Labelled Dataset
+
+ - **Real-Time Factor (common_voice_17_0_tr_pseudo_labelled/test/rtf):** 392.23396
+ - **Word Error Rate (common_voice_17_0_tr_pseudo_labelled/test/wer):** 14.33829%
+
+ ## Author
+
+ **Sercan Çepni**
+ Email: turkelf@gmail.com
+
+ ---
+
+ For any questions or further information, please feel free to contact the author.