pere commited on
Commit
7b1c102
1 Parent(s): 62929be
README.md CHANGED
@@ -1,3 +1,68 @@
1
  ---
 
 
2
  license: apache-2.0
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ language:
3
+ - no
4
  license: apache-2.0
5
+ tags:
6
+ - whisper-event
7
+ - norwegian
8
+ datasets:
9
+ - NbAiLab/NCC_S
10
+ - NbAiLab/NPSC
11
+ - NbAiLab/NST
12
+ metrics:
13
+ - wer
14
+ model-index:
15
+ - name: Whisper Large Norwegian Bokmål
16
+ results:
17
+ - task:
18
+ name: Automatic Speech Recognition
19
+ type: automatic-speech-recognition
20
+ dataset:
21
+ name: FLEURS
22
+ type: google/fleurs
23
+ config: nb_no
24
+ split: test
25
+ args: nb_no
26
+ metrics:
27
+ - name: Wer
28
+ type: wer
29
+ value: 11.91
30
  ---
31
+
32
+ # Whisper Tiny Norwegian Bokmål
33
+
34
+ This model is a fine-tuned version of [openai/whisper-medium](https://huggingface.co/openai/whisper-medium) trained on several datasets.
35
+
36
+ It is currently in the middle of a large training. Currently achieves the following results on the evaluation set:
37
+ - Loss: 0.2751
38
+ - Wer: 11.91
39
+
40
+ ## Model description
41
+
42
+ The model is trained on a large corpus of roughly 5.000 hours of voice. The sources are subtitles from the Norwegian broadcaster NRK, transcribed speeches from the Norwegian parliament and voice recordings from Norsk Språkteknologi.
43
+
44
+ ## Intended uses & limitations
45
+
46
+ The model will be free for everyone to use when it is finished.
47
+
48
+ ### Training hyperparameters
49
+
50
+ The following hyperparameters were used during training:
51
+ - learning_rate: 3e-06
52
+ - train_batch_size: 64
53
+ - gradient_accumulation_steps: 2
54
+ - eval_batch_size: 32
55
+ - seed: 42
56
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
57
+ - lr_scheduler_type: constant with warmpu
58
+ - lr_scheduler_warmup_steps: 100
59
+ - training_steps: 10.000 (currently 1.000)
60
+ - mixed_precision_training: fp16
61
+ - deepspee: true
62
+
63
+ ### Training results
64
+ See [Tensorboad Metrics](https://huggingface.co/NbAiLab/whisper-large-v2-nob/tensorboard)
65
+
66
+
67
+
68
+
runs/Dec15_10-01-04_dante/events.out.tfevents.1671094918.dante.2735709.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a75e63ed2765f93eb467e359b58d65d1f7b71fc9455e049231e3aca37b75daa6
3
- size 10880
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3191c89b81a026aafe64733678af918ad6d7c9783a7e1da9e036f1bcfb5b70f6
3
+ size 15747