emilios committed on
Commit
5c23c43
•
1 Parent(s): 1303559

[INFO|trainer.py:2956] 2022-12-12 12:04:03,940 >> ***** Running Evaluation *****

[INFO|trainer.py:2958] 2022-12-12 12:04:03,940 >> Num examples = 1696
[INFO|trainer.py:2961] 2022-12-12 12:04:03,941 >> Batch size = 16
100%|██████████| 106/106 [18:20<00:00, 10.38s/it]
***** eval metrics *****
epoch = 12.44
eval_loss = 0.3801
eval_runtime = 0:18:30.86
eval_samples_per_second = 1.527
eval_steps_per_second = 0.095
eval_wer = 11.1999

Mon Dec 12 12:32:15 UTC 2022
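
The throughput figures in the evaluation log can be cross-checked from the example count, the step count, and the runtime. The following is a minimal Python sketch of that check; the helper names `parse_runtime` and `throughput` are illustrative and are not part of `trainer.py`.

```python
from typing import Tuple


def parse_runtime(text: str) -> float:
    """Convert an 'H:MM:SS.ss' runtime string, as printed in the log, to seconds."""
    hours, minutes, seconds = text.split(":")
    return int(hours) * 3600 + int(minutes) * 60 + float(seconds)


def throughput(num_examples: int, num_steps: int, runtime_s: float) -> Tuple[float, float]:
    """Return (samples per second, steps per second), each rounded to 3 decimals."""
    return round(num_examples / runtime_s, 3), round(num_steps / runtime_s, 3)


# Values copied from the log: eval_runtime, Num examples, and the 106 progress-bar steps.
runtime_s = parse_runtime("0:18:30.86")
samples_per_s, steps_per_s = throughput(1696, 106, runtime_s)
print(runtime_s, samples_per_s, steps_per_s)
```

With the logged values this yields about 1110.86 s, 1.527 samples/s and 0.095 steps/s, which agrees with the reported `eval_samples_per_second` and `eval_steps_per_second`.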

.ipynb_checkpoints/README-checkpoint.md DELETED
@@ -1,85 +0,0 @@
- ---
- language:
- - el
- license: apache-2.0
- tags:
- - whisper-event
- - generated_from_trainer
- - hf-asr-leaderboard
- datasets:
- - mozilla-foundation/common_voice_11_0
- - google/fleurs
- metrics:
- - wer
- model-index:
- - name: Whisper Medium El Greco
- results:
- - task:
- name: Automatic Speech Recognition
- type: automatic-speech-recognition
- dataset:
- name: Common Voice 11.0
- type: mozilla-foundation/common_voice_11_0
- config: el
- split: test
- args: el
- metrics:
- - name: Wer
- type: wer
- value: 13.976597325408619
- ---
-
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
- should probably proofread and complete it, then remove this comment. -->
-
- # Whisper Medium El - Greek One
-
- This model is a fine-tuned version of [openai/whisper-medium](https://huggingface.co/openai/whisper-medium) on the Common Voice 11.0 dataset.
- It achieves the following results on the evaluation set:
- - Loss: 0.4707
- - Wer: 13.9766
-
- ## Model description
-
- More information needed
-
- ## Intended uses & limitations
-
- More information needed
-
- ## Training and evaluation data
-
- More information needed
-
- ## Training procedure
-
- ### Training hyperparameters
-
- The following hyperparameters were used during training:
- - learning_rate: 1e-05
- - train_batch_size: 20
- - eval_batch_size: 8
- - seed: 42
- - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- - lr_scheduler_type: linear
- - lr_scheduler_warmup_steps: 500
- - training_steps: 5000
- - mixed_precision_training: Native AMP
-
- ### Training results
-
- | Training Loss | Epoch | Step | Validation Loss | Wer |
- |:-------------:|:-----:|:----:|:---------------:|:-------:|
- | 0.0036 | 10.01 | 1000 | 0.4461 | 15.9082 |
- | 0.0001 | 20.02 | 2000 | 0.4250 | 14.5245 |
- | 0.0 | 31.0 | 3000 | 0.4526 | 14.1902 |
- | 0.0 | 41.01 | 4000 | 0.4657 | 14.1252 |
- | 0.0 | 52.0 | 5000 | 0.4707 | 13.9766 |
-
-
- ### Framework versions
-
- - Transformers 4.26.0.dev0
- - Pytorch 1.13.0+cu117
- - Datasets 2.7.1.dev0
- - Tokenizers 0.13.2
README.md CHANGED
@@ -3,30 +3,38 @@ language:
  - el
  license: apache-2.0
  tags:
- - hf-asr-leaderboard, whisper-medium, mozilla-foundation/common_voice_11_0, greek,
- whisper-event
+ - whisper-event
  - generated_from_trainer
  datasets:
- - mozilla-foundation/common_voice_11_0
+ - mozilla-foundation/common_voice_11_0,google/fleurs
+ metrics:
+ - wer
  model-index:
- - name: Whisper Medium El Greco Greek
- results: []
+ - name: Whisper Medium El Greco
+ results:
+ - task:
+ name: Automatic Speech Recognition
+ type: automatic-speech-recognition
+ dataset:
+ name: mozilla-foundation/common_voice_11_0,google/fleurs el,el_gr
+ type: mozilla-foundation/common_voice_11_0,google/fleurs
+ config: null
+ split: None
+ metrics:
+ - name: Wer
+ type: wer
+ value: 11.199851411589897
  ---

  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
  should probably proofread and complete it, then remove this comment. -->

- # Whisper Medium El Greco Greek
+ # Whisper Medium El Greco

- This model is a fine-tuned version of [openai/whisper-medium](https://huggingface.co/openai/whisper-medium) on the Common Voice 11.0 dataset.
+ This model is a fine-tuned version of [emilios/whisper-medium-el](https://huggingface.co/emilios/whisper-medium-el) on the mozilla-foundation/common_voice_11_0,google/fleurs el,el_gr dataset.
  It achieves the following results on the evaluation set:
- - eval_loss: 0.3924
- - eval_wer: 12.4443
- - eval_runtime: 1211.1631
- - eval_samples_per_second: 1.4
- - eval_steps_per_second: 0.088
- - epoch: 4.04
- - step: 5000
+ - Loss: 0.3801
+ - Wer: 11.1999

  ## Model description

@@ -55,6 +63,17 @@ The following hyperparameters were used during training:
  - training_steps: 5000
  - mixed_precision_training: Native AMP

+ ### Training results
+
+ | Training Loss | Epoch | Step | Validation Loss | Wer |
+ |:-------------:|:-----:|:----:|:---------------:|:-------:|
+ | 0.0176 | 2.49 | 1000 | 0.2945 | 12.6114 |
+ | 0.0064 | 4.98 | 2000 | 0.3423 | 12.2307 |
+ | 0.0022 | 7.46 | 3000 | 0.3632 | 11.5899 |
+ | 0.0014 | 9.95 | 4000 | 0.3788 | 11.2556 |
+ | 0.0008 | 12.44 | 5000 | 0.3801 | 11.1999 |
+
+
  ### Framework versions

  - Transformers 4.26.0.dev0
all_results.json ADDED
@@ -0,0 +1,12 @@
+ {
+ "epoch": 12.44,
+ "eval_loss": 0.38012266159057617,
+ "eval_runtime": 1110.8612,
+ "eval_samples_per_second": 1.527,
+ "eval_steps_per_second": 0.095,
+ "eval_wer": 11.199851411589897,
+ "train_loss": 0.020854657278954983,
+ "train_runtime": 27142.1177,
+ "train_samples_per_second": 5.895,
+ "train_steps_per_second": 0.184
+ }
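
The fields added in `all_results.json` can be read back for a quick summary of the run. This is a rough sketch, assuming the file sits in the current working directory; the `summarize` helper and its output keys are hypothetical and do not come from the repository.

```python
import json
from pathlib import Path


def summarize(results: dict) -> dict:
    """Condense the raw metrics into rounded, human-friendly values."""
    return {
        "eval_wer": round(results["eval_wer"], 2),
        "eval_loss": round(results["eval_loss"], 4),
        "train_hours": round(results["train_runtime"] / 3600, 2),   # runtimes are in seconds
        "eval_minutes": round(results["eval_runtime"] / 60, 1),
    }


if __name__ == "__main__":
    path = Path("all_results.json")  # assumed location of the file from this commit
    if path.exists():
        print(summarize(json.loads(path.read_text())))
```

For the values in this commit, the training runtime of 27142.1177 s corresponds to roughly 7.5 hours and the evaluation runtime of 1110.8612 s to about 18.5 minutes.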
config.json CHANGED
@@ -34,7 +34,6 @@
  "num_mel_bins": 80,
  "pad_token_id": 50257,
  "scale_embedding": false,
- "suppress_tokens": [],
  "torch_dtype": "float32",
  "transformers_version": "4.26.0.dev0",
  "use_cache": false,
e3_interleaving-cl.ipynb CHANGED
@@ -120,106 +120,68 @@
120
  "output_type": "stream",
121
  "text": [
122
  "Defaulting to user installation because normal site-packages is not writeable\n",
123
- "Requirement already satisfied: pip in ./.local/lib/python3.8/site-packages (22.3.1)\n",
124
- "Defaulting to user installation because normal site-packages is not writeable\n",
125
- "Requirement already satisfied: numpy<1.23.0 in ./.local/lib/python3.8/site-packages (1.22.4)\n",
126
- "Defaulting to user installation because normal site-packages is not writeable\n",
127
- "Collecting torch\n",
128
- " Using cached torch-1.13.0-cp38-cp38-manylinux1_x86_64.whl (890.2 MB)\n",
129
- "Collecting torchaudio\n",
130
- " Using cached torchaudio-0.13.0-cp38-cp38-manylinux1_x86_64.whl (4.2 MB)\n",
131
- "Collecting torchvision\n",
132
- " Downloading torchvision-0.14.0-cp38-cp38-manylinux1_x86_64.whl (24.3 MB)\n",
133
- "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m24.3/24.3 MB\u001b[0m \u001b[31m103.1 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m00:01\u001b[0m00:01\u001b[0m\n",
134
- "\u001b[?25hCollecting nvidia-cuda-runtime-cu11==11.7.99\n",
135
- " Using cached nvidia_cuda_runtime_cu11-11.7.99-py3-none-manylinux1_x86_64.whl (849 kB)\n",
136
- "Collecting typing-extensions\n",
137
- " Using cached typing_extensions-4.4.0-py3-none-any.whl (26 kB)\n",
138
- "Collecting nvidia-cuda-nvrtc-cu11==11.7.99\n",
139
- " Using cached nvidia_cuda_nvrtc_cu11-11.7.99-2-py3-none-manylinux1_x86_64.whl (21.0 MB)\n",
140
- "Collecting nvidia-cudnn-cu11==8.5.0.96\n",
141
- " Using cached nvidia_cudnn_cu11-8.5.0.96-2-py3-none-manylinux1_x86_64.whl (557.1 MB)\n",
142
- "Collecting nvidia-cublas-cu11==11.10.3.66\n",
143
- " Using cached nvidia_cublas_cu11-11.10.3.66-py3-none-manylinux1_x86_64.whl (317.1 MB)\n",
144
- "Collecting setuptools\n",
145
- " Using cached setuptools-65.6.3-py3-none-any.whl (1.2 MB)\n",
146
- "Collecting wheel\n",
147
- " Using cached wheel-0.38.4-py3-none-any.whl (36 kB)\n",
148
- "Collecting numpy\n",
149
- " Downloading numpy-1.24.0rc2-cp38-cp38-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (17.3 MB)\n",
150
- "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m17.3/17.3 MB\u001b[0m \u001b[31m100.5 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m00:01\u001b[0m00:01\u001b[0m\n",
151
- "\u001b[?25hCollecting pillow!=8.3.*,>=5.3.0\n",
152
- " Downloading Pillow-9.3.0-cp38-cp38-manylinux_2_28_x86_64.whl (3.3 MB)\n",
153
- "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m3.3/3.3 MB\u001b[0m \u001b[31m145.5 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
154
- "\u001b[?25hCollecting requests\n",
155
- " Using cached requests-2.28.1-py3-none-any.whl (62 kB)\n",
156
- "Collecting certifi>=2017.4.17\n",
157
- " Downloading certifi-2022.12.7-py3-none-any.whl (155 kB)\n",
158
- "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m155.3/155.3 kB\u001b[0m \u001b[31m44.0 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
159
- "\u001b[?25hCollecting charset-normalizer<3,>=2\n",
160
- " Using cached charset_normalizer-2.1.1-py3-none-any.whl (39 kB)\n",
161
- "Collecting urllib3<1.27,>=1.21.1\n",
162
- " Downloading urllib3-1.26.13-py2.py3-none-any.whl (140 kB)\n",
163
- "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m140.6/140.6 kB\u001b[0m \u001b[31m38.5 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
164
- "\u001b[?25hCollecting idna<4,>=2.5\n",
165
- " Downloading idna-3.4-py3-none-any.whl (61 kB)\n",
166
- "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m61.5/61.5 kB\u001b[0m \u001b[31m16.0 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
167
- "\u001b[?25hInstalling collected packages: wheel, urllib3, typing-extensions, setuptools, pillow, nvidia-cuda-nvrtc-cu11, numpy, idna, charset-normalizer, certifi, requests, nvidia-cuda-runtime-cu11, nvidia-cublas-cu11, nvidia-cudnn-cu11, torch, torchvision, torchaudio\n",
168
- " Attempting uninstall: typing-extensions\n",
169
- " Found existing installation: typing_extensions 4.4.0\n",
170
- " Uninstalling typing_extensions-4.4.0:\n",
171
- " Successfully uninstalled typing_extensions-4.4.0\n",
172
- " Attempting uninstall: nvidia-cuda-nvrtc-cu11\n",
173
- " Found existing installation: nvidia-cuda-nvrtc-cu11 11.7.99\n",
174
- " Uninstalling nvidia-cuda-nvrtc-cu11-11.7.99:\n",
175
- " Successfully uninstalled nvidia-cuda-nvrtc-cu11-11.7.99\n",
176
- " Attempting uninstall: numpy\n",
177
- " Found existing installation: numpy 1.22.4\n",
178
- " Uninstalling numpy-1.22.4:\n",
179
- " Successfully uninstalled numpy-1.22.4\n",
180
- " Attempting uninstall: charset-normalizer\n",
181
- " Found existing installation: charset-normalizer 2.1.1\n",
182
- " Uninstalling charset-normalizer-2.1.1:\n",
183
- " Successfully uninstalled charset-normalizer-2.1.1\n",
184
- " Attempting uninstall: requests\n",
185
- " Found existing installation: requests 2.28.1\n",
186
- " Uninstalling requests-2.28.1:\n",
187
- " Successfully uninstalled requests-2.28.1\n",
188
- " Attempting uninstall: nvidia-cuda-runtime-cu11\n",
189
- " Found existing installation: nvidia-cuda-runtime-cu11 11.7.99\n",
190
- " Uninstalling nvidia-cuda-runtime-cu11-11.7.99:\n",
191
- " Successfully uninstalled nvidia-cuda-runtime-cu11-11.7.99\n",
192
- " Attempting uninstall: nvidia-cublas-cu11\n",
193
- " Found existing installation: nvidia-cublas-cu11 11.10.3.66\n",
194
- " Uninstalling nvidia-cublas-cu11-11.10.3.66:\n",
195
- " Successfully uninstalled nvidia-cublas-cu11-11.10.3.66\n",
196
- " Attempting uninstall: nvidia-cudnn-cu11\n",
197
- " Found existing installation: nvidia-cudnn-cu11 8.5.0.96\n",
198
- " Uninstalling nvidia-cudnn-cu11-8.5.0.96:\n",
199
- " Successfully uninstalled nvidia-cudnn-cu11-8.5.0.96\n",
200
- " Attempting uninstall: torch\n",
201
- " Found existing installation: torch 1.13.0\n",
202
- " Uninstalling torch-1.13.0:\n",
203
- " Successfully uninstalled torch-1.13.0\n",
204
- " Attempting uninstall: torchaudio\n",
205
- " Found existing installation: torchaudio 0.13.0\n",
206
- " Uninstalling torchaudio-0.13.0:\n",
207
- " Successfully uninstalled torchaudio-0.13.0\n",
208
- "\u001b[31mERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts.\n",
209
- "launchpadlib 1.10.13 requires testresources, which is not installed.\n",
210
- "pandas-profiling 3.4.0 requires numpy<1.24,>=1.16.0, but you have numpy 1.24.0rc2 which is incompatible.\n",
211
- "numba 0.56.4 requires numpy<1.24,>=1.18, but you have numpy 1.24.0rc2 which is incompatible.\u001b[0m\u001b[31m\n",
212
- "\u001b[0mSuccessfully installed certifi-2022.12.7 charset-normalizer-2.1.1 idna-3.4 numpy-1.24.0rc2 nvidia-cublas-cu11-11.10.3.66 nvidia-cuda-nvrtc-cu11-11.7.99 nvidia-cuda-runtime-cu11-11.7.99 nvidia-cudnn-cu11-8.5.0.96 pillow-9.3.0 requests-2.28.1 setuptools-65.6.3 torch-1.13.0 torchaudio-0.13.0 torchvision-0.14.0 typing-extensions-4.4.0 urllib3-1.26.13 wheel-0.38.4\n"
213
  ]
214
  }
215
  ],
216
  "source": [
217
  "!pip3 install --upgrade pip\n",
218
- "!pip3 install \"numpy<1.23.0\"\n",
 
 
219
  "\n",
220
- "!pip3 install --pre torch torchaudio torchvision --force-reinstall\n",
221
  "\n",
222
- "!pip3 install bitsandbytes\n",
223
  "\n",
224
  "\n",
225
  "#!pip3 install --pre torch torchaudio --extra-index-url https://download.pytorch.org/whl/nightly/cu116\n",
@@ -269,7 +231,7 @@
269
  "name": "stdout",
270
  "output_type": "stream",
271
  "text": [
272
- "Thu Dec 8 18:45:37 2022 \n",
273
  "+-----------------------------------------------------------------------------+\n",
274
  "| NVIDIA-SMI 515.65.01 Driver Version: 515.65.01 CUDA Version: 11.7 |\n",
275
  "|-------------------------------+----------------------+----------------------+\n",
@@ -278,7 +240,7 @@
278
  "| | | MIG M. |\n",
279
  "|===============================+======================+======================|\n",
280
  "| 0 NVIDIA A100-SXM... On | 00000000:06:00.0 Off | 0 |\n",
281
- "| N/A 31C P0 47W / 400W | 0MiB / 40960MiB | 0% Default |\n",
282
  "| | | Disabled |\n",
283
  "+-------------------------------+----------------------+----------------------+\n",
284
  " \n",
@@ -330,40 +292,40 @@
330
  "output_type": "stream",
331
  "text": [
332
  "Get:1 https://nvidia.github.io/libnvidia-container/stable/ubuntu18.04/amd64 InRelease [1484 B]\n",
333
- "Hit:2 https://nvidia.github.io/nvidia-container-runtime/stable/ubuntu18.04/amd64 InRelease\n",
334
- "Hit:3 https://download.docker.com/linux/ubuntu focal InRelease \n",
335
- "Hit:4 http://archive.lambdalabs.com/ubuntu focal InRelease \n",
336
- "Hit:5 https://packages.cloud.google.com/apt cloud-sdk InRelease \n",
337
- "Hit:6 http://security.ubuntu.com/ubuntu focal-security InRelease \n",
338
- "Ign:7 http://ppa.launchpad.net/jonathonf/ffmpeg-4/ubuntu focal InRelease \n",
339
- "Hit:8 http://archive.ubuntu.com/ubuntu focal InRelease \n",
340
- "Hit:9 https://packages.microsoft.com/repos/azure-cli focal InRelease \n",
341
- "Hit:10 http://archive.ubuntu.com/ubuntu focal-updates InRelease \n",
342
- "Hit:11 https://pkg.cloudflare.com/cloudflared focal InRelease \n",
343
- "Hit:12 http://archive.ubuntu.com/ubuntu focal-backports InRelease \n",
344
- "Err:13 http://ppa.launchpad.net/jonathonf/ffmpeg-4/ubuntu focal Release \n",
345
  " 404 Not Found [IP: 185.125.190.52 80]\n",
346
- "Hit:14 https://ppa.launchpadcontent.net/deadsnakes/ppa/ubuntu focal InRelease \n",
347
- "Reading package lists... Done\n",
 
 
 
 
348
  "E: The repository 'http://ppa.launchpad.net/jonathonf/ffmpeg-4/ubuntu focal Release' does not have a Release file.\n",
349
  "N: Updating from such a repository can't be done securely, and is therefore disabled by default.\n",
350
  "N: See apt-secure(8) manpage for repository creation and user configuration details.\n",
351
- "Get:1 https://nvidia.github.io/libnvidia-container/stable/ubuntu18.04/amd64 InRelease [1484 B]\n",
352
- "Hit:2 https://download.docker.com/linux/ubuntu focal InRelease \u001b[0m\u001b[33m\n",
353
  "Hit:3 https://nvidia.github.io/nvidia-container-runtime/stable/ubuntu18.04/amd64 InRelease\n",
354
- "Hit:4 https://packages.cloud.google.com/apt cloud-sdk InRelease \u001b[0m\u001b[33m\u001b[33m\n",
355
- "Hit:5 http://archive.lambdalabs.com/ubuntu focal InRelease \u001b[0m\n",
356
- "Hit:6 http://archive.ubuntu.com/ubuntu focal InRelease \u001b[0m\u001b[33m\n",
357
- "Hit:7 http://security.ubuntu.com/ubuntu focal-security InRelease \u001b[0m\n",
358
- "Ign:8 http://ppa.launchpad.net/jonathonf/ffmpeg-4/ubuntu focal InRelease \u001b[0m\n",
359
- "Hit:9 https://packages.microsoft.com/repos/azure-cli focal InRelease \u001b[0m\u001b[33m\n",
360
- "Hit:10 http://archive.ubuntu.com/ubuntu focal-updates InRelease \u001b[0m\n",
361
- "Hit:11 https://pkg.cloudflare.com/cloudflared focal InRelease \u001b[0m\u001b[33m\n",
362
- "Hit:12 http://archive.ubuntu.com/ubuntu focal-backports InRelease \u001b[0m \u001b[0m\u001b[33m\n",
363
- "Hit:13 https://ppa.launchpadcontent.net/deadsnakes/ppa/ubuntu focal InRelease\n",
364
- "Err:14 http://ppa.launchpad.net/jonathonf/ffmpeg-4/ubuntu focal Release\n",
365
  " 404 Not Found [IP: 185.125.190.52 80]\n",
366
- "Reading package lists... Done\u001b[33m\n",
 
367
  "\u001b[1;31mE: \u001b[0mThe repository 'http://ppa.launchpad.net/jonathonf/ffmpeg-4/ubuntu focal Release' does not have a Release file.\u001b[0m\n",
368
  "\u001b[33mN: \u001b[0mUpdating from such a repository can't be done securely, and is therefore disabled by default.\u001b[0m\n",
369
  "\u001b[33mN: \u001b[0mSee apt-secure(8) manpage for repository creation and user configuration details.\u001b[0m\n",
@@ -413,172 +375,207 @@
413
  "output_type": "stream",
414
  "text": [
415
  "Defaulting to user installation because normal site-packages is not writeable\n",
416
- "Collecting git+https://github.com/huggingface/datasets\n",
417
- " Cloning https://github.com/huggingface/datasets to /tmp/pip-req-build-_aqg2yxr\n",
418
- " Running command git clone --filter=blob:none --quiet https://github.com/huggingface/datasets /tmp/pip-req-build-_aqg2yxr\n",
419
- " Resolved https://github.com/huggingface/datasets to commit 45508f7d8858579c62d93779873ef5eb6b05bc74\n",
 
 
420
  " Installing build dependencies ... \u001b[?25ldone\n",
421
  "\u001b[?25h Getting requirements to build wheel ... \u001b[?25ldone\n",
422
  "\u001b[?25h Preparing metadata (pyproject.toml) ... \u001b[?25ldone\n",
423
- "\u001b[?25hRequirement already satisfied: dill<0.3.7 in /usr/local/lib/python3.8/dist-packages (from datasets==2.7.1.dev0) (0.3.6)\n",
 
 
 
 
 
 
424
  "Requirement already satisfied: pyarrow>=6.0.0 in /usr/local/lib/python3.8/dist-packages (from datasets==2.7.1.dev0) (10.0.1)\n",
 
425
  "Requirement already satisfied: pyyaml>=5.1 in /usr/lib/python3/dist-packages (from datasets==2.7.1.dev0) (5.3.1)\n",
426
  "Requirement already satisfied: pandas in ./.local/lib/python3.8/site-packages (from datasets==2.7.1.dev0) (1.5.1)\n",
427
- "Requirement already satisfied: huggingface-hub<1.0.0,>=0.2.0 in /usr/local/lib/python3.8/dist-packages (from datasets==2.7.1.dev0) (0.11.1)\n",
428
- "Requirement already satisfied: aiohttp in /usr/local/lib/python3.8/dist-packages (from datasets==2.7.1.dev0) (3.8.3)\n",
429
- "Requirement already satisfied: numpy>=1.17 in ./.local/lib/python3.8/site-packages (from datasets==2.7.1.dev0) (1.24.0rc2)\n",
430
- "Requirement already satisfied: multiprocess in /usr/local/lib/python3.8/dist-packages (from datasets==2.7.1.dev0) (0.70.14)\n",
431
- "Requirement already satisfied: fsspec[http]>=2021.11.1 in /usr/local/lib/python3.8/dist-packages (from datasets==2.7.1.dev0) (2022.11.0)\n",
432
- "Requirement already satisfied: packaging in ./.local/lib/python3.8/site-packages (from datasets==2.7.1.dev0) (21.3)\n",
433
- "Requirement already satisfied: responses<0.19 in /usr/local/lib/python3.8/dist-packages (from datasets==2.7.1.dev0) (0.18.0)\n",
434
  "Requirement already satisfied: requests>=2.19.0 in ./.local/lib/python3.8/site-packages (from datasets==2.7.1.dev0) (2.28.1)\n",
435
- "Requirement already satisfied: tqdm>=4.62.1 in ./.local/lib/python3.8/site-packages (from datasets==2.7.1.dev0) (4.64.1)\n",
436
- "Requirement already satisfied: xxhash in /usr/local/lib/python3.8/dist-packages (from datasets==2.7.1.dev0) (3.1.0)\n",
437
  "Requirement already satisfied: attrs>=17.3.0 in /usr/lib/python3/dist-packages (from aiohttp->datasets==2.7.1.dev0) (19.3.0)\n",
 
438
  "Requirement already satisfied: frozenlist>=1.1.1 in /usr/local/lib/python3.8/dist-packages (from aiohttp->datasets==2.7.1.dev0) (1.3.3)\n",
439
- "Requirement already satisfied: async-timeout<5.0,>=4.0.0a3 in /usr/local/lib/python3.8/dist-packages (from aiohttp->datasets==2.7.1.dev0) (4.0.2)\n",
440
  "Requirement already satisfied: charset-normalizer<3.0,>=2.0 in ./.local/lib/python3.8/site-packages (from aiohttp->datasets==2.7.1.dev0) (2.1.1)\n",
 
441
  "Requirement already satisfied: yarl<2.0,>=1.0 in /usr/local/lib/python3.8/dist-packages (from aiohttp->datasets==2.7.1.dev0) (1.8.2)\n",
442
- "Requirement already satisfied: multidict<7.0,>=4.5 in /usr/local/lib/python3.8/dist-packages (from aiohttp->datasets==2.7.1.dev0) (6.0.3)\n",
443
  "Requirement already satisfied: aiosignal>=1.1.2 in /usr/local/lib/python3.8/dist-packages (from aiohttp->datasets==2.7.1.dev0) (1.3.1)\n",
444
- "Requirement already satisfied: filelock in /usr/lib/python3/dist-packages (from huggingface-hub<1.0.0,>=0.2.0->datasets==2.7.1.dev0) (3.0.12)\n",
445
  "Requirement already satisfied: typing-extensions>=3.7.4.3 in ./.local/lib/python3.8/site-packages (from huggingface-hub<1.0.0,>=0.2.0->datasets==2.7.1.dev0) (4.4.0)\n",
446
- "Requirement already satisfied: pyparsing!=3.0.5,>=2.0.2 in /usr/lib/python3/dist-packages (from packaging->datasets==2.7.1.dev0) (2.4.6)\n",
 
447
  "Requirement already satisfied: urllib3<1.27,>=1.21.1 in ./.local/lib/python3.8/site-packages (from requests>=2.19.0->datasets==2.7.1.dev0) (1.26.13)\n",
448
  "Requirement already satisfied: certifi>=2017.4.17 in ./.local/lib/python3.8/site-packages (from requests>=2.19.0->datasets==2.7.1.dev0) (2022.12.7)\n",
449
- "Requirement already satisfied: idna<4,>=2.5 in ./.local/lib/python3.8/site-packages (from requests>=2.19.0->datasets==2.7.1.dev0) (3.4)\n",
450
  "Requirement already satisfied: python-dateutil>=2.8.1 in ./.local/lib/python3.8/site-packages (from pandas->datasets==2.7.1.dev0) (2.8.2)\n",
451
  "Requirement already satisfied: pytz>=2020.1 in ./.local/lib/python3.8/site-packages (from pandas->datasets==2.7.1.dev0) (2022.5)\n",
452
  "Requirement already satisfied: six>=1.5 in /usr/lib/python3/dist-packages (from python-dateutil>=2.8.1->pandas->datasets==2.7.1.dev0) (1.14.0)\n",
453
- "Defaulting to user installation because normal site-packages is not writeable\n",
454
- "Collecting git+https://github.com/huggingface/transformers\n",
455
- " Cloning https://github.com/huggingface/transformers to /tmp/pip-req-build-x539p2ep\n",
456
- " Running command git clone --filter=blob:none --quiet https://github.com/huggingface/transformers /tmp/pip-req-build-x539p2ep\n",
457
- " Resolved https://github.com/huggingface/transformers to commit e3cc4487fe66e03ec85970ea2db8e5fb34c455f4\n",
 
 
 
 
 
 
458
  " Installing build dependencies ... \u001b[?25ldone\n",
459
  "\u001b[?25h Getting requirements to build wheel ... \u001b[?25ldone\n",
460
  "\u001b[?25h Preparing metadata (pyproject.toml) ... \u001b[?25ldone\n",
461
- "\u001b[?25hRequirement already satisfied: huggingface-hub<1.0,>=0.10.0 in /usr/local/lib/python3.8/dist-packages (from transformers==4.26.0.dev0) (0.11.1)\n",
 
 
 
462
  "Requirement already satisfied: regex!=2019.12.17 in /usr/local/lib/python3.8/dist-packages (from transformers==4.26.0.dev0) (2022.10.31)\n",
463
- "Requirement already satisfied: numpy>=1.17 in ./.local/lib/python3.8/site-packages (from transformers==4.26.0.dev0) (1.24.0rc2)\n",
464
- "Requirement already satisfied: tokenizers!=0.11.3,<0.14,>=0.11.1 in /usr/local/lib/python3.8/dist-packages (from transformers==4.26.0.dev0) (0.13.2)\n",
465
- "Requirement already satisfied: filelock in /usr/lib/python3/dist-packages (from transformers==4.26.0.dev0) (3.0.12)\n",
466
  "Requirement already satisfied: requests in ./.local/lib/python3.8/site-packages (from transformers==4.26.0.dev0) (2.28.1)\n",
467
- "Requirement already satisfied: pyyaml>=5.1 in /usr/lib/python3/dist-packages (from transformers==4.26.0.dev0) (5.3.1)\n",
468
- "Requirement already satisfied: packaging>=20.0 in ./.local/lib/python3.8/site-packages (from transformers==4.26.0.dev0) (21.3)\n",
469
- "Requirement already satisfied: tqdm>=4.27 in ./.local/lib/python3.8/site-packages (from transformers==4.26.0.dev0) (4.64.1)\n",
470
  "Requirement already satisfied: typing-extensions>=3.7.4.3 in ./.local/lib/python3.8/site-packages (from huggingface-hub<1.0,>=0.10.0->transformers==4.26.0.dev0) (4.4.0)\n",
471
- "Requirement already satisfied: pyparsing!=3.0.5,>=2.0.2 in /usr/lib/python3/dist-packages (from packaging>=20.0->transformers==4.26.0.dev0) (2.4.6)\n",
472
- "Requirement already satisfied: idna<4,>=2.5 in ./.local/lib/python3.8/site-packages (from requests->transformers==4.26.0.dev0) (3.4)\n",
473
- "Requirement already satisfied: charset-normalizer<3,>=2 in ./.local/lib/python3.8/site-packages (from requests->transformers==4.26.0.dev0) (2.1.1)\n",
474
  "Requirement already satisfied: certifi>=2017.4.17 in ./.local/lib/python3.8/site-packages (from requests->transformers==4.26.0.dev0) (2022.12.7)\n",
 
475
  "Requirement already satisfied: urllib3<1.27,>=1.21.1 in ./.local/lib/python3.8/site-packages (from requests->transformers==4.26.0.dev0) (1.26.13)\n",
476
- "/usr/bin/sh: 1: cannot create =2.7.1: Permission denied\n",
477
- "Defaulting to user installation because normal site-packages is not writeable\n",
478
- "Requirement already satisfied: librosa in /usr/local/lib/python3.8/dist-packages (0.9.2)\n",
479
- "Requirement already satisfied: scikit-learn>=0.19.1 in /usr/lib/python3/dist-packages (from librosa) (0.22.2.post1)\n",
480
- "Requirement already satisfied: audioread>=2.1.9 in /usr/local/lib/python3.8/dist-packages (from librosa) (3.0.0)\n",
481
- "Requirement already satisfied: joblib>=0.14 in ./.local/lib/python3.8/site-packages (from librosa) (1.2.0)\n",
 
 
 
 
 
 
 
 
 
482
  "Requirement already satisfied: decorator>=4.0.10 in /usr/lib/python3/dist-packages (from librosa) (4.4.2)\n",
483
- "Requirement already satisfied: soundfile>=0.10.2 in /usr/local/lib/python3.8/dist-packages (from librosa) (0.11.0)\n",
484
  "Requirement already satisfied: pooch>=1.0 in /usr/local/lib/python3.8/dist-packages (from librosa) (1.6.0)\n",
485
- "Requirement already satisfied: packaging>=20.0 in ./.local/lib/python3.8/site-packages (from librosa) (21.3)\n",
486
- "Requirement already satisfied: numpy>=1.17.0 in ./.local/lib/python3.8/site-packages (from librosa) (1.24.0rc2)\n",
487
  "Requirement already satisfied: scipy>=1.2.0 in ./.local/lib/python3.8/site-packages (from librosa) (1.9.3)\n",
 
 
 
 
 
488
  "Requirement already satisfied: resampy>=0.2.2 in /usr/local/lib/python3.8/dist-packages (from librosa) (0.4.2)\n",
489
- "Requirement already satisfied: numba>=0.45.1 in /usr/local/lib/python3.8/dist-packages (from librosa) (0.56.4)\n",
490
- "Collecting numpy>=1.17.0\n",
491
- " Downloading numpy-1.23.5-cp38-cp38-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (17.1 MB)\n",
492
- "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m17.1/17.1 MB\u001b[0m \u001b[31m104.1 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m00:01\u001b[0m00:01\u001b[0m\n",
493
- "\u001b[?25hRequirement already satisfied: setuptools in ./.local/lib/python3.8/site-packages (from numba>=0.45.1->librosa) (65.6.3)\n",
494
  "Requirement already satisfied: importlib-metadata in ./.local/lib/python3.8/site-packages (from numba>=0.45.1->librosa) (5.0.0)\n",
495
  "Requirement already satisfied: llvmlite<0.40,>=0.39.0dev0 in /usr/local/lib/python3.8/dist-packages (from numba>=0.45.1->librosa) (0.39.1)\n",
496
- "Requirement already satisfied: pyparsing!=3.0.5,>=2.0.2 in /usr/lib/python3/dist-packages (from packaging>=20.0->librosa) (2.4.6)\n",
497
  "Requirement already satisfied: requests>=2.19.0 in ./.local/lib/python3.8/site-packages (from pooch>=1.0->librosa) (2.28.1)\n",
498
  "Requirement already satisfied: appdirs>=1.3.0 in /usr/lib/python3/dist-packages (from pooch>=1.0->librosa) (1.4.3)\n",
499
  "Requirement already satisfied: cffi>=1.0 in /usr/lib/python3/dist-packages (from soundfile>=0.10.2->librosa) (1.14.0)\n",
500
  "Requirement already satisfied: certifi>=2017.4.17 in ./.local/lib/python3.8/site-packages (from requests>=2.19.0->pooch>=1.0->librosa) (2022.12.7)\n",
 
501
  "Requirement already satisfied: idna<4,>=2.5 in ./.local/lib/python3.8/site-packages (from requests>=2.19.0->pooch>=1.0->librosa) (3.4)\n",
502
  "Requirement already satisfied: urllib3<1.27,>=1.21.1 in ./.local/lib/python3.8/site-packages (from requests>=2.19.0->pooch>=1.0->librosa) (1.26.13)\n",
503
- "Requirement already satisfied: charset-normalizer<3,>=2 in ./.local/lib/python3.8/site-packages (from requests>=2.19.0->pooch>=1.0->librosa) (2.1.1)\n",
504
  "Requirement already satisfied: zipp>=0.5 in /usr/lib/python3/dist-packages (from importlib-metadata->numba>=0.45.1->librosa) (1.0.0)\n",
505
- "Installing collected packages: numpy\n",
506
- " Attempting uninstall: numpy\n",
507
- " Found existing installation: numpy 1.24.0rc2\n",
508
- " Uninstalling numpy-1.24.0rc2:\n",
509
- " Successfully uninstalled numpy-1.24.0rc2\n",
510
- "Successfully installed numpy-1.23.5\n",
511
- "/usr/bin/sh: 1: cannot create =0.3.0: Permission denied\n",
512
- "Defaulting to user installation because normal site-packages is not writeable\n",
513
- "Requirement already satisfied: jiwer in /usr/local/lib/python3.8/dist-packages (2.5.1)\n",
 
 
 
 
 
514
  "Requirement already satisfied: levenshtein==0.20.2 in /usr/local/lib/python3.8/dist-packages (from jiwer) (0.20.2)\n",
515
- "Requirement already satisfied: rapidfuzz<3.0.0,>=2.3.0 in /usr/local/lib/python3.8/dist-packages (from levenshtein==0.20.2->jiwer) (2.13.3)\n",
516
- "Defaulting to user installation because normal site-packages is not writeable\n",
517
- "Requirement already satisfied: gradio in /usr/local/lib/python3.8/dist-packages (3.12.0)\n",
518
- "Requirement already satisfied: pillow in ./.local/lib/python3.8/site-packages (from gradio) (9.3.0)\n",
519
- "Requirement already satisfied: requests in ./.local/lib/python3.8/site-packages (from gradio) (2.28.1)\n",
520
  "Requirement already satisfied: pyyaml in /usr/lib/python3/dist-packages (from gradio) (5.3.1)\n",
521
- "Requirement already satisfied: pycryptodome in /usr/local/lib/python3.8/dist-packages (from gradio) (3.16.0)\n",
522
- "Requirement already satisfied: ffmpy in /usr/local/lib/python3.8/dist-packages (from gradio) (0.3.0)\n",
523
- "Requirement already satisfied: websockets>=10.0 in /usr/local/lib/python3.8/dist-packages (from gradio) (10.4)\n",
 
524
  "Requirement already satisfied: aiohttp in /usr/local/lib/python3.8/dist-packages (from gradio) (3.8.3)\n",
525
- "Requirement already satisfied: paramiko in /usr/local/lib/python3.8/dist-packages (from gradio) (2.12.0)\n",
526
- "Requirement already satisfied: pydub in /usr/local/lib/python3.8/dist-packages (from gradio) (0.25.1)\n",
527
- "Requirement already satisfied: fsspec in /usr/local/lib/python3.8/dist-packages (from gradio) (2022.11.0)\n",
528
- "Requirement already satisfied: fastapi in /usr/local/lib/python3.8/dist-packages (from gradio) (0.88.0)\n",
529
- "Requirement already satisfied: python-multipart in /usr/local/lib/python3.8/dist-packages (from gradio) (0.0.5)\n",
530
  "Requirement already satisfied: pydantic in ./.local/lib/python3.8/site-packages (from gradio) (1.10.2)\n",
531
- "Requirement already satisfied: orjson in /usr/local/lib/python3.8/dist-packages (from gradio) (3.8.3)\n",
532
- "Requirement already satisfied: h11<0.13,>=0.11 in /usr/local/lib/python3.8/dist-packages (from gradio) (0.12.0)\n",
533
- "Requirement already satisfied: matplotlib in ./.local/lib/python3.8/site-packages (from gradio) (3.5.3)\n",
534
- "Requirement already satisfied: markdown-it-py[linkify,plugins] in /usr/local/lib/python3.8/dist-packages (from gradio) (2.1.0)\n",
535
  "Requirement already satisfied: jinja2 in ./.local/lib/python3.8/site-packages (from gradio) (3.1.2)\n",
536
- "Requirement already satisfied: numpy in ./.local/lib/python3.8/site-packages (from gradio) (1.23.5)\n",
537
- "Requirement already satisfied: pandas in ./.local/lib/python3.8/site-packages (from gradio) (1.5.1)\n",
538
- "Requirement already satisfied: uvicorn in /usr/local/lib/python3.8/dist-packages (from gradio) (0.20.0)\n",
539
- "Requirement already satisfied: httpx in /usr/local/lib/python3.8/dist-packages (from gradio) (0.23.1)\n",
540
  "Requirement already satisfied: yarl<2.0,>=1.0 in /usr/local/lib/python3.8/dist-packages (from aiohttp->gradio) (1.8.2)\n",
 
541
  "Requirement already satisfied: attrs>=17.3.0 in /usr/lib/python3/dist-packages (from aiohttp->gradio) (19.3.0)\n",
542
- "Requirement already satisfied: async-timeout<5.0,>=4.0.0a3 in /usr/local/lib/python3.8/dist-packages (from aiohttp->gradio) (4.0.2)\n",
543
  "Requirement already satisfied: multidict<7.0,>=4.5 in /usr/local/lib/python3.8/dist-packages (from aiohttp->gradio) (6.0.3)\n",
544
  "Requirement already satisfied: frozenlist>=1.1.1 in /usr/local/lib/python3.8/dist-packages (from aiohttp->gradio) (1.3.3)\n",
545
- "Requirement already satisfied: charset-normalizer<3.0,>=2.0 in ./.local/lib/python3.8/site-packages (from aiohttp->gradio) (2.1.1)\n",
546
- "Requirement already satisfied: aiosignal>=1.1.2 in /usr/local/lib/python3.8/dist-packages (from aiohttp->gradio) (1.3.1)\n",
547
- "Requirement already satisfied: starlette==0.22.0 in /usr/local/lib/python3.8/dist-packages (from fastapi->gradio) (0.22.0)\n",
548
  "Requirement already satisfied: typing-extensions>=3.10.0 in ./.local/lib/python3.8/site-packages (from starlette==0.22.0->fastapi->gradio) (4.4.0)\n",
549
  "Requirement already satisfied: anyio<5,>=3.4.0 in ./.local/lib/python3.8/site-packages (from starlette==0.22.0->fastapi->gradio) (3.6.2)\n",
550
- "Requirement already satisfied: httpcore<0.17.0,>=0.15.0 in /usr/local/lib/python3.8/dist-packages (from httpx->gradio) (0.15.0)\n",
551
- "Requirement already satisfied: sniffio in ./.local/lib/python3.8/site-packages (from httpx->gradio) (1.3.0)\n",
552
- "Requirement already satisfied: rfc3986[idna2008]<2,>=1.3 in /usr/local/lib/python3.8/dist-packages (from httpx->gradio) (1.5.0)\n",
553
  "Requirement already satisfied: certifi in ./.local/lib/python3.8/site-packages (from httpx->gradio) (2022.12.7)\n",
 
554
  "Requirement already satisfied: MarkupSafe>=2.0 in ./.local/lib/python3.8/site-packages (from jinja2->gradio) (2.1.1)\n",
555
- "Requirement already satisfied: mdurl~=0.1 in /usr/local/lib/python3.8/dist-packages (from markdown-it-py[linkify,plugins]->gradio) (0.1.2)\n",
556
- "Requirement already satisfied: mdit-py-plugins in /usr/local/lib/python3.8/dist-packages (from markdown-it-py[linkify,plugins]->gradio) (0.3.3)\n",
557
- "Requirement already satisfied: linkify-it-py~=1.0 in /usr/local/lib/python3.8/dist-packages (from markdown-it-py[linkify,plugins]->gradio) (1.0.3)\n",
558
- "Requirement already satisfied: kiwisolver>=1.0.1 in /usr/lib/python3/dist-packages (from matplotlib->gradio) (1.0.1)\n",
559
  "Requirement already satisfied: cycler>=0.10 in /usr/lib/python3/dist-packages (from matplotlib->gradio) (0.10.0)\n",
560
- "Requirement already satisfied: fonttools>=4.22.0 in ./.local/lib/python3.8/site-packages (from matplotlib->gradio) (4.38.0)\n",
561
  "Requirement already satisfied: pyparsing>=2.2.1 in /usr/lib/python3/dist-packages (from matplotlib->gradio) (2.4.6)\n",
 
 
562
  "Requirement already satisfied: python-dateutil>=2.7 in ./.local/lib/python3.8/site-packages (from matplotlib->gradio) (2.8.2)\n",
563
- "Requirement already satisfied: packaging>=20.0 in ./.local/lib/python3.8/site-packages (from matplotlib->gradio) (21.3)\n",
564
  "Requirement already satisfied: pytz>=2020.1 in ./.local/lib/python3.8/site-packages (from pandas->gradio) (2022.5)\n",
565
- "Requirement already satisfied: cryptography>=2.5 in /usr/lib/python3/dist-packages (from paramiko->gradio) (2.8)\n",
566
  "Requirement already satisfied: pynacl>=1.0.1 in /usr/lib/python3/dist-packages (from paramiko->gradio) (1.3.0)\n",
567
  "Requirement already satisfied: six in /usr/lib/python3/dist-packages (from paramiko->gradio) (1.14.0)\n",
568
- "Requirement already satisfied: bcrypt>=3.1.3 in /usr/local/lib/python3.8/dist-packages (from paramiko->gradio) (4.0.1)\n",
 
569
  "Requirement already satisfied: urllib3<1.27,>=1.21.1 in ./.local/lib/python3.8/site-packages (from requests->gradio) (1.26.13)\n",
570
  "Requirement already satisfied: idna<4,>=2.5 in ./.local/lib/python3.8/site-packages (from requests->gradio) (3.4)\n",
571
  "Requirement already satisfied: click>=7.0 in /usr/lib/python3/dist-packages (from uvicorn->gradio) (7.0)\n",
572
- "Requirement already satisfied: uc-micro-py in /usr/local/lib/python3.8/dist-packages (from linkify-it-py~=1.0->markdown-it-py[linkify,plugins]->gradio) (1.0.1)\n",
573
- "Defaulting to user installation because normal site-packages is not writeable\n",
574
- "Requirement already satisfied: more-itertools in /usr/lib/python3/dist-packages (4.2.0)\n"
575
  ]
576
  }
577
  ],
578
  "source": [
579
  "!pip install git+https://github.com/huggingface/datasets\n",
580
  "!pip install git+https://github.com/huggingface/transformers\n",
581
- "!pip3 install numexpr>=2.7.1\n",
582
  "!pip install librosa\n",
583
  "!pip install evaluate>=0.3.0\n",
584
  "!pip install jiwer\n",
@@ -636,7 +633,7 @@
636
  {
637
  "data": {
638
  "application/vnd.jupyter.widget-view+json": {
639
- "model_id": "cee65d4b203d4b2a910d65aba8ff273c",
640
  "version_major": 2,
641
  "version_minor": 0
642
  },
@@ -652,7 +649,8 @@
652
  "from huggingface_hub import notebook_login\n",
653
  "\n",
654
  "notebook_login()\n",
655
- "\n"
 
656
  ]
657
  },
658
  {
@@ -725,15 +723,22 @@
725
  },
726
  "outputs": [
727
  {
728
- "name": "stderr",
729
  "output_type": "stream",
730
  "text": [
731
- "/home/ubuntu/.local/lib/python3.8/site-packages/pandas/core/computation/expressions.py:20: UserWarning: Pandas requires version '2.7.3' or newer of 'numexpr' (version '2.7.1' currently installed).\n",
732
- " from pandas.core.computation.check import NUMEXPR_INSTALLED\n"
 
 
 
 
 
733
  ]
734
  }
735
  ],
736
  "source": [
 
 
737
  "from datasets import Audio, interleave_datasets, IterableDataset, load_dataset\n",
738
  "from typing import List, Optional\n",
739
  "\n",
@@ -895,7 +900,8 @@
895
  "execution_count": 11,
896
  "id": "77d9f0c5-8607-4642-a8ac-c3ab2e223ea6",
897
  "metadata": {
898
- "id": "77d9f0c5-8607-4642-a8ac-c3ab2e223ea6"
 
899
  },
900
  "outputs": [],
901
  "source": [
@@ -1448,6 +1454,7 @@
1448
  "source": [
1449
  "from transformers import WhisperForConditionalGeneration\n",
1450
  "\n",
 
1451
  "model = WhisperForConditionalGeneration.from_pretrained(\"emilios/whisper-medium-el\")\n",
1452
  "#model = WhisperForConditionalGeneration.from_pretrained(\"farsipal/whisper-small-el\")\n",
1453
  "\n",
@@ -1500,7 +1507,7 @@
1500
  },
1501
  {
1502
  "cell_type": "code",
1503
- "execution_count": 45,
1504
  "id": "0ae3e9af-97b7-4aa0-ae85-20b23b5bcb3a",
1505
  "metadata": {
1506
  "colab": {
@@ -1509,15 +1516,7 @@
1509
  "id": "0ae3e9af-97b7-4aa0-ae85-20b23b5bcb3a",
1510
  "outputId": "7290f729-fb46-4190-dcc5-5cf01e1c9808"
1511
  },
1512
- "outputs": [
1513
- {
1514
- "name": "stderr",
1515
- "output_type": "stream",
1516
- "text": [
1517
- "PyTorch: setting up devices\n"
1518
- ]
1519
- }
1520
- ],
1521
  "source": [
1522
  "from transformers import Seq2SeqTrainingArguments\n",
1523
  "\n",
@@ -1525,14 +1524,14 @@
1525
  " output_dir=\"./whisper-medium-el\", # your repo name\n",
1526
  " #output_dir=\"./whisper-small-el\", # your repo name\n",
1527
  " per_device_train_batch_size=32,\n",
1528
- " gradient_accumulation_steps=2, # increase by 2x for every 2x decrease in batch size\n",
1529
  " learning_rate=1e-5,\n",
1530
  " warmup_steps=500,\n",
1531
  " max_steps=5000,\n",
1532
  " gradient_checkpointing=True,\n",
1533
  " fp16=True,\n",
1534
  " evaluation_strategy=\"steps\",\n",
1535
- " per_device_eval_batch_size=8,\n",
1536
  " predict_with_generate=True,\n",
1537
  " generation_max_length=225,\n",
1538
  " save_steps=1000,\n",
@@ -1543,13 +1542,15 @@
1543
  " metric_for_best_model=\"wer\",\n",
1544
  " greater_is_better=False,\n",
1545
  " push_to_hub=True,\n",
 
 
1546
  " #optim=\"adamw_bnb_8bit\"\n",
1547
  ")\n"
1548
  ]
1549
  },
1550
  {
1551
  "cell_type": "code",
1552
- "execution_count": 46,
1553
  "id": "o72eOpGzD_sK",
1554
  "metadata": {
1555
  "colab": {
@@ -1563,7 +1564,7 @@
1563
  "name": "stdout",
1564
  "output_type": "stream",
1565
  "text": [
1566
- "Thu Dec 8 18:57:05 2022 \n",
1567
  "+-----------------------------------------------------------------------------+\n",
1568
  "| NVIDIA-SMI 515.65.01 Driver Version: 515.65.01 CUDA Version: 11.7 |\n",
1569
  "|-------------------------------+----------------------+----------------------+\n",
@@ -1572,7 +1573,7 @@
1572
  "| | | MIG M. |\n",
1573
  "|===============================+======================+======================|\n",
1574
  "| 0 NVIDIA A100-SXM... On | 00000000:06:00.0 Off | 0 |\n",
1575
- "| N/A 32C P0 55W / 400W | 39261MiB / 40960MiB | 0% Default |\n",
1576
  "| | | Disabled |\n",
1577
  "+-------------------------------+----------------------+----------------------+\n",
1578
  " \n",
@@ -1581,7 +1582,7 @@
1581
  "| GPU GI CI PID Type Process name GPU Memory |\n",
1582
  "| ID ID Usage |\n",
1583
  "|=============================================================================|\n",
1584
- "| 0 N/A N/A 130327 C /usr/bin/python3 39259MiB |\n",
1585
  "+-----------------------------------------------------------------------------+\n"
1586
  ]
1587
  }
@@ -1613,7 +1614,7 @@
1613
  },
1614
  {
1615
  "cell_type": "code",
1616
- "execution_count": 47,
1617
  "id": "3ac16b62-b3c0-4c68-8f3d-9ecf471534b2",
1618
  "metadata": {
1619
  "id": "3ac16b62-b3c0-4c68-8f3d-9ecf471534b2"
@@ -1646,7 +1647,7 @@
1646
  },
1647
  {
1648
  "cell_type": "code",
1649
- "execution_count": 48,
1650
  "id": "d546d7fe-0543-479a-b708-2ebabec19493",
1651
  "metadata": {
1652
  "colab": {
@@ -2217,8 +2218,29 @@
2217
  ]
2218
  },
2219
  "id": "d546d7fe-0543-479a-b708-2ebabec19493",
2220
- "outputId": "2c83cd5c-b1cc-4a3b-8d6c-57239408a297"
 
2221
  },
2222
  "outputs": [
2223
  {
2224
  "name": "stderr",
@@ -2231,6 +2253,7 @@
2231
  }
2232
  ],
2233
  "source": [
 
2234
  "from transformers import Seq2SeqTrainer\n",
2235
  "\n",
2236
  "#import torch._dynamo as dynamo\n",
@@ -2260,7 +2283,7 @@
2260
  },
2261
  {
2262
  "cell_type": "code",
2263
- "execution_count": 49,
2264
  "id": "a1ccb9ed-cbc8-4419-91c0-651e9424b672",
2265
  "metadata": {
2266
  "id": "a1ccb9ed-cbc8-4419-91c0-651e9424b672"
@@ -2343,7 +2366,7 @@
2343
  },
2344
  {
2345
  "cell_type": "code",
2346
- "execution_count": null,
2347
  "id": "ee8b7b8e-1c9a-4d77-9137-1778a629e6de",
2348
  "metadata": {
2349
  "id": "ee8b7b8e-1c9a-4d77-9137-1778a629e6de"
@@ -2353,22 +2376,118 @@
2353
  "name": "stderr",
2354
  "output_type": "stream",
2355
  "text": [
 
2356
  "/home/ubuntu/.local/lib/python3.8/site-packages/transformers/optimization.py:306: FutureWarning: This implementation of AdamW is deprecated and will be removed in a future version. Use the PyTorch implementation torch.optim.AdamW instead, or set `no_deprecation_warning=True` to disable this warning\n",
2357
  " warnings.warn(\n",
2358
  "***** Running training *****\n",
2359
- " Num examples = 320000\n",
2360
  " Num Epochs = 9223372036854775807\n",
2361
  " Instantaneous batch size per device = 32\n",
2362
- " Total train batch size (w. parallel, distributed & accumulation) = 64\n",
2363
- " Gradient Accumulation steps = 2\n",
2364
  " Total optimization steps = 5000\n",
2365
  " Number of trainable parameters = 763857920\n",
2366
- "Reading metadata...: 1914it [00:00, 13829.19it/s]\n"
2367
  ]
2368
  }
2369
  ],
2370
  "source": [
2371
- "trainer.train()"
 
2372
  ]
2373
  },
2374
  {
@@ -2396,7 +2515,7 @@
2396
  },
2397
  {
2398
  "cell_type": "code",
2399
- "execution_count": null,
2400
  "id": "6dd0e310-9b07-4133-ac14-2ed2d7524e22",
2401
  "metadata": {
2402
  "id": "6dd0e310-9b07-4133-ac14-2ed2d7524e22"
@@ -2409,7 +2528,7 @@
2409
  " \"dataset\": \"Common Voice 11.0\", # a 'pretty' name for the training dataset\n",
2410
  " #\"dataset\": \"Google FLEURS\", # a 'pretty' name for the training dataset\n",
2411
  " \"language\": \"el\",\n",
2412
- " \"model_name\": \"Whisper Medium El - Greek One\", # a 'pretty' name for your model\n",
2413
  " \"finetuned_from\": \"openai/whisper-medium\",\n",
2414
  " \"tasks\": \"automatic-speech-recognition\",\n",
2415
  " \"tags\": \"hf-asr-leaderboard, whisper-medium, mozilla-foundation/common_voice_11_0, greek, whisper-event\",\n",
@@ -2428,15 +2547,50 @@
2428
  },
2429
  {
2430
  "cell_type": "code",
2431
- "execution_count": null,
2432
  "id": "95737cda-c5dd-4887-a4d0-dfcb0d61d977",
2433
  "metadata": {
2434
  "id": "95737cda-c5dd-4887-a4d0-dfcb0d61d977"
2435
  },
2436
- "outputs": [],
2437
  "source": [
2438
  "trainer.push_to_hub(**kwargs)"
2439
  ]
2440
  }
2441
  ],
2442
  "metadata": {
 
120
  "output_type": "stream",
121
  "text": [
122
  "Defaulting to user installation because normal site-packages is not writeable\n",
123
+ "\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
124
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
125
+ "\u001b[0mRequirement already satisfied: pip in ./.local/lib/python3.8/site-packages (22.3.1)\n",
126
+ "\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
127
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
128
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
129
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
130
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
131
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
132
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
133
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
134
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
135
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
136
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
137
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
138
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
139
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
140
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
141
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
142
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
143
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
144
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
145
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
146
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
147
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
148
+ "\u001b[0mDefaulting to user installation because normal site-packages is not writeable\n",
149
+ "\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
150
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
151
+ "\u001b[0mRequirement already satisfied: torch in ./.local/lib/python3.8/site-packages (1.13.0)\n",
152
+ "Requirement already satisfied: torchaudio in ./.local/lib/python3.8/site-packages (0.13.0)\n",
153
+ "Requirement already satisfied: torchvision in ./.local/lib/python3.8/site-packages (0.14.0)\n",
154
+ "Requirement already satisfied: typing-extensions in ./.local/lib/python3.8/site-packages (from torch) (4.4.0)\n",
155
+ "Requirement already satisfied: nvidia-cudnn-cu11==8.5.0.96 in ./.local/lib/python3.8/site-packages (from torch) (8.5.0.96)\n",
156
+ "Requirement already satisfied: nvidia-cublas-cu11==11.10.3.66 in ./.local/lib/python3.8/site-packages (from torch) (11.10.3.66)\n",
157
+ "Requirement already satisfied: nvidia-cuda-nvrtc-cu11==11.7.99 in ./.local/lib/python3.8/site-packages (from torch) (11.7.99)\n",
158
+ "Requirement already satisfied: nvidia-cuda-runtime-cu11==11.7.99 in ./.local/lib/python3.8/site-packages (from torch) (11.7.99)\n",
159
+ "Requirement already satisfied: setuptools in ./.local/lib/python3.8/site-packages (from nvidia-cublas-cu11==11.10.3.66->torch) (65.6.3)\n",
160
+ "Requirement already satisfied: wheel in ./.local/lib/python3.8/site-packages (from nvidia-cublas-cu11==11.10.3.66->torch) (0.38.4)\n",
161
+ "Requirement already satisfied: numpy in ./.local/lib/python3.8/site-packages (from torchvision) (1.23.5)\n",
162
+ "Requirement already satisfied: requests in ./.local/lib/python3.8/site-packages (from torchvision) (2.28.1)\n",
163
+ "Requirement already satisfied: pillow!=8.3.*,>=5.3.0 in ./.local/lib/python3.8/site-packages (from torchvision) (9.3.0)\n",
164
+ "Requirement already satisfied: idna<4,>=2.5 in ./.local/lib/python3.8/site-packages (from requests->torchvision) (3.4)\n",
165
+ "Requirement already satisfied: certifi>=2017.4.17 in ./.local/lib/python3.8/site-packages (from requests->torchvision) (2022.12.7)\n",
166
+ "Requirement already satisfied: urllib3<1.27,>=1.21.1 in ./.local/lib/python3.8/site-packages (from requests->torchvision) (1.26.13)\n",
167
+ "Requirement already satisfied: charset-normalizer<3,>=2 in ./.local/lib/python3.8/site-packages (from requests->torchvision) (2.1.1)\n",
168
+ "\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
169
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
170
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
171
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
172
+ "\u001b[0m"
173
  ]
174
  }
175
  ],
176
  "source": [
177
  "!pip3 install --upgrade pip\n",
178
+ "!pip3 install --upgrade numpy>=1.18\n",
179
+ "!pip3 install --upgrade packaging>=20.9\n",
180
+ "!pip3 install --upgrade typing-extensions>=3.7.4.3\n",
181
  "\n",
182
+ "!pip3 install --pre torch torchaudio torchvision --upgrade\n",
183
  "\n",
184
+ "#!pip3 install bitsandbytes\n",
185
  "\n",
186
  "\n",
187
  "#!pip3 install --pre torch torchaudio --extra-index-url https://download.pytorch.org/whl/nightly/cu116\n",
 
231
  "name": "stdout",
232
  "output_type": "stream",
233
  "text": [
234
+ "Mon Dec 12 01:24:00 2022 \n",
235
  "+-----------------------------------------------------------------------------+\n",
236
  "| NVIDIA-SMI 515.65.01 Driver Version: 515.65.01 CUDA Version: 11.7 |\n",
237
  "|-------------------------------+----------------------+----------------------+\n",
 
240
  "| | | MIG M. |\n",
241
  "|===============================+======================+======================|\n",
242
  "| 0 NVIDIA A100-SXM... On | 00000000:06:00.0 Off | 0 |\n",
243
+ "| N/A 33C P0 44W / 400W | 0MiB / 40960MiB | 0% Default |\n",
244
  "| | | Disabled |\n",
245
  "+-------------------------------+----------------------+----------------------+\n",
246
  " \n",
 
292
  "output_type": "stream",
293
  "text": [
294
  "Get:1 https://nvidia.github.io/libnvidia-container/stable/ubuntu18.04/amd64 InRelease [1484 B]\n",
295
+ "Hit:2 https://download.docker.com/linux/ubuntu focal InRelease \n",
296
+ "Hit:3 https://nvidia.github.io/nvidia-container-runtime/stable/ubuntu18.04/amd64 InRelease\n",
297
+ "Hit:4 https://packages.cloud.google.com/apt cloud-sdk InRelease \n",
298
+ "Hit:5 https://packages.microsoft.com/repos/azure-cli focal InRelease \n",
299
+ "Ign:6 http://ppa.launchpad.net/jonathonf/ffmpeg-4/ubuntu focal InRelease \n",
300
+ "Hit:7 http://archive.lambdalabs.com/ubuntu focal InRelease \n",
301
+ "Hit:8 https://pkg.cloudflare.com/cloudflared focal InRelease \n",
302
+ "Err:9 http://ppa.launchpad.net/jonathonf/ffmpeg-4/ubuntu focal Release \n",
 
 
 
 
303
  " 404 Not Found [IP: 185.125.190.52 80]\n",
304
+ "Hit:10 https://ppa.launchpadcontent.net/deadsnakes/ppa/ubuntu focal InRelease \n",
305
+ "Hit:11 http://archive.ubuntu.com/ubuntu focal InRelease \n",
306
+ "Hit:12 http://archive.ubuntu.com/ubuntu focal-updates InRelease\n",
307
+ "Get:13 http://security.ubuntu.com/ubuntu focal-security InRelease [114 kB]\n",
308
+ "Get:14 http://archive.ubuntu.com/ubuntu focal-backports InRelease [108 kB]\n",
309
+ "Reading package lists... Done \n",
310
  "E: The repository 'http://ppa.launchpad.net/jonathonf/ffmpeg-4/ubuntu focal Release' does not have a Release file.\n",
311
  "N: Updating from such a repository can't be done securely, and is therefore disabled by default.\n",
312
  "N: See apt-secure(8) manpage for repository creation and user configuration details.\n",
313
+ "Hit:1 https://download.docker.com/linux/ubuntu focal InRelease\n",
314
+ "Get:2 https://nvidia.github.io/libnvidia-container/stable/ubuntu18.04/amd64 InRelease [1484 B]\n",
315
  "Hit:3 https://nvidia.github.io/nvidia-container-runtime/stable/ubuntu18.04/amd64 InRelease\n",
316
+ "Hit:4 https://packages.microsoft.com/repos/azure-cli focal InRelease \u001b[0m\n",
317
+ "Get:5 http://security.ubuntu.com/ubuntu focal-security InRelease [114 kB] \u001b[0m\n",
318
+ "Hit:6 https://packages.cloud.google.com/apt cloud-sdk InRelease \u001b[0m\n",
319
+ "Hit:7 http://archive.ubuntu.com/ubuntu focal InRelease \u001b[0m\u001b[33m\u001b[33m\n",
320
+ "Hit:8 http://archive.ubuntu.com/ubuntu focal-updates InRelease \u001b[0m\n",
321
+ "Get:9 http://archive.ubuntu.com/ubuntu focal-backports InRelease [108 kB] \u001b[0m\n",
322
+ "Ign:10 http://ppa.launchpad.net/jonathonf/ffmpeg-4/ubuntu focal InRelease \u001b[0m\u001b[33m\n",
323
+ "Hit:11 http://archive.lambdalabs.com/ubuntu focal InRelease \u001b[0m\u001b[33m\u001b[33m\n",
324
+ "Hit:12 https://pkg.cloudflare.com/cloudflared focal InRelease \u001b[0m\u001b[33m\n",
325
+ "Err:13 http://ppa.launchpad.net/jonathonf/ffmpeg-4/ubuntu focal Release \u001b[0m\u001b[33m\n",
 
326
  " 404 Not Found [IP: 185.125.190.52 80]\n",
327
+ "Hit:14 https://ppa.launchpadcontent.net/deadsnakes/ppa/ubuntu focal InRelease\n",
328
+ "Reading package lists... Done\u001b[33m\u001b[33m\u001b[33m\n",
329
  "\u001b[1;31mE: \u001b[0mThe repository 'http://ppa.launchpad.net/jonathonf/ffmpeg-4/ubuntu focal Release' does not have a Release file.\u001b[0m\n",
330
  "\u001b[33mN: \u001b[0mUpdating from such a repository can't be done securely, and is therefore disabled by default.\u001b[0m\n",
331
  "\u001b[33mN: \u001b[0mSee apt-secure(8) manpage for repository creation and user configuration details.\u001b[0m\n",
 
375
  "output_type": "stream",
376
  "text": [
377
  "Defaulting to user installation because normal site-packages is not writeable\n",
378
+ "\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
379
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
380
+ "\u001b[0mCollecting git+https://github.com/huggingface/datasets\n",
381
+ " Cloning https://github.com/huggingface/datasets to /tmp/pip-req-build-fzgi4uj2\n",
382
+ " Running command git clone --filter=blob:none --quiet https://github.com/huggingface/datasets /tmp/pip-req-build-fzgi4uj2\n",
383
+ " Resolved https://github.com/huggingface/datasets to commit 5266c81430628edc175013692f02f5f2747ff29e\n",
384
  " Installing build dependencies ... \u001b[?25ldone\n",
385
  "\u001b[?25h Getting requirements to build wheel ... \u001b[?25ldone\n",
386
  "\u001b[?25h Preparing metadata (pyproject.toml) ... \u001b[?25ldone\n",
387
+ "\u001b[?25hRequirement already satisfied: xxhash in /usr/local/lib/python3.8/dist-packages (from datasets==2.7.1.dev0) (3.1.0)\n",
388
+ "Requirement already satisfied: fsspec[http]>=2021.11.1 in /usr/local/lib/python3.8/dist-packages (from datasets==2.7.1.dev0) (2022.11.0)\n",
389
+ "Requirement already satisfied: numpy>=1.17 in ./.local/lib/python3.8/site-packages (from datasets==2.7.1.dev0) (1.23.5)\n",
390
+ "Requirement already satisfied: huggingface-hub<1.0.0,>=0.2.0 in /usr/local/lib/python3.8/dist-packages (from datasets==2.7.1.dev0) (0.11.1)\n",
391
+ "Requirement already satisfied: tqdm>=4.62.1 in ./.local/lib/python3.8/site-packages (from datasets==2.7.1.dev0) (4.64.1)\n",
392
+ "Requirement already satisfied: dill<0.3.7 in /usr/local/lib/python3.8/dist-packages (from datasets==2.7.1.dev0) (0.3.6)\n",
393
+ "Requirement already satisfied: packaging in ./.local/lib/python3.8/site-packages (from datasets==2.7.1.dev0) (22.0)\n",
394
  "Requirement already satisfied: pyarrow>=6.0.0 in /usr/local/lib/python3.8/dist-packages (from datasets==2.7.1.dev0) (10.0.1)\n",
395
+ "Requirement already satisfied: aiohttp in /usr/local/lib/python3.8/dist-packages (from datasets==2.7.1.dev0) (3.8.3)\n",
396
  "Requirement already satisfied: pyyaml>=5.1 in /usr/lib/python3/dist-packages (from datasets==2.7.1.dev0) (5.3.1)\n",
397
  "Requirement already satisfied: pandas in ./.local/lib/python3.8/site-packages (from datasets==2.7.1.dev0) (1.5.1)\n",
 
 
 
 
 
 
 
398
  "Requirement already satisfied: requests>=2.19.0 in ./.local/lib/python3.8/site-packages (from datasets==2.7.1.dev0) (2.28.1)\n",
399
+ "Requirement already satisfied: responses<0.19 in /usr/local/lib/python3.8/dist-packages (from datasets==2.7.1.dev0) (0.18.0)\n",
400
+ "Requirement already satisfied: multiprocess in /usr/local/lib/python3.8/dist-packages (from datasets==2.7.1.dev0) (0.70.14)\n",
401
  "Requirement already satisfied: attrs>=17.3.0 in /usr/lib/python3/dist-packages (from aiohttp->datasets==2.7.1.dev0) (19.3.0)\n",
402
+ "Requirement already satisfied: multidict<7.0,>=4.5 in /usr/local/lib/python3.8/dist-packages (from aiohttp->datasets==2.7.1.dev0) (6.0.3)\n",
403
  "Requirement already satisfied: frozenlist>=1.1.1 in /usr/local/lib/python3.8/dist-packages (from aiohttp->datasets==2.7.1.dev0) (1.3.3)\n",
 
404
  "Requirement already satisfied: charset-normalizer<3.0,>=2.0 in ./.local/lib/python3.8/site-packages (from aiohttp->datasets==2.7.1.dev0) (2.1.1)\n",
405
+ "Requirement already satisfied: async-timeout<5.0,>=4.0.0a3 in /usr/local/lib/python3.8/dist-packages (from aiohttp->datasets==2.7.1.dev0) (4.0.2)\n",
406
  "Requirement already satisfied: yarl<2.0,>=1.0 in /usr/local/lib/python3.8/dist-packages (from aiohttp->datasets==2.7.1.dev0) (1.8.2)\n",
 
407
  "Requirement already satisfied: aiosignal>=1.1.2 in /usr/local/lib/python3.8/dist-packages (from aiohttp->datasets==2.7.1.dev0) (1.3.1)\n",
 
408
  "Requirement already satisfied: typing-extensions>=3.7.4.3 in ./.local/lib/python3.8/site-packages (from huggingface-hub<1.0.0,>=0.2.0->datasets==2.7.1.dev0) (4.4.0)\n",
409
+ "Requirement already satisfied: filelock in /usr/lib/python3/dist-packages (from huggingface-hub<1.0.0,>=0.2.0->datasets==2.7.1.dev0) (3.0.12)\n",
410
+ "Requirement already satisfied: idna<4,>=2.5 in ./.local/lib/python3.8/site-packages (from requests>=2.19.0->datasets==2.7.1.dev0) (3.4)\n",
411
  "Requirement already satisfied: urllib3<1.27,>=1.21.1 in ./.local/lib/python3.8/site-packages (from requests>=2.19.0->datasets==2.7.1.dev0) (1.26.13)\n",
412
  "Requirement already satisfied: certifi>=2017.4.17 in ./.local/lib/python3.8/site-packages (from requests>=2.19.0->datasets==2.7.1.dev0) (2022.12.7)\n",
 
413
  "Requirement already satisfied: python-dateutil>=2.8.1 in ./.local/lib/python3.8/site-packages (from pandas->datasets==2.7.1.dev0) (2.8.2)\n",
414
  "Requirement already satisfied: pytz>=2020.1 in ./.local/lib/python3.8/site-packages (from pandas->datasets==2.7.1.dev0) (2022.5)\n",
415
  "Requirement already satisfied: six>=1.5 in /usr/lib/python3/dist-packages (from python-dateutil>=2.8.1->pandas->datasets==2.7.1.dev0) (1.14.0)\n",
416
+ "\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
417
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
418
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
419
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
420
+ "\u001b[0mDefaulting to user installation because normal site-packages is not writeable\n",
421
+ "\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
422
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
423
+ "\u001b[0mCollecting git+https://github.com/huggingface/transformers\n",
424
+ " Cloning https://github.com/huggingface/transformers to /tmp/pip-req-build-3doeohfs\n",
425
+ " Running command git clone --filter=blob:none --quiet https://github.com/huggingface/transformers /tmp/pip-req-build-3doeohfs\n",
426
+ " Resolved https://github.com/huggingface/transformers to commit 799cea64ac1029d66e9e58f18bc6f47892270723\n",
427
  " Installing build dependencies ... \u001b[?25ldone\n",
428
  "\u001b[?25h Getting requirements to build wheel ... \u001b[?25ldone\n",
429
  "\u001b[?25h Preparing metadata (pyproject.toml) ... \u001b[?25ldone\n",
430
+ "\u001b[?25hRequirement already satisfied: pyyaml>=5.1 in /usr/lib/python3/dist-packages (from transformers==4.26.0.dev0) (5.3.1)\n",
431
+ "Requirement already satisfied: packaging>=20.0 in ./.local/lib/python3.8/site-packages (from transformers==4.26.0.dev0) (22.0)\n",
432
+ "Requirement already satisfied: tqdm>=4.27 in ./.local/lib/python3.8/site-packages (from transformers==4.26.0.dev0) (4.64.1)\n",
433
+ "Requirement already satisfied: huggingface-hub<1.0,>=0.10.0 in /usr/local/lib/python3.8/dist-packages (from transformers==4.26.0.dev0) (0.11.1)\n",
434
  "Requirement already satisfied: regex!=2019.12.17 in /usr/local/lib/python3.8/dist-packages (from transformers==4.26.0.dev0) (2022.10.31)\n",
 
 
 
435
  "Requirement already satisfied: requests in ./.local/lib/python3.8/site-packages (from transformers==4.26.0.dev0) (2.28.1)\n",
436
+ "Requirement already satisfied: filelock in /usr/lib/python3/dist-packages (from transformers==4.26.0.dev0) (3.0.12)\n",
437
+ "Requirement already satisfied: tokenizers!=0.11.3,<0.14,>=0.11.1 in /usr/local/lib/python3.8/dist-packages (from transformers==4.26.0.dev0) (0.13.2)\n",
438
+ "Requirement already satisfied: numpy>=1.17 in ./.local/lib/python3.8/site-packages (from transformers==4.26.0.dev0) (1.23.5)\n",
439
  "Requirement already satisfied: typing-extensions>=3.7.4.3 in ./.local/lib/python3.8/site-packages (from huggingface-hub<1.0,>=0.10.0->transformers==4.26.0.dev0) (4.4.0)\n",
 
 
 
440
  "Requirement already satisfied: certifi>=2017.4.17 in ./.local/lib/python3.8/site-packages (from requests->transformers==4.26.0.dev0) (2022.12.7)\n",
441
+ "Requirement already satisfied: idna<4,>=2.5 in ./.local/lib/python3.8/site-packages (from requests->transformers==4.26.0.dev0) (3.4)\n",
442
  "Requirement already satisfied: urllib3<1.27,>=1.21.1 in ./.local/lib/python3.8/site-packages (from requests->transformers==4.26.0.dev0) (1.26.13)\n",
443
+ "Requirement already satisfied: charset-normalizer<3,>=2 in ./.local/lib/python3.8/site-packages (from requests->transformers==4.26.0.dev0) (2.1.1)\n",
444
+ "\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
445
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
446
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
447
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
448
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
449
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
450
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
451
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
452
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
453
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
454
+ "\u001b[0mDefaulting to user installation because normal site-packages is not writeable\n",
455
+ "\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
456
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
457
+ "\u001b[0mRequirement already satisfied: librosa in /usr/local/lib/python3.8/dist-packages (0.9.2)\n",
458
  "Requirement already satisfied: decorator>=4.0.10 in /usr/lib/python3/dist-packages (from librosa) (4.4.2)\n",
 
459
  "Requirement already satisfied: pooch>=1.0 in /usr/local/lib/python3.8/dist-packages (from librosa) (1.6.0)\n",
460
+ "Requirement already satisfied: numba>=0.45.1 in /usr/local/lib/python3.8/dist-packages (from librosa) (0.56.4)\n",
461
+ "Requirement already satisfied: scikit-learn>=0.19.1 in /usr/lib/python3/dist-packages (from librosa) (0.22.2.post1)\n",
462
  "Requirement already satisfied: scipy>=1.2.0 in ./.local/lib/python3.8/site-packages (from librosa) (1.9.3)\n",
463
+ "Requirement already satisfied: packaging>=20.0 in ./.local/lib/python3.8/site-packages (from librosa) (22.0)\n",
464
+ "Requirement already satisfied: soundfile>=0.10.2 in /usr/local/lib/python3.8/dist-packages (from librosa) (0.11.0)\n",
465
+ "Requirement already satisfied: numpy>=1.17.0 in ./.local/lib/python3.8/site-packages (from librosa) (1.23.5)\n",
466
+ "Requirement already satisfied: audioread>=2.1.9 in /usr/local/lib/python3.8/dist-packages (from librosa) (3.0.0)\n",
467
+ "Requirement already satisfied: joblib>=0.14 in ./.local/lib/python3.8/site-packages (from librosa) (1.2.0)\n",
468
  "Requirement already satisfied: resampy>=0.2.2 in /usr/local/lib/python3.8/dist-packages (from librosa) (0.4.2)\n",
 
 
 
 
 
469
  "Requirement already satisfied: importlib-metadata in ./.local/lib/python3.8/site-packages (from numba>=0.45.1->librosa) (5.0.0)\n",
470
  "Requirement already satisfied: llvmlite<0.40,>=0.39.0dev0 in /usr/local/lib/python3.8/dist-packages (from numba>=0.45.1->librosa) (0.39.1)\n",
471
+ "Requirement already satisfied: setuptools in ./.local/lib/python3.8/site-packages (from numba>=0.45.1->librosa) (65.6.3)\n",
472
  "Requirement already satisfied: requests>=2.19.0 in ./.local/lib/python3.8/site-packages (from pooch>=1.0->librosa) (2.28.1)\n",
473
  "Requirement already satisfied: appdirs>=1.3.0 in /usr/lib/python3/dist-packages (from pooch>=1.0->librosa) (1.4.3)\n",
474
  "Requirement already satisfied: cffi>=1.0 in /usr/lib/python3/dist-packages (from soundfile>=0.10.2->librosa) (1.14.0)\n",
475
  "Requirement already satisfied: certifi>=2017.4.17 in ./.local/lib/python3.8/site-packages (from requests>=2.19.0->pooch>=1.0->librosa) (2022.12.7)\n",
476
+ "Requirement already satisfied: charset-normalizer<3,>=2 in ./.local/lib/python3.8/site-packages (from requests>=2.19.0->pooch>=1.0->librosa) (2.1.1)\n",
477
  "Requirement already satisfied: idna<4,>=2.5 in ./.local/lib/python3.8/site-packages (from requests>=2.19.0->pooch>=1.0->librosa) (3.4)\n",
478
  "Requirement already satisfied: urllib3<1.27,>=1.21.1 in ./.local/lib/python3.8/site-packages (from requests>=2.19.0->pooch>=1.0->librosa) (1.26.13)\n",
 
479
  "Requirement already satisfied: zipp>=0.5 in /usr/lib/python3/dist-packages (from importlib-metadata->numba>=0.45.1->librosa) (1.0.0)\n",
480
+ "\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
481
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
482
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
483
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
484
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
485
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
486
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
487
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
488
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
489
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
490
+ "\u001b[0mDefaulting to user installation because normal site-packages is not writeable\n",
491
+ "\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
492
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
493
+ "\u001b[0mRequirement already satisfied: jiwer in /usr/local/lib/python3.8/dist-packages (2.5.1)\n",
494
  "Requirement already satisfied: levenshtein==0.20.2 in /usr/local/lib/python3.8/dist-packages (from jiwer) (0.20.2)\n",
495
+ "Requirement already satisfied: rapidfuzz<3.0.0,>=2.3.0 in /usr/local/lib/python3.8/dist-packages (from levenshtein==0.20.2->jiwer) (2.13.5)\n",
496
+ "\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
497
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
498
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
499
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
500
+ "\u001b[0mDefaulting to user installation because normal site-packages is not writeable\n",
501
+ "\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
502
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
503
+ "\u001b[0mRequirement already satisfied: gradio in ./.local/lib/python3.8/site-packages (3.12.0)\n",
504
+ "Requirement already satisfied: matplotlib in ./.local/lib/python3.8/site-packages (from gradio) (3.5.3)\n",
505
+ "Requirement already satisfied: fastapi in ./.local/lib/python3.8/site-packages (from gradio) (0.88.0)\n",
506
+ "Requirement already satisfied: h11<0.13,>=0.11 in ./.local/lib/python3.8/site-packages (from gradio) (0.12.0)\n",
507
+ "Requirement already satisfied: orjson in ./.local/lib/python3.8/site-packages (from gradio) (3.8.3)\n",
508
+ "Requirement already satisfied: httpx in ./.local/lib/python3.8/site-packages (from gradio) (0.23.1)\n",
509
+ "Requirement already satisfied: pycryptodome in ./.local/lib/python3.8/site-packages (from gradio) (3.16.0)\n",
510
  "Requirement already satisfied: pyyaml in /usr/lib/python3/dist-packages (from gradio) (5.3.1)\n",
511
+ "Requirement already satisfied: websockets>=10.0 in ./.local/lib/python3.8/site-packages (from gradio) (10.4)\n",
512
+ "Requirement already satisfied: requests in ./.local/lib/python3.8/site-packages (from gradio) (2.28.1)\n",
513
+ "Requirement already satisfied: ffmpy in ./.local/lib/python3.8/site-packages (from gradio) (0.3.0)\n",
514
+ "Requirement already satisfied: numpy in ./.local/lib/python3.8/site-packages (from gradio) (1.23.5)\n",
515
  "Requirement already satisfied: aiohttp in /usr/local/lib/python3.8/dist-packages (from gradio) (3.8.3)\n",
516
+ "Requirement already satisfied: paramiko in ./.local/lib/python3.8/site-packages (from gradio) (2.12.0)\n",
517
+ "Requirement already satisfied: pandas in ./.local/lib/python3.8/site-packages (from gradio) (1.5.1)\n",
518
+ "Requirement already satisfied: pydub in ./.local/lib/python3.8/site-packages (from gradio) (0.25.1)\n",
519
+ "Requirement already satisfied: pillow in ./.local/lib/python3.8/site-packages (from gradio) (9.3.0)\n",
520
+ "Requirement already satisfied: uvicorn in ./.local/lib/python3.8/site-packages (from gradio) (0.20.0)\n",
521
  "Requirement already satisfied: pydantic in ./.local/lib/python3.8/site-packages (from gradio) (1.10.2)\n",
522
+ "Requirement already satisfied: markdown-it-py[linkify,plugins] in ./.local/lib/python3.8/site-packages (from gradio) (2.1.0)\n",
523
+ "Requirement already satisfied: python-multipart in ./.local/lib/python3.8/site-packages (from gradio) (0.0.5)\n",
524
+ "Requirement already satisfied: fsspec in /usr/local/lib/python3.8/dist-packages (from gradio) (2022.11.0)\n",
 
525
  "Requirement already satisfied: jinja2 in ./.local/lib/python3.8/site-packages (from gradio) (3.1.2)\n",
 
 
 
 
526
  "Requirement already satisfied: yarl<2.0,>=1.0 in /usr/local/lib/python3.8/dist-packages (from aiohttp->gradio) (1.8.2)\n",
527
+ "Requirement already satisfied: charset-normalizer<3.0,>=2.0 in ./.local/lib/python3.8/site-packages (from aiohttp->gradio) (2.1.1)\n",
528
  "Requirement already satisfied: attrs>=17.3.0 in /usr/lib/python3/dist-packages (from aiohttp->gradio) (19.3.0)\n",
529
+ "Requirement already satisfied: aiosignal>=1.1.2 in /usr/local/lib/python3.8/dist-packages (from aiohttp->gradio) (1.3.1)\n",
530
  "Requirement already satisfied: multidict<7.0,>=4.5 in /usr/local/lib/python3.8/dist-packages (from aiohttp->gradio) (6.0.3)\n",
531
  "Requirement already satisfied: frozenlist>=1.1.1 in /usr/local/lib/python3.8/dist-packages (from aiohttp->gradio) (1.3.3)\n",
532
+ "Requirement already satisfied: async-timeout<5.0,>=4.0.0a3 in /usr/local/lib/python3.8/dist-packages (from aiohttp->gradio) (4.0.2)\n",
533
+ "Requirement already satisfied: starlette==0.22.0 in ./.local/lib/python3.8/site-packages (from fastapi->gradio) (0.22.0)\n",
 
534
  "Requirement already satisfied: typing-extensions>=3.10.0 in ./.local/lib/python3.8/site-packages (from starlette==0.22.0->fastapi->gradio) (4.4.0)\n",
535
  "Requirement already satisfied: anyio<5,>=3.4.0 in ./.local/lib/python3.8/site-packages (from starlette==0.22.0->fastapi->gradio) (3.6.2)\n",
536
+ "Requirement already satisfied: rfc3986[idna2008]<2,>=1.3 in ./.local/lib/python3.8/site-packages (from httpx->gradio) (1.5.0)\n",
537
+ "Requirement already satisfied: httpcore<0.17.0,>=0.15.0 in ./.local/lib/python3.8/site-packages (from httpx->gradio) (0.15.0)\n",
 
538
  "Requirement already satisfied: certifi in ./.local/lib/python3.8/site-packages (from httpx->gradio) (2022.12.7)\n",
539
+ "Requirement already satisfied: sniffio in ./.local/lib/python3.8/site-packages (from httpx->gradio) (1.3.0)\n",
540
  "Requirement already satisfied: MarkupSafe>=2.0 in ./.local/lib/python3.8/site-packages (from jinja2->gradio) (2.1.1)\n",
541
+ "Requirement already satisfied: mdurl~=0.1 in ./.local/lib/python3.8/site-packages (from markdown-it-py[linkify,plugins]->gradio) (0.1.2)\n",
542
+ "Requirement already satisfied: mdit-py-plugins in ./.local/lib/python3.8/site-packages (from markdown-it-py[linkify,plugins]->gradio) (0.3.3)\n",
543
+ "Requirement already satisfied: linkify-it-py~=1.0 in ./.local/lib/python3.8/site-packages (from markdown-it-py[linkify,plugins]->gradio) (1.0.3)\n",
544
+ "Requirement already satisfied: packaging>=20.0 in ./.local/lib/python3.8/site-packages (from matplotlib->gradio) (22.0)\n",
545
  "Requirement already satisfied: cycler>=0.10 in /usr/lib/python3/dist-packages (from matplotlib->gradio) (0.10.0)\n",
 
546
  "Requirement already satisfied: pyparsing>=2.2.1 in /usr/lib/python3/dist-packages (from matplotlib->gradio) (2.4.6)\n",
547
+ "Requirement already satisfied: kiwisolver>=1.0.1 in /usr/lib/python3/dist-packages (from matplotlib->gradio) (1.0.1)\n",
548
+ "Requirement already satisfied: fonttools>=4.22.0 in ./.local/lib/python3.8/site-packages (from matplotlib->gradio) (4.38.0)\n",
549
  "Requirement already satisfied: python-dateutil>=2.7 in ./.local/lib/python3.8/site-packages (from matplotlib->gradio) (2.8.2)\n",
 
550
  "Requirement already satisfied: pytz>=2020.1 in ./.local/lib/python3.8/site-packages (from pandas->gradio) (2022.5)\n",
 
551
  "Requirement already satisfied: pynacl>=1.0.1 in /usr/lib/python3/dist-packages (from paramiko->gradio) (1.3.0)\n",
552
  "Requirement already satisfied: six in /usr/lib/python3/dist-packages (from paramiko->gradio) (1.14.0)\n",
553
+ "Requirement already satisfied: cryptography>=2.5 in /usr/lib/python3/dist-packages (from paramiko->gradio) (2.8)\n",
554
+ "Requirement already satisfied: bcrypt>=3.1.3 in ./.local/lib/python3.8/site-packages (from paramiko->gradio) (4.0.1)\n",
555
  "Requirement already satisfied: urllib3<1.27,>=1.21.1 in ./.local/lib/python3.8/site-packages (from requests->gradio) (1.26.13)\n",
556
  "Requirement already satisfied: idna<4,>=2.5 in ./.local/lib/python3.8/site-packages (from requests->gradio) (3.4)\n",
557
  "Requirement already satisfied: click>=7.0 in /usr/lib/python3/dist-packages (from uvicorn->gradio) (7.0)\n",
558
+ "Requirement already satisfied: uc-micro-py in ./.local/lib/python3.8/site-packages (from linkify-it-py~=1.0->markdown-it-py[linkify,plugins]->gradio) (1.0.1)\n",
559
+ "\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
560
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
561
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
562
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
563
+ "\u001b[0mDefaulting to user installation because normal site-packages is not writeable\n",
564
+ "\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
565
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
566
+ "\u001b[0mRequirement already satisfied: more-itertools in /usr/local/lib/python3.8/dist-packages (9.0.0)\n",
567
+ "\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
568
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
569
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
570
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
571
+ "\u001b[0m"
572
  ]
573
  }
574
  ],
575
  "source": [
576
  "!pip install git+https://github.com/huggingface/datasets\n",
577
  "!pip install git+https://github.com/huggingface/transformers\n",
578
+ "!pip3 install 'numexpr>=2.7.3'\n",
579
  "!pip install librosa\n",
580
  "!pip install 'evaluate>=0.3.0'\n",
581
  "!pip install jiwer\n",
 
633
  {
634
  "data": {
635
  "application/vnd.jupyter.widget-view+json": {
636
+ "model_id": "eb5276b0c5844a94939f868f9c37e908",
637
  "version_major": 2,
638
  "version_minor": 0
639
  },
 
649
  "from huggingface_hub import notebook_login\n",
650
  "\n",
651
  "notebook_login()\n",
652
+ "\n",
653
+ "# hf_<REDACTED: access token removed; never commit tokens to the repository>"
654
  ]
655
  },
656
  {
 
723
  },
724
  "outputs": [
725
  {
726
+ "name": "stdout",
727
  "output_type": "stream",
728
  "text": [
729
+ "\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
730
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
731
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
732
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
733
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
734
+ "\u001b[0m\u001b[33mWARNING: Ignoring invalid distribution -orch (/home/ubuntu/.local/lib/python3.8/site-packages)\u001b[0m\u001b[33m\n",
735
+ "\u001b[0m"
736
  ]
737
  }
738
  ],
739
  "source": [
740
+ "!pip3 install --upgrade 'numexpr>=2.7.3'\n",
741
+ "\n",
742
  "from datasets import Audio, interleave_datasets, IterableDataset, load_dataset\n",
743
  "from typing import List, Optional\n",
744
  "\n",
 
900
  "execution_count": 11,
901
  "id": "77d9f0c5-8607-4642-a8ac-c3ab2e223ea6",
902
  "metadata": {
903
+ "id": "77d9f0c5-8607-4642-a8ac-c3ab2e223ea6",
904
+ "tags": []
905
  },
906
  "outputs": [],
907
  "source": [
 
1454
  "source": [
1455
  "from transformers import WhisperForConditionalGeneration\n",
1456
  "\n",
1457
+ "#model = WhisperForConditionalGeneration.from_pretrained(\"emilios/whisper-medium-el-wer-14\")\n",
1458
  "model = WhisperForConditionalGeneration.from_pretrained(\"emilios/whisper-medium-el\")\n",
1459
  "#model = WhisperForConditionalGeneration.from_pretrained(\"farsipal/whisper-small-el\")\n",
1460
  "\n",
 
1507
  },
1508
  {
1509
  "cell_type": "code",
1510
+ "execution_count": 26,
1511
  "id": "0ae3e9af-97b7-4aa0-ae85-20b23b5bcb3a",
1512
  "metadata": {
1513
  "colab": {
 
1516
  "id": "0ae3e9af-97b7-4aa0-ae85-20b23b5bcb3a",
1517
  "outputId": "7290f729-fb46-4190-dcc5-5cf01e1c9808"
1518
  },
1519
+ "outputs": [],
 
 
 
 
 
 
 
 
1520
  "source": [
1521
  "from transformers import Seq2SeqTrainingArguments\n",
1522
  "\n",
 
1524
  " output_dir=\"./whisper-medium-el\", # your repo name\n",
1525
  " #output_dir=\"./whisper-small-el\", # your repo name\n",
1526
  " per_device_train_batch_size=32,\n",
1527
+ " gradient_accumulation_steps=1, # increase by 2x for every 2x decrease in batch size\n",
1528
  " learning_rate=1e-5,\n",
1529
  " warmup_steps=500,\n",
1530
  " max_steps=5000,\n",
1531
  " gradient_checkpointing=True,\n",
1532
  " fp16=True,\n",
1533
  " evaluation_strategy=\"steps\",\n",
1534
+ " per_device_eval_batch_size=16,\n",
1535
  " predict_with_generate=True,\n",
1536
  " generation_max_length=225,\n",
1537
  " save_steps=1000,\n",
 
1542
  " metric_for_best_model=\"wer\",\n",
1543
  " greater_is_better=False,\n",
1544
  " push_to_hub=True,\n",
1545
+ " #ignore_data_skip=True,\n",
1546
+ "# resume_from_checkpoint=\"checkpoint-4000\"\n",
1547
  " #optim=\"adamw_bnb_8bit\"\n",
1548
  ")\n"
1549
  ]
1550
  },
1551
  {
1552
  "cell_type": "code",
1553
+ "execution_count": 27,
1554
  "id": "o72eOpGzD_sK",
1555
  "metadata": {
1556
  "colab": {
 
1564
  "name": "stdout",
1565
  "output_type": "stream",
1566
  "text": [
1567
+ "Mon Dec 12 01:25:00 2022 \n",
1568
  "+-----------------------------------------------------------------------------+\n",
1569
  "| NVIDIA-SMI 515.65.01 Driver Version: 515.65.01 CUDA Version: 11.7 |\n",
1570
  "|-------------------------------+----------------------+----------------------+\n",
 
1573
  "| | | MIG M. |\n",
1574
  "|===============================+======================+======================|\n",
1575
  "| 0 NVIDIA A100-SXM... On | 00000000:06:00.0 Off | 0 |\n",
1576
+ "| N/A 32C P0 44W / 400W | 2MiB / 40960MiB | 0% Default |\n",
1577
  "| | | Disabled |\n",
1578
  "+-------------------------------+----------------------+----------------------+\n",
1579
  " \n",
 
1582
  "| GPU GI CI PID Type Process name GPU Memory |\n",
1583
  "| ID ID Usage |\n",
1584
  "|=============================================================================|\n",
1585
+ "| No running processes found |\n",
1586
  "+-----------------------------------------------------------------------------+\n"
1587
  ]
1588
  }
 
1614
  },
1615
  {
1616
  "cell_type": "code",
1617
+ "execution_count": 28,
1618
  "id": "3ac16b62-b3c0-4c68-8f3d-9ecf471534b2",
1619
  "metadata": {
1620
  "id": "3ac16b62-b3c0-4c68-8f3d-9ecf471534b2"
 
1647
  },
1648
  {
1649
  "cell_type": "code",
1650
+ "execution_count": 29,
1651
  "id": "d546d7fe-0543-479a-b708-2ebabec19493",
1652
  "metadata": {
1653
  "colab": {
 
2218
  ]
2219
  },
2220
  "id": "d546d7fe-0543-479a-b708-2ebabec19493",
2221
+ "outputId": "2c83cd5c-b1cc-4a3b-8d6c-57239408a297",
2222
+ "tags": []
2223
  },
2224
+ "outputs": [],
2225
+ "source": [
2226
+ "#from huggingface_hub import Repository\n",
2227
+ "#repo = Repository(local_dir=\"huggingface-hub\", clone_from=\"emilios/whisper-medium-el\")\n",
2228
+ "#repo.git_pull()\n"
2229
+ ]
2230
+ },
2231
+ {
2232
+ "cell_type": "code",
2233
+ "execution_count": null,
2234
+ "id": "bfa35273-00f8-49c1-846a-d413cd2f072d",
2235
+ "metadata": {},
2236
+ "outputs": [],
2237
+ "source": []
2238
+ },
2239
+ {
2240
+ "cell_type": "code",
2241
+ "execution_count": 30,
2242
+ "id": "7118041a-5ff8-48ae-b02b-3efdb0d13490",
2243
+ "metadata": {},
2244
  "outputs": [
2245
  {
2246
  "name": "stderr",
 
2253
  }
2254
  ],
2255
  "source": [
2256
+ "\n",
2257
  "from transformers import Seq2SeqTrainer\n",
2258
  "\n",
2259
  "#import torch._dynamo as dynamo\n",
 
2283
  },
2284
  {
2285
  "cell_type": "code",
2286
+ "execution_count": 31,
2287
  "id": "a1ccb9ed-cbc8-4419-91c0-651e9424b672",
2288
  "metadata": {
2289
  "id": "a1ccb9ed-cbc8-4419-91c0-651e9424b672"
 
2366
  },
2367
  {
2368
  "cell_type": "code",
2369
+ "execution_count": 32,
2370
  "id": "ee8b7b8e-1c9a-4d77-9137-1778a629e6de",
2371
  "metadata": {
2372
  "id": "ee8b7b8e-1c9a-4d77-9137-1778a629e6de"
 
2376
  "name": "stderr",
2377
  "output_type": "stream",
2378
  "text": [
2379
+ "Loading model from ./whisper-medium-el/checkpoint-4000.\n",
2380
  "/home/ubuntu/.local/lib/python3.8/site-packages/transformers/optimization.py:306: FutureWarning: This implementation of AdamW is deprecated and will be removed in a future version. Use the PyTorch implementation torch.optim.AdamW instead, or set `no_deprecation_warning=True` to disable this warning\n",
2381
  " warnings.warn(\n",
2382
  "***** Running training *****\n",
2383
+ " Num examples = 160000\n",
2384
  " Num Epochs = 9223372036854775807\n",
2385
  " Instantaneous batch size per device = 32\n",
2386
+ " Total train batch size (w. parallel, distributed & accumulation) = 32\n",
2387
+ " Gradient Accumulation steps = 1\n",
2388
  " Total optimization steps = 5000\n",
2389
  " Number of trainable parameters = 763857920\n",
2390
+ " Continuing training from checkpoint, will skip to saved global_step\n",
2391
+ " Continuing training from epoch 0\n",
2392
+ " Continuing training from global step 4000\n",
2393
+ "Reading metadata...: 1914it [00:00, 10645.75it/s]\n",
2394
+ "The following columns in the training set don't have a corresponding argument in `WhisperForConditionalGeneration.forward` and have been ignored: input_length. If input_length are not expected by `WhisperForConditionalGeneration.forward`, you can safely ignore this message.\n"
2395
+ ]
2396
+ },
2397
+ {
2398
+ "data": {
2399
+ "text/html": [
2400
+ "\n",
2401
+ " <div>\n",
2402
+ " \n",
2403
+ " <progress value='5001' max='5000' style='width:300px; height:20px; vertical-align: middle;'></progress>\n",
2404
+ " [5000/5000 1:46:44, Epoch 4.04/9223372036854775807]\n",
2405
+ " </div>\n",
2406
+ " <table border=\"1\" class=\"dataframe\">\n",
2407
+ " <thead>\n",
2408
+ " <tr style=\"text-align: left;\">\n",
2409
+ " <th>Step</th>\n",
2410
+ " <th>Training Loss</th>\n",
2411
+ " <th>Validation Loss</th>\n",
2412
+ " <th>Wer</th>\n",
2413
+ " </tr>\n",
2414
+ " </thead>\n",
2415
+ " <tbody>\n",
2416
+ " <tr>\n",
2417
+ " <td>5000</td>\n",
2418
+ " <td>0.002400</td>\n",
2419
+ " <td>0.392416</td>\n",
2420
+ " <td>12.444279</td>\n",
2421
+ " </tr>\n",
2422
+ " </tbody>\n",
2423
+ "</table><p>"
2424
+ ],
2425
+ "text/plain": [
2426
+ "<IPython.core.display.HTML object>"
2427
+ ]
2428
+ },
2429
+ "metadata": {},
2430
+ "output_type": "display_data"
2431
+ },
2432
+ {
2433
+ "name": "stderr",
2434
+ "output_type": "stream",
2435
+ "text": [
2436
+ "Reading metadata...: 1914it [00:00, 49446.87it/s]\n",
2437
+ "Reading metadata...: 1914it [00:01, 1168.90it/s]\n",
2438
+ "Reading metadata...: 1914it [00:00, 53478.68it/s]\n",
2439
+ "Reading metadata...: 1914it [00:00, 56742.28it/s]\n",
2440
+ "Reading metadata...: 1914it [00:00, 43779.78it/s]\n",
2441
+ "Reading metadata...: 1914it [00:00, 47903.73it/s]\n",
2442
+ "Reading metadata...: 1914it [00:00, 41580.46it/s]\n",
2443
+ "Reading metadata...: 1914it [00:00, 45482.55it/s]\n",
2444
+ "Reading metadata...: 1914it [00:00, 35803.98it/s]\n",
2445
+ "***** Running Evaluation *****\n",
2446
+ " Num examples: Unknown\n",
2447
+ " Batch size = 16\n",
2448
+ "Reading metadata...: 1696it [00:00, 12322.32it/s]\n",
2449
+ "The following columns in the evaluation set don't have a corresponding argument in `WhisperForConditionalGeneration.forward` and have been ignored: down_votes, up_votes, segment, gender, accent, locale, input_length, path, age, client_id. If down_votes, up_votes, segment, gender, accent, locale, input_length, path, age, client_id are not expected by `WhisperForConditionalGeneration.forward`, you can safely ignore this message.\n",
2450
+ "Saving model checkpoint to ./whisper-medium-el/checkpoint-5000\n",
2451
+ "Configuration saved in ./whisper-medium-el/checkpoint-5000/config.json\n",
2452
+ "Model weights saved in ./whisper-medium-el/checkpoint-5000/pytorch_model.bin\n",
2453
+ "Feature extractor saved in ./whisper-medium-el/checkpoint-5000/preprocessor_config.json\n",
2454
+ "tokenizer config file saved in ./whisper-medium-el/checkpoint-5000/tokenizer_config.json\n",
2455
+ "Special tokens file saved in ./whisper-medium-el/checkpoint-5000/special_tokens_map.json\n",
2456
+ "added tokens file saved in ./whisper-medium-el/checkpoint-5000/added_tokens.json\n",
2457
+ "Feature extractor saved in ./whisper-medium-el/preprocessor_config.json\n",
2458
+ "tokenizer config file saved in ./whisper-medium-el/tokenizer_config.json\n",
2459
+ "Special tokens file saved in ./whisper-medium-el/special_tokens_map.json\n",
2460
+ "added tokens file saved in ./whisper-medium-el/added_tokens.json\n"
2461
+ ]
2462
+ },
2463
+ {
2464
+ "ename": "OSError",
2465
+ "evalue": "error: insufficient permission for adding an object to repository database .git/objects\nerror: runs/Dec12_01-10-01_150-136-33-0/events.out.tfevents.1670807412.150-136-33-0.2756654.4: failed to insert into database\nerror: unable to index file 'runs/Dec12_01-10-01_150-136-33-0/events.out.tfevents.1670807412.150-136-33-0.2756654.4'\nfatal: adding files failed\nEncountered 1 file(s) that may not have been copied correctly on Windows:\n\tcheckpoint-5000/optimizer.pt\n\nSee: `git lfs help smudge` for more details.\n",
2466
+ "output_type": "error",
2467
+ "traceback": [
2468
+ "\u001b[0;31m---------------------------------------------------------------------------\u001b[0m",
2469
+ "\u001b[0;31mCalledProcessError\u001b[0m Traceback (most recent call last)",
2470
+ "\u001b[0;32m/usr/local/lib/python3.8/dist-packages/huggingface_hub/repository.py\u001b[0m in \u001b[0;36mgit_add\u001b[0;34m(self, pattern, auto_lfs_track)\u001b[0m\n\u001b[1;32m 1121\u001b[0m \u001b[0;32mtry\u001b[0m\u001b[0;34m:\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n\u001b[0;32m-> 1122\u001b[0;31m \u001b[0mresult\u001b[0m \u001b[0;34m=\u001b[0m \u001b[0mrun_subprocess\u001b[0m\u001b[0;34m(\u001b[0m\u001b[0;34m\"git add -v\"\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0msplit\u001b[0m\u001b[0;34m(\u001b[0m\u001b[0;34m)\u001b[0m \u001b[0;34m+\u001b[0m \u001b[0;34m[\u001b[0m\u001b[0mpattern\u001b[0m\u001b[0;34m]\u001b[0m\u001b[0;34m,\u001b[0m \u001b[0mself\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0mlocal_dir\u001b[0m\u001b[0;34m)\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n\u001b[0m\u001b[1;32m 1123\u001b[0m \u001b[0mlogger\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0minfo\u001b[0m\u001b[0;34m(\u001b[0m\u001b[0;34mf\"Adding to index:\\n{result.stdout}\\n\"\u001b[0m\u001b[0;34m)\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n",
2471
+ "\u001b[0;32m/usr/local/lib/python3.8/dist-packages/huggingface_hub/utils/_subprocess.py\u001b[0m in \u001b[0;36mrun_subprocess\u001b[0;34m(command, folder, check, **kwargs)\u001b[0m\n\u001b[1;32m 60\u001b[0m \u001b[0;34m\u001b[0m\u001b[0m\n\u001b[0;32m---> 61\u001b[0;31m return subprocess.run(\n\u001b[0m\u001b[1;32m 62\u001b[0m \u001b[0mcommand\u001b[0m\u001b[0;34m,\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n",
2472
+ "\u001b[0;32m/usr/lib/python3.8/subprocess.py\u001b[0m in \u001b[0;36mrun\u001b[0;34m(input, capture_output, timeout, check, *popenargs, **kwargs)\u001b[0m\n\u001b[1;32m 515\u001b[0m \u001b[0;32mif\u001b[0m \u001b[0mcheck\u001b[0m \u001b[0;32mand\u001b[0m \u001b[0mretcode\u001b[0m\u001b[0;34m:\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n\u001b[0;32m--> 516\u001b[0;31m raise CalledProcessError(retcode, process.args,\n\u001b[0m\u001b[1;32m 517\u001b[0m output=stdout, stderr=stderr)\n",
2473
+ "\u001b[0;31mCalledProcessError\u001b[0m: Command '['git', 'add', '-v', '.']' returned non-zero exit status 128.",
2474
+ "\nDuring handling of the above exception, another exception occurred:\n",
2475
+ "\u001b[0;31mOSError\u001b[0m Traceback (most recent call last)",
2476
+ "\u001b[0;32m<ipython-input-32-d2e90939d965>\u001b[0m in \u001b[0;36m<module>\u001b[0;34m\u001b[0m\n\u001b[1;32m 1\u001b[0m \u001b[0;31m#trainer.train()\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n\u001b[0;32m----> 2\u001b[0;31m \u001b[0mtrainer\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0mtrain\u001b[0m\u001b[0;34m(\u001b[0m\u001b[0mresume_from_checkpoint\u001b[0m \u001b[0;34m=\u001b[0m \u001b[0;32mTrue\u001b[0m\u001b[0;34m)\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n\u001b[0m",
2477
+ "\u001b[0;32m~/.local/lib/python3.8/site-packages/transformers/trainer.py\u001b[0m in \u001b[0;36mtrain\u001b[0;34m(self, resume_from_checkpoint, trial, ignore_keys_for_eval, **kwargs)\u001b[0m\n\u001b[1;32m 1533\u001b[0m \u001b[0mself\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0m_inner_training_loop\u001b[0m\u001b[0;34m,\u001b[0m \u001b[0mself\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0m_train_batch_size\u001b[0m\u001b[0;34m,\u001b[0m \u001b[0margs\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0mauto_find_batch_size\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n\u001b[1;32m 1534\u001b[0m )\n\u001b[0;32m-> 1535\u001b[0;31m return inner_training_loop(\n\u001b[0m\u001b[1;32m 1536\u001b[0m \u001b[0margs\u001b[0m\u001b[0;34m=\u001b[0m\u001b[0margs\u001b[0m\u001b[0;34m,\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n\u001b[1;32m 1537\u001b[0m \u001b[0mresume_from_checkpoint\u001b[0m\u001b[0;34m=\u001b[0m\u001b[0mresume_from_checkpoint\u001b[0m\u001b[0;34m,\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n",
2478
+ "\u001b[0;32m~/.local/lib/python3.8/site-packages/transformers/trainer.py\u001b[0m in \u001b[0;36m_inner_training_loop\u001b[0;34m(self, batch_size, args, resume_from_checkpoint, trial, ignore_keys_for_eval)\u001b[0m\n\u001b[1;32m 1858\u001b[0m \u001b[0mself\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0mcontrol\u001b[0m \u001b[0;34m=\u001b[0m \u001b[0mself\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0mcallback_handler\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0mon_step_end\u001b[0m\u001b[0;34m(\u001b[0m\u001b[0margs\u001b[0m\u001b[0;34m,\u001b[0m \u001b[0mself\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0mstate\u001b[0m\u001b[0;34m,\u001b[0m \u001b[0mself\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0mcontrol\u001b[0m\u001b[0;34m)\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n\u001b[1;32m 1859\u001b[0m \u001b[0;34m\u001b[0m\u001b[0m\n\u001b[0;32m-> 1860\u001b[0;31m \u001b[0mself\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0m_maybe_log_save_evaluate\u001b[0m\u001b[0;34m(\u001b[0m\u001b[0mtr_loss\u001b[0m\u001b[0;34m,\u001b[0m \u001b[0mmodel\u001b[0m\u001b[0;34m,\u001b[0m \u001b[0mtrial\u001b[0m\u001b[0;34m,\u001b[0m \u001b[0mepoch\u001b[0m\u001b[0;34m,\u001b[0m \u001b[0mignore_keys_for_eval\u001b[0m\u001b[0;34m)\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n\u001b[0m\u001b[1;32m 1861\u001b[0m \u001b[0;32melse\u001b[0m\u001b[0;34m:\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n\u001b[1;32m 1862\u001b[0m \u001b[0mself\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0mcontrol\u001b[0m \u001b[0;34m=\u001b[0m \u001b[0mself\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0mcallback_handler\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0mon_substep_end\u001b[0m\u001b[0;34m(\u001b[0m\u001b[0margs\u001b[0m\u001b[0;34m,\u001b[0m \u001b[0mself\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0mstate\u001b[0m\u001b[0;34m,\u001b[0m \u001b[0mself\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0mcontrol\u001b[0m\u001b[0;34m)\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n",
2479
+ "\u001b[0;32m~/.local/lib/python3.8/site-packages/transformers/trainer.py\u001b[0m in \u001b[0;36m_maybe_log_save_evaluate\u001b[0;34m(self, tr_loss, model, trial, epoch, ignore_keys_for_eval)\u001b[0m\n\u001b[1;32m 2125\u001b[0m \u001b[0;34m\u001b[0m\u001b[0m\n\u001b[1;32m 2126\u001b[0m \u001b[0;32mif\u001b[0m \u001b[0mself\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0mcontrol\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0mshould_save\u001b[0m\u001b[0;34m:\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n\u001b[0;32m-> 2127\u001b[0;31m \u001b[0mself\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0m_save_checkpoint\u001b[0m\u001b[0;34m(\u001b[0m\u001b[0mmodel\u001b[0m\u001b[0;34m,\u001b[0m \u001b[0mtrial\u001b[0m\u001b[0;34m,\u001b[0m \u001b[0mmetrics\u001b[0m\u001b[0;34m=\u001b[0m\u001b[0mmetrics\u001b[0m\u001b[0;34m)\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n\u001b[0m\u001b[1;32m 2128\u001b[0m \u001b[0mself\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0mcontrol\u001b[0m \u001b[0;34m=\u001b[0m \u001b[0mself\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0mcallback_handler\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0mon_save\u001b[0m\u001b[0;34m(\u001b[0m\u001b[0mself\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0margs\u001b[0m\u001b[0;34m,\u001b[0m \u001b[0mself\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0mstate\u001b[0m\u001b[0;34m,\u001b[0m \u001b[0mself\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0mcontrol\u001b[0m\u001b[0;34m)\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n\u001b[1;32m 2129\u001b[0m \u001b[0;34m\u001b[0m\u001b[0m\n",
2480
+ "\u001b[0;32m~/.local/lib/python3.8/site-packages/transformers/trainer.py\u001b[0m in \u001b[0;36m_save_checkpoint\u001b[0;34m(self, model, trial, metrics)\u001b[0m\n\u001b[1;32m 2269\u001b[0m \u001b[0;34m\u001b[0m\u001b[0m\n\u001b[1;32m 2270\u001b[0m \u001b[0;32mif\u001b[0m \u001b[0mself\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0margs\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0mpush_to_hub\u001b[0m\u001b[0;34m:\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n\u001b[0;32m-> 2271\u001b[0;31m \u001b[0mself\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0m_push_from_checkpoint\u001b[0m\u001b[0;34m(\u001b[0m\u001b[0moutput_dir\u001b[0m\u001b[0;34m)\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n\u001b[0m\u001b[1;32m 2272\u001b[0m \u001b[0;34m\u001b[0m\u001b[0m\n\u001b[1;32m 2273\u001b[0m \u001b[0;31m# Maybe delete some older checkpoints.\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n",
2481
+ "\u001b[0;32m~/.local/lib/python3.8/site-packages/transformers/trainer.py\u001b[0m in \u001b[0;36m_push_from_checkpoint\u001b[0;34m(self, checkpoint_folder)\u001b[0m\n\u001b[1;32m 3441\u001b[0m \u001b[0;32melse\u001b[0m\u001b[0;34m:\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n\u001b[1;32m 3442\u001b[0m \u001b[0mcommit_message\u001b[0m \u001b[0;34m=\u001b[0m \u001b[0;34mf\"Training in progress, epoch {int(self.state.epoch)}\"\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n\u001b[0;32m-> 3443\u001b[0;31m _, self.push_in_progress = self.repo.push_to_hub(\n\u001b[0m\u001b[1;32m 3444\u001b[0m \u001b[0mcommit_message\u001b[0m\u001b[0;34m=\u001b[0m\u001b[0mcommit_message\u001b[0m\u001b[0;34m,\u001b[0m \u001b[0mblocking\u001b[0m\u001b[0;34m=\u001b[0m\u001b[0;32mFalse\u001b[0m\u001b[0;34m,\u001b[0m \u001b[0mauto_lfs_prune\u001b[0m\u001b[0;34m=\u001b[0m\u001b[0;32mTrue\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n\u001b[1;32m 3445\u001b[0m )\n",
2482
+ "\u001b[0;32m/usr/local/lib/python3.8/dist-packages/huggingface_hub/repository.py\u001b[0m in \u001b[0;36mpush_to_hub\u001b[0;34m(self, commit_message, blocking, clean_ok, auto_lfs_prune)\u001b[0m\n\u001b[1;32m 1428\u001b[0m \u001b[0mlogger\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0minfo\u001b[0m\u001b[0;34m(\u001b[0m\u001b[0;34m\"Repo currently clean. Ignoring push_to_hub\"\u001b[0m\u001b[0;34m)\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n\u001b[1;32m 1429\u001b[0m \u001b[0;32mreturn\u001b[0m \u001b[0;32mNone\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n\u001b[0;32m-> 1430\u001b[0;31m \u001b[0mself\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0mgit_add\u001b[0m\u001b[0;34m(\u001b[0m\u001b[0mauto_lfs_track\u001b[0m\u001b[0;34m=\u001b[0m\u001b[0;32mTrue\u001b[0m\u001b[0;34m)\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n\u001b[0m\u001b[1;32m 1431\u001b[0m \u001b[0mself\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0mgit_commit\u001b[0m\u001b[0;34m(\u001b[0m\u001b[0mcommit_message\u001b[0m\u001b[0;34m)\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n\u001b[1;32m 1432\u001b[0m return self.git_push(\n",
2483
+ "\u001b[0;32m/usr/local/lib/python3.8/dist-packages/huggingface_hub/repository.py\u001b[0m in \u001b[0;36mgit_add\u001b[0;34m(self, pattern, auto_lfs_track)\u001b[0m\n\u001b[1;32m 1123\u001b[0m \u001b[0mlogger\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0minfo\u001b[0m\u001b[0;34m(\u001b[0m\u001b[0;34mf\"Adding to index:\\n{result.stdout}\\n\"\u001b[0m\u001b[0;34m)\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n\u001b[1;32m 1124\u001b[0m \u001b[0;32mexcept\u001b[0m \u001b[0msubprocess\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0mCalledProcessError\u001b[0m \u001b[0;32mas\u001b[0m \u001b[0mexc\u001b[0m\u001b[0;34m:\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n\u001b[0;32m-> 1125\u001b[0;31m \u001b[0;32mraise\u001b[0m \u001b[0mEnvironmentError\u001b[0m\u001b[0;34m(\u001b[0m\u001b[0mexc\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0mstderr\u001b[0m\u001b[0;34m)\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n\u001b[0m\u001b[1;32m 1126\u001b[0m \u001b[0;34m\u001b[0m\u001b[0m\n\u001b[1;32m 1127\u001b[0m \u001b[0;32mdef\u001b[0m \u001b[0mgit_commit\u001b[0m\u001b[0;34m(\u001b[0m\u001b[0mself\u001b[0m\u001b[0;34m,\u001b[0m \u001b[0mcommit_message\u001b[0m\u001b[0;34m:\u001b[0m \u001b[0mstr\u001b[0m \u001b[0;34m=\u001b[0m \u001b[0;34m\"commit files to HF hub\"\u001b[0m\u001b[0;34m)\u001b[0m\u001b[0;34m:\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n",
2484
+ "\u001b[0;31mOSError\u001b[0m: error: insufficient permission for adding an object to repository database .git/objects\nerror: runs/Dec12_01-10-01_150-136-33-0/events.out.tfevents.1670807412.150-136-33-0.2756654.4: failed to insert into database\nerror: unable to index file 'runs/Dec12_01-10-01_150-136-33-0/events.out.tfevents.1670807412.150-136-33-0.2756654.4'\nfatal: adding files failed\nEncountered 1 file(s) that may not have been copied correctly on Windows:\n\tcheckpoint-5000/optimizer.pt\n\nSee: `git lfs help smudge` for more details.\n"
2485
  ]
2486
  }
2487
  ],
2488
  "source": [
2489
+ "#trainer.train()\n",
2490
+ "trainer.train(resume_from_checkpoint = True)\n"
2491
  ]
2492
  },
2493
  {
 
2515
  },
2516
  {
2517
  "cell_type": "code",
2518
+ "execution_count": 33,
2519
  "id": "6dd0e310-9b07-4133-ac14-2ed2d7524e22",
2520
  "metadata": {
2521
  "id": "6dd0e310-9b07-4133-ac14-2ed2d7524e22"
 
2528
  " \"dataset\": \"Common Voice 11.0\", # a 'pretty' name for the training dataset\n",
2529
  " #\"dataset\": \"Google FLEURS\", # a 'pretty' name for the training dataset\n",
2530
  " \"language\": \"el\",\n",
2531
+ " \"model_name\": \"Whisper Medium El Greco Greek\", # a 'pretty' name for your model\n",
2532
  " \"finetuned_from\": \"openai/whisper-medium\",\n",
2533
  " \"tasks\": \"automatic-speech-recognition\",\n",
2534
  " \"tags\": \"hf-asr-leaderboard, whisper-medium, mozilla-foundation/common_voice_11_0, greek, whisper-event\",\n",
 
2547
  },
2548
  {
2549
  "cell_type": "code",
2550
+ "execution_count": 35,
2551
  "id": "95737cda-c5dd-4887-a4d0-dfcb0d61d977",
2552
  "metadata": {
2553
  "id": "95737cda-c5dd-4887-a4d0-dfcb0d61d977"
2554
  },
2555
+ "outputs": [
2556
+ {
2557
+ "name": "stderr",
2558
+ "output_type": "stream",
2559
+ "text": [
2560
+ "Saving model checkpoint to ./whisper-medium-el\n",
2561
+ "Configuration saved in ./whisper-medium-el/config.json\n",
2562
+ "Model weights saved in ./whisper-medium-el/pytorch_model.bin\n",
2563
+ "Feature extractor saved in ./whisper-medium-el/preprocessor_config.json\n",
2564
+ "tokenizer config file saved in ./whisper-medium-el/tokenizer_config.json\n",
2565
+ "Special tokens file saved in ./whisper-medium-el/special_tokens_map.json\n",
2566
+ "added tokens file saved in ./whisper-medium-el/added_tokens.json\n",
2567
+ "Dropping the following result as it does not have all the necessary fields:\n",
2568
+ "{'task': {'name': 'Automatic Speech Recognition', 'type': 'automatic-speech-recognition'}, 'dataset': {'name': 'Common Voice 11.0', 'type': 'mozilla-foundation/common_voice_11_0', 'config': 'el', 'split': 'test', 'args': 'el'}}\n",
2569
+ "To https://huggingface.co/emilios/whisper-medium-el\n",
2570
+ " 761a953..1303559 main -> main\n",
2571
+ "\n"
2572
+ ]
2573
+ }
2574
+ ],
2575
  "source": [
2576
  "trainer.push_to_hub(**kwargs)"
2577
  ]
2578
+ },
2579
+ {
2580
+ "cell_type": "code",
2581
+ "execution_count": null,
2582
+ "id": "e7a702f1-f48f-4d2a-8139-89fd67308d94",
2583
+ "metadata": {},
2584
+ "outputs": [],
2585
+ "source": []
2586
+ },
2587
+ {
2588
+ "cell_type": "code",
2589
+ "execution_count": null,
2590
+ "id": "b8327ca7-cb48-4fd3-90d6-bb5097f8f307",
2591
+ "metadata": {},
2592
+ "outputs": [],
2593
+ "source": []
2594
  }
2595
  ],
2596
  "metadata": {
e5_interleaving-cl.ipynb ADDED
The diff for this file is too large to render. See raw diff
 
eval_results.json ADDED
@@ -0,0 +1,8 @@
+ {
+ "epoch": 12.44,
+ "eval_loss": 0.38012266159057617,
+ "eval_runtime": 1110.8612,
+ "eval_samples_per_second": 1.527,
+ "eval_steps_per_second": 0.095,
+ "eval_wer": 11.199851411589897
+ }
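The throughput fields in eval_results.json can be cross-checked against the evaluation log in the notebook diff. The following is a minimal sketch, assuming the 1696 examples and the batch size of 16 reported in that log describe this same evaluation run. `format_runtime` is a hypothetical helper written for this check, not part of the Trainer API.

```python
import json
import math

# Values copied from the eval_results.json diff.
eval_results = json.loads("""
{
    "epoch": 12.44,
    "eval_loss": 0.38012266159057617,
    "eval_runtime": 1110.8612,
    "eval_samples_per_second": 1.527,
    "eval_steps_per_second": 0.095,
    "eval_wer": 11.199851411589897
}
""")

# Assumptions taken from the notebook log, not from this JSON file.
NUM_EXAMPLES = 1696   # "Reading metadata...: 1696it"
EVAL_BATCH_SIZE = 16  # "Batch size = 16"


def format_runtime(seconds: float) -> str:
    """Render a duration in seconds as H:MM:SS.ss (hypothetical helper)."""
    total_centiseconds = round(seconds * 100)
    hours, rem = divmod(total_centiseconds, 360_000)
    minutes, rem = divmod(rem, 6_000)
    secs, centis = divmod(rem, 100)
    return f"{hours}:{minutes:02d}:{secs:02d}.{centis:02d}"


runtime = eval_results["eval_runtime"]
num_batches = math.ceil(NUM_EXAMPLES / EVAL_BATCH_SIZE)
samples_per_second = round(NUM_EXAMPLES / runtime, 3)
steps_per_second = round(num_batches / runtime, 3)

print("runtime:", format_runtime(runtime))
print("batches:", num_batches)
print("samples/s:", samples_per_second, "steps/s:", steps_per_second)
```

If the assumptions hold, the recomputed rates should match the `eval_samples_per_second` and `eval_steps_per_second` values stored in the file.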
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:2ea24fb6272350e35c7f819c3836f437778f46ef44aef6816a93d4a3869b9cef
+ oid sha256:aa9c1cc306eaa444f99ef35b5e3ffb717f457ea531fd3a8d4727ffdbd4b4cccd
  size 3055754841
run_inter_1gpu.sh CHANGED
@@ -1,10 +1,16 @@
- python run_interleave.py --model_name_or_path="emilios/whisper-medium-el" \
+ python run_interleave.py \
+ --model_name_or_path="emilios/whisper-medium-el" \
+ --language="greek" \
  --dataset_name="mozilla-foundation/common_voice_11_0,google/fleurs" \
  --dataset_config_name="el,el_gr" \
- --language="greek" \
- --train_split_name="train+validation" \
- --eval_split_name="test" \
- --model_index_name="Whisper Medium El - Greek One" \
+ --train_split_name="train+validation,train+validation" \
+ --eval_split_name="test,-" \
+ --text_column_name 'sentence,transcription' \
+ --audio_column_name 'audio,audio' \
+ --task="transcribe" \
+ --streaming="False" \
+ --model_index_name="Whisper Medium El Greco" \
+ --dropout="0.1" \
  --max_steps="5000" \
  --output_dir="./" \
  --per_device_train_batch_size="32" \
@@ -33,6 +39,5 @@ python run_interleave.py --model_name_or_path="emilios/whisper-medium-el" \
  --do_eval \
  --predict_with_generate \
  --do_normalize_eval \
- --streaming \
  --use_auth_token \
  --push_to_hub
run_interleave.py CHANGED
@@ -60,6 +60,7 @@ from transformers.utils.versions import require_version
  from transformers.models.whisper.english_normalizer import BasicTextNormalizer
 
  TEXT_COL_NAME="text"
+ TEXT_COL_NAME="sentence,transcription"
  AUDIO_COL_NAME="audio"
 
  # Will error if the minimal version of Transformers is not installed. Remove at your own risks.
run_whisper-md-el-intlv-xs.sh CHANGED
@@ -1,13 +1,11 @@
- # whisper-md-el-intlv-xs
- python run_speech_recognition_seq2seq_streaming.py \
- --model_name_or_path 'openai/whisper-medium' \
+ python run_interleave.py \
+ --model_name_or_path 'emilios/whisper-medium-el' \
  --model_revision main \
  --do_train True \
  --do_eval True \
- --use_auth_token False \
  --freeze_feature_encoder False \
  --freeze_encoder False \
- --model_index_name 'whisper-md-el-intlv-xs' \
+ --model_index_name 'Whisper Medium El Greco' \
  --dataset_name 'mozilla-foundation/common_voice_11_0,google/fleurs' \
  --dataset_config_name 'el,el_gr' \
  --train_split_name 'train+validation,train+validation' \
@@ -22,10 +20,11 @@ python run_speech_recognition_seq2seq_streaming.py \
  --language greek \
  --task transcribe \
  --shuffle_buffer_size 500 \
- --output_dir './data/finetuningRuns/whisper-md-el-intlv-xs' \
+ --output_dir './' \
  --overwrite_output_dir True \
  --per_device_train_batch_size 32 \
- --gradient_accumulation_steps 2 \
+ --gradient_accumulation_steps 1 \
+ --per_device_eval_batch_size 16 \
  --learning_rate 1e-5 \
  --dropout 0.1 \
  --warmup_steps 500 \
@@ -35,7 +34,6 @@ python run_speech_recognition_seq2seq_streaming.py \
  --cache_dir '~/.cache' \
  --fp16 True \
  --evaluation_strategy steps \
- --per_device_eval_batch_size 8 \
  --predict_with_generate True \
  --generation_max_length 225 \
  --save_steps 1000 \
runs/Dec12_03-57-18_150-136-33-0/1670819499.156932/events.out.tfevents.1670819499.150-136-33-0.3405254.1 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:08d8d9887d71b5ab49ebfe74af431c20dbdf9f4008efef745317368d7cd9a003
+ size 5862
runs/Dec12_03-57-18_150-136-33-0/events.out.tfevents.1670819499.150-136-33-0.3405254.0 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:982a32db879d80cb27cd4cd9a1268b9a39ff843b045a1882012739dc4d6da91a
+ size 37595
runs/Dec12_03-57-18_150-136-33-0/events.out.tfevents.1670847754.150-136-33-0.3405254.2 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:a4e8c731d53f21751b9ee9166ca526802d8a90c459680e2c59a3e71b3f256bc0
+ size 358
train_results.json ADDED
@@ -0,0 +1,7 @@
+ {
+ "epoch": 12.44,
+ "train_loss": 0.020854657278954983,
+ "train_runtime": 27142.1177,
+ "train_samples_per_second": 5.895,
+ "train_steps_per_second": 0.184
+ }
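The training throughput in train_results.json follows from the step count and batch settings in the run scripts of this commit. The following is a rough sketch, assuming the run used `--max_steps 5000`, `--per_device_train_batch_size 32`, and `--gradient_accumulation_steps 1` on a single device. The device count is an assumption inferred from the script name run_inter_1gpu.sh, not something the results file states.

```python
import json

# Values copied from the train_results.json diff.
train_results = json.loads("""
{
    "epoch": 12.44,
    "train_loss": 0.020854657278954983,
    "train_runtime": 27142.1177,
    "train_samples_per_second": 5.895,
    "train_steps_per_second": 0.184
}
""")

# Assumptions taken from the run scripts in this commit.
MAX_STEPS = 5000
PER_DEVICE_TRAIN_BATCH_SIZE = 32
GRADIENT_ACCUMULATION_STEPS = 1
NUM_DEVICES = 1  # assumption: single-GPU run

# Total samples processed over the whole run.
samples_seen = (
    MAX_STEPS
    * PER_DEVICE_TRAIN_BATCH_SIZE
    * GRADIENT_ACCUMULATION_STEPS
    * NUM_DEVICES
)

runtime = train_results["train_runtime"]
steps_per_second = round(MAX_STEPS / runtime, 3)
samples_per_second = round(samples_seen / runtime, 3)

print("samples seen:", samples_seen)
print("runtime (hours):", round(runtime / 3600, 2))
print("steps/s:", steps_per_second, "samples/s:", samples_per_second)
```

Under these assumptions the recomputed rates should agree with the `train_steps_per_second` and `train_samples_per_second` fields, which suggests the logged figures cover the full 5000 steps.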
trainer_state.json ADDED
@@ -0,0 +1,1270 @@
1
+ {
2
+ "best_metric": 11.199851411589897,
3
+ "best_model_checkpoint": "./checkpoint-5000",
4
+ "epoch": 12.437810945273633,
5
+ "global_step": 5000,
6
+ "is_hyper_param_search": false,
7
+ "is_local_process_zero": true,
8
+ "is_world_process_zero": true,
9
+ "log_history": [
10
+ {
11
+ "epoch": 0.06,
12
+ "learning_rate": 4.800000000000001e-07,
13
+ "loss": 1.0869,
14
+ "step": 25
15
+ },
16
+ {
17
+ "epoch": 0.12,
18
+ "learning_rate": 9.800000000000001e-07,
19
+ "loss": 0.588,
20
+ "step": 50
21
+ },
22
+ {
23
+ "epoch": 0.19,
24
+ "learning_rate": 1.48e-06,
25
+ "loss": 0.2266,
26
+ "step": 75
27
+ },
28
+ {
29
+ "epoch": 0.25,
30
+ "learning_rate": 1.98e-06,
31
+ "loss": 0.1899,
32
+ "step": 100
33
+ },
34
+ {
35
+ "epoch": 0.31,
36
+ "learning_rate": 2.4800000000000004e-06,
37
+ "loss": 0.1476,
38
+ "step": 125
39
+ },
40
+ {
41
+ "epoch": 0.37,
42
+ "learning_rate": 2.9800000000000003e-06,
43
+ "loss": 0.111,
44
+ "step": 150
45
+ },
46
+ {
47
+ "epoch": 0.44,
48
+ "learning_rate": 3.48e-06,
49
+ "loss": 0.0958,
50
+ "step": 175
51
+ },
52
+ {
53
+ "epoch": 0.5,
54
+ "learning_rate": 3.980000000000001e-06,
55
+ "loss": 0.0904,
56
+ "step": 200
57
+ },
58
+ {
59
+ "epoch": 0.56,
60
+ "learning_rate": 4.48e-06,
61
+ "loss": 0.0783,
62
+ "step": 225
63
+ },
64
+ {
65
+ "epoch": 0.62,
66
+ "learning_rate": 4.980000000000001e-06,
67
+ "loss": 0.0676,
68
+ "step": 250
69
+ },
70
+ {
71
+ "epoch": 0.68,
72
+ "learning_rate": 5.480000000000001e-06,
73
+ "loss": 0.0612,
74
+ "step": 275
75
+ },
76
+ {
77
+ "epoch": 0.75,
78
+ "learning_rate": 5.98e-06,
79
+ "loss": 0.0503,
80
+ "step": 300
81
+ },
82
+ {
83
+ "epoch": 0.81,
84
+ "learning_rate": 6.480000000000001e-06,
85
+ "loss": 0.0523,
86
+ "step": 325
87
+ },
88
+ {
89
+ "epoch": 0.87,
90
+ "learning_rate": 6.98e-06,
91
+ "loss": 0.045,
92
+ "step": 350
93
+ },
94
+ {
95
+ "epoch": 0.93,
96
+ "learning_rate": 7.48e-06,
97
+ "loss": 0.0395,
98
+ "step": 375
99
+ },
100
+ {
101
+ "epoch": 1.0,
102
+ "learning_rate": 7.980000000000002e-06,
103
+ "loss": 0.0454,
104
+ "step": 400
105
+ },
106
+ {
107
+ "epoch": 1.06,
108
+ "learning_rate": 8.48e-06,
109
+ "loss": 0.0336,
110
+ "step": 425
111
+ },
112
+ {
113
+ "epoch": 1.12,
114
+ "learning_rate": 8.98e-06,
115
+ "loss": 0.0325,
116
+ "step": 450
117
+ },
118
+ {
119
+ "epoch": 1.18,
120
+ "learning_rate": 9.48e-06,
121
+ "loss": 0.0315,
122
+ "step": 475
123
+ },
124
+ {
125
+ "epoch": 1.24,
126
+ "learning_rate": 9.980000000000001e-06,
127
+ "loss": 0.0326,
128
+ "step": 500
129
+ },
130
+ {
131
+ "epoch": 1.31,
132
+ "learning_rate": 9.946666666666667e-06,
133
+ "loss": 0.0326,
134
+ "step": 525
135
+ },
136
+ {
137
+ "epoch": 1.37,
138
+ "learning_rate": 9.891111111111113e-06,
139
+ "loss": 0.0278,
140
+ "step": 550
141
+ },
142
+ {
143
+ "epoch": 1.43,
144
+ "learning_rate": 9.835555555555556e-06,
145
+ "loss": 0.0292,
146
+ "step": 575
147
+ },
148
+ {
149
+ "epoch": 1.49,
150
+ "learning_rate": 9.780000000000001e-06,
151
+ "loss": 0.0284,
152
+ "step": 600
153
+ },
154
+ {
155
+ "epoch": 1.55,
156
+ "learning_rate": 9.724444444444445e-06,
157
+ "loss": 0.0336,
158
+ "step": 625
159
+ },
160
+ {
161
+ "epoch": 1.62,
162
+ "learning_rate": 9.66888888888889e-06,
163
+ "loss": 0.0291,
164
+ "step": 650
165
+ },
166
+ {
167
+ "epoch": 1.68,
168
+ "learning_rate": 9.613333333333335e-06,
169
+ "loss": 0.0274,
170
+ "step": 675
171
+ },
172
+ {
173
+ "epoch": 1.74,
174
+ "learning_rate": 9.557777777777777e-06,
175
+ "loss": 0.024,
176
+ "step": 700
177
+ },
178
+ {
179
+ "epoch": 1.8,
180
+ "learning_rate": 9.502222222222223e-06,
181
+ "loss": 0.0254,
182
+ "step": 725
183
+ },
184
+ {
185
+ "epoch": 1.87,
186
+ "learning_rate": 9.446666666666667e-06,
187
+ "loss": 0.0201,
188
+ "step": 750
189
+ },
190
+ {
191
+ "epoch": 1.93,
192
+ "learning_rate": 9.391111111111111e-06,
193
+ "loss": 0.0261,
194
+ "step": 775
195
+ },
196
+ {
197
+ "epoch": 1.99,
198
+ "learning_rate": 9.335555555555557e-06,
199
+ "loss": 0.0235,
200
+ "step": 800
201
+ },
202
+ {
203
+ "epoch": 2.05,
204
+ "learning_rate": 9.280000000000001e-06,
205
+ "loss": 0.0165,
206
+ "step": 825
207
+ },
208
+ {
209
+ "epoch": 2.11,
210
+ "learning_rate": 9.224444444444445e-06,
211
+ "loss": 0.0148,
212
+ "step": 850
213
+ },
214
+ {
215
+ "epoch": 2.18,
216
+ "learning_rate": 9.168888888888889e-06,
217
+ "loss": 0.0161,
218
+ "step": 875
219
+ },
220
+ {
221
+ "epoch": 2.24,
222
+ "learning_rate": 9.113333333333335e-06,
223
+ "loss": 0.0144,
224
+ "step": 900
225
+ },
226
+ {
227
+ "epoch": 2.3,
228
+ "learning_rate": 9.057777777777779e-06,
229
+ "loss": 0.016,
230
+ "step": 925
231
+ },
232
+ {
233
+ "epoch": 2.36,
234
+ "learning_rate": 9.002222222222223e-06,
235
+ "loss": 0.0151,
236
+ "step": 950
237
+ },
238
+ {
239
+ "epoch": 2.43,
240
+ "learning_rate": 8.946666666666669e-06,
241
+ "loss": 0.0136,
242
+ "step": 975
243
+ },
244
+ {
245
+ "epoch": 2.49,
246
+ "learning_rate": 8.891111111111111e-06,
247
+ "loss": 0.0176,
248
+ "step": 1000
249
+ },
250
+ {
251
+ "epoch": 2.49,
252
+ "eval_loss": 0.2945367693901062,
253
+ "eval_runtime": 1114.0569,
254
+ "eval_samples_per_second": 1.522,
255
+ "eval_steps_per_second": 0.095,
256
+ "eval_wer": 12.611441307578009,
257
+ "step": 1000
258
+ },
259
+ {
260
+ "epoch": 2.55,
261
+ "learning_rate": 8.835555555555557e-06,
262
+ "loss": 0.0127,
263
+ "step": 1025
264
+ },
265
+ {
266
+ "epoch": 2.61,
267
+ "learning_rate": 8.78e-06,
268
+ "loss": 0.015,
269
+ "step": 1050
270
+ },
271
+ {
272
+ "epoch": 2.67,
273
+ "learning_rate": 8.724444444444445e-06,
274
+ "loss": 0.0134,
275
+ "step": 1075
276
+ },
277
+ {
278
+ "epoch": 2.74,
279
+ "learning_rate": 8.66888888888889e-06,
280
+ "loss": 0.0128,
281
+ "step": 1100
282
+ },
283
+ {
284
+ "epoch": 2.8,
285
+ "learning_rate": 8.613333333333333e-06,
286
+ "loss": 0.0132,
287
+ "step": 1125
288
+ },
289
+ {
290
+ "epoch": 2.86,
291
+ "learning_rate": 8.557777777777778e-06,
292
+ "loss": 0.0121,
293
+ "step": 1150
294
+ },
295
+ {
296
+ "epoch": 2.92,
297
+ "learning_rate": 8.502222222222223e-06,
298
+ "loss": 0.0121,
299
+ "step": 1175
300
+ },
301
+ {
302
+ "epoch": 2.99,
303
+ "learning_rate": 8.446666666666668e-06,
304
+ "loss": 0.012,
305
+ "step": 1200
306
+ },
307
+ {
308
+ "epoch": 3.05,
309
+ "learning_rate": 8.391111111111112e-06,
310
+ "loss": 0.0098,
311
+ "step": 1225
312
+ },
313
+ {
314
+ "epoch": 3.11,
315
+ "learning_rate": 8.335555555555556e-06,
316
+ "loss": 0.0092,
317
+ "step": 1250
318
+ },
319
+ {
320
+ "epoch": 3.17,
321
+ "learning_rate": 8.28e-06,
322
+ "loss": 0.0108,
323
+ "step": 1275
324
+ },
325
+ {
326
+ "epoch": 3.23,
327
+ "learning_rate": 8.224444444444444e-06,
328
+ "loss": 0.0083,
329
+ "step": 1300
330
+ },
331
+ {
332
+ "epoch": 3.3,
333
+ "learning_rate": 8.16888888888889e-06,
334
+ "loss": 0.0084,
335
+ "step": 1325
336
+ },
337
+ {
338
+ "epoch": 3.36,
339
+ "learning_rate": 8.113333333333334e-06,
340
+ "loss": 0.0087,
341
+ "step": 1350
342
+ },
343
+ {
344
+ "epoch": 3.42,
345
+ "learning_rate": 8.057777777777778e-06,
346
+ "loss": 0.0086,
347
+ "step": 1375
348
+ },
349
+ {
350
+ "epoch": 3.48,
351
+ "learning_rate": 8.002222222222222e-06,
352
+ "loss": 0.0087,
353
+ "step": 1400
354
+ },
355
+ {
356
+ "epoch": 3.54,
357
+ "learning_rate": 7.946666666666666e-06,
358
+ "loss": 0.009,
359
+ "step": 1425
360
+ },
361
+ {
362
+ "epoch": 3.61,
363
+ "learning_rate": 7.891111111111112e-06,
364
+ "loss": 0.0099,
365
+ "step": 1450
366
+ },
367
+ {
368
+ "epoch": 3.67,
369
+ "learning_rate": 7.835555555555556e-06,
370
+ "loss": 0.0079,
371
+ "step": 1475
372
+ },
373
+ {
374
+ "epoch": 3.73,
375
+ "learning_rate": 7.78e-06,
376
+ "loss": 0.0083,
377
+ "step": 1500
378
+ },
379
+ {
380
+ "epoch": 3.79,
381
+ "learning_rate": 7.724444444444446e-06,
382
+ "loss": 0.0067,
383
+ "step": 1525
384
+ },
385
+ {
386
+ "epoch": 3.86,
387
+ "learning_rate": 7.66888888888889e-06,
388
+ "loss": 0.0083,
389
+ "step": 1550
390
+ },
391
+ {
392
+ "epoch": 3.92,
393
+ "learning_rate": 7.613333333333334e-06,
394
+ "loss": 0.0095,
395
+ "step": 1575
396
+ },
397
+ {
398
+ "epoch": 3.98,
399
+ "learning_rate": 7.557777777777779e-06,
400
+ "loss": 0.008,
401
+ "step": 1600
402
+ },
403
+ {
404
+ "epoch": 4.04,
405
+ "learning_rate": 7.502222222222223e-06,
406
+ "loss": 0.0066,
407
+ "step": 1625
408
+ },
409
+ {
410
+ "epoch": 4.1,
411
+ "learning_rate": 7.446666666666668e-06,
412
+ "loss": 0.0065,
413
+ "step": 1650
414
+ },
415
+ {
416
+ "epoch": 4.17,
417
+ "learning_rate": 7.3911111111111125e-06,
418
+ "loss": 0.0063,
419
+ "step": 1675
420
+ },
421
+ {
422
+ "epoch": 4.23,
423
+ "learning_rate": 7.335555555555556e-06,
424
+ "loss": 0.0063,
425
+ "step": 1700
426
+ },
427
+ {
428
+ "epoch": 4.29,
429
+ "learning_rate": 7.280000000000001e-06,
430
+ "loss": 0.0057,
431
+ "step": 1725
432
+ },
433
+ {
434
+ "epoch": 4.35,
435
+ "learning_rate": 7.224444444444445e-06,
436
+ "loss": 0.0067,
437
+ "step": 1750
438
+ },
439
+ {
440
+ "epoch": 4.42,
441
+ "learning_rate": 7.1688888888888895e-06,
442
+ "loss": 0.0052,
443
+ "step": 1775
444
+ },
445
+ {
446
+ "epoch": 4.48,
447
+ "learning_rate": 7.113333333333334e-06,
448
+ "loss": 0.0069,
449
+ "step": 1800
450
+ },
451
+ {
452
+ "epoch": 4.54,
453
+ "learning_rate": 7.057777777777778e-06,
454
+ "loss": 0.0063,
455
+ "step": 1825
456
+ },
457
+ {
458
+ "epoch": 4.6,
459
+ "learning_rate": 7.0022222222222225e-06,
460
+ "loss": 0.0059,
461
+ "step": 1850
462
+ },
463
+ {
464
+ "epoch": 4.66,
465
+ "learning_rate": 6.946666666666667e-06,
466
+ "loss": 0.0053,
467
+ "step": 1875
468
+ },
469
+ {
470
+ "epoch": 4.73,
471
+ "learning_rate": 6.891111111111111e-06,
472
+ "loss": 0.0059,
473
+ "step": 1900
474
+ },
475
+ {
476
+ "epoch": 4.79,
477
+ "learning_rate": 6.835555555555556e-06,
478
+ "loss": 0.0063,
479
+ "step": 1925
480
+ },
481
+ {
482
+ "epoch": 4.85,
483
+ "learning_rate": 6.780000000000001e-06,
484
+ "loss": 0.0057,
485
+ "step": 1950
486
+ },
487
+ {
488
+ "epoch": 4.91,
489
+ "learning_rate": 6.724444444444444e-06,
490
+ "loss": 0.0056,
491
+ "step": 1975
492
+ },
493
+ {
494
+ "epoch": 4.98,
495
+ "learning_rate": 6.668888888888889e-06,
496
+ "loss": 0.0064,
497
+ "step": 2000
498
+ },
499
+ {
500
+ "epoch": 4.98,
501
+ "eval_loss": 0.34234580397605896,
502
+ "eval_runtime": 1111.8202,
503
+ "eval_samples_per_second": 1.525,
504
+ "eval_steps_per_second": 0.095,
505
+ "eval_wer": 12.230683506686479,
506
+ "step": 2000
507
+ },
508
+ {
509
+ "epoch": 5.04,
510
+ "learning_rate": 6.613333333333334e-06,
511
+ "loss": 0.0039,
512
+ "step": 2025
513
+ },
514
+ {
515
+ "epoch": 5.1,
516
+ "learning_rate": 6.557777777777778e-06,
517
+ "loss": 0.0036,
518
+ "step": 2050
519
+ },
520
+ {
521
+ "epoch": 5.16,
522
+ "learning_rate": 6.502222222222223e-06,
523
+ "loss": 0.0036,
524
+ "step": 2075
525
+ },
526
+ {
527
+ "epoch": 5.22,
528
+ "learning_rate": 6.446666666666668e-06,
529
+ "loss": 0.0039,
530
+ "step": 2100
531
+ },
532
+ {
533
+ "epoch": 5.29,
534
+ "learning_rate": 6.391111111111111e-06,
535
+ "loss": 0.004,
536
+ "step": 2125
537
+ },
538
+ {
539
+ "epoch": 5.35,
540
+ "learning_rate": 6.335555555555556e-06,
541
+ "loss": 0.0041,
542
+ "step": 2150
543
+ },
544
+ {
545
+ "epoch": 5.41,
546
+ "learning_rate": 6.280000000000001e-06,
547
+ "loss": 0.0046,
548
+ "step": 2175
549
+ },
550
+ {
551
+ "epoch": 5.47,
552
+ "learning_rate": 6.224444444444445e-06,
553
+ "loss": 0.0058,
554
+ "step": 2200
555
+ },
556
+ {
557
+ "epoch": 5.53,
558
+ "learning_rate": 6.16888888888889e-06,
559
+ "loss": 0.0043,
560
+ "step": 2225
561
+ },
562
+ {
563
+ "epoch": 5.6,
564
+ "learning_rate": 6.113333333333333e-06,
565
+ "loss": 0.0042,
566
+ "step": 2250
567
+ },
568
+ {
569
+ "epoch": 5.66,
570
+ "learning_rate": 6.057777777777778e-06,
571
+ "loss": 0.0045,
572
+ "step": 2275
573
+ },
574
+ {
575
+ "epoch": 5.72,
576
+ "learning_rate": 6.002222222222223e-06,
577
+ "loss": 0.0041,
578
+ "step": 2300
579
+ },
580
+ {
581
+ "epoch": 5.78,
582
+ "learning_rate": 5.946666666666668e-06,
583
+ "loss": 0.004,
584
+ "step": 2325
585
+ },
586
+ {
587
+ "epoch": 5.85,
588
+ "learning_rate": 5.891111111111112e-06,
589
+ "loss": 0.004,
590
+ "step": 2350
591
+ },
592
+ {
593
+ "epoch": 5.91,
594
+ "learning_rate": 5.8355555555555565e-06,
595
+ "loss": 0.0043,
596
+ "step": 2375
597
+ },
598
+ {
599
+ "epoch": 5.97,
600
+ "learning_rate": 5.78e-06,
601
+ "loss": 0.004,
602
+ "step": 2400
603
+ },
604
+ {
605
+ "epoch": 6.03,
606
+ "learning_rate": 5.724444444444445e-06,
607
+ "loss": 0.004,
608
+ "step": 2425
609
+ },
610
+ {
611
+ "epoch": 6.09,
612
+ "learning_rate": 5.6688888888888895e-06,
613
+ "loss": 0.0045,
614
+ "step": 2450
615
+ },
616
+ {
617
+ "epoch": 6.16,
618
+ "learning_rate": 5.613333333333334e-06,
619
+ "loss": 0.0049,
620
+ "step": 2475
621
+ },
622
+ {
623
+ "epoch": 6.22,
624
+ "learning_rate": 5.557777777777778e-06,
625
+ "loss": 0.0038,
626
+ "step": 2500
627
+ },
628
+ {
629
+ "epoch": 6.28,
630
+ "learning_rate": 5.5022222222222224e-06,
631
+ "loss": 0.0047,
632
+ "step": 2525
633
+ },
634
+ {
635
+ "epoch": 6.34,
636
+ "learning_rate": 5.4466666666666665e-06,
637
+ "loss": 0.0035,
638
+ "step": 2550
639
+ },
640
+ {
641
+ "epoch": 6.41,
642
+ "learning_rate": 5.391111111111111e-06,
643
+ "loss": 0.0032,
644
+ "step": 2575
645
+ },
646
+ {
647
+ "epoch": 6.47,
648
+ "learning_rate": 5.335555555555556e-06,
649
+ "loss": 0.0032,
650
+ "step": 2600
651
+ },
652
+ {
653
+ "epoch": 6.53,
654
+ "learning_rate": 5.28e-06,
655
+ "loss": 0.0032,
656
+ "step": 2625
657
+ },
658
+ {
659
+ "epoch": 6.59,
660
+ "learning_rate": 5.224444444444445e-06,
661
+ "loss": 0.0036,
662
+ "step": 2650
663
+ },
664
+ {
665
+ "epoch": 6.65,
666
+ "learning_rate": 5.168888888888889e-06,
667
+ "loss": 0.0035,
668
+ "step": 2675
669
+ },
670
+ {
671
+ "epoch": 6.72,
672
+ "learning_rate": 5.113333333333333e-06,
673
+ "loss": 0.0041,
674
+ "step": 2700
675
+ },
676
+ {
677
+ "epoch": 6.78,
678
+ "learning_rate": 5.057777777777778e-06,
679
+ "loss": 0.0035,
680
+ "step": 2725
681
+ },
682
+ {
683
+ "epoch": 6.84,
684
+ "learning_rate": 5.002222222222223e-06,
685
+ "loss": 0.0034,
686
+ "step": 2750
687
+ },
688
+ {
689
+ "epoch": 6.9,
690
+ "learning_rate": 4.946666666666667e-06,
691
+ "loss": 0.0039,
692
+ "step": 2775
693
+ },
694
+ {
695
+ "epoch": 6.97,
696
+ "learning_rate": 4.891111111111111e-06,
697
+ "loss": 0.0034,
698
+ "step": 2800
699
+ },
700
+ {
701
+ "epoch": 7.03,
702
+ "learning_rate": 4.835555555555556e-06,
703
+ "loss": 0.0035,
704
+ "step": 2825
705
+ },
706
+ {
707
+ "epoch": 7.09,
708
+ "learning_rate": 4.78e-06,
709
+ "loss": 0.0037,
710
+ "step": 2850
711
+ },
712
+ {
713
+ "epoch": 7.15,
714
+ "learning_rate": 4.724444444444445e-06,
715
+ "loss": 0.0026,
716
+ "step": 2875
717
+ },
718
+ {
719
+ "epoch": 7.21,
720
+ "learning_rate": 4.66888888888889e-06,
721
+ "loss": 0.0026,
722
+ "step": 2900
723
+ },
724
+ {
725
+ "epoch": 7.28,
726
+ "learning_rate": 4.613333333333334e-06,
727
+ "loss": 0.0034,
728
+ "step": 2925
729
+ },
730
+ {
731
+ "epoch": 7.34,
732
+ "learning_rate": 4.557777777777778e-06,
733
+ "loss": 0.0025,
734
+ "step": 2950
735
+ },
736
+ {
737
+ "epoch": 7.4,
738
+ "learning_rate": 4.502222222222223e-06,
739
+ "loss": 0.0023,
740
+ "step": 2975
741
+ },
742
+ {
743
+ "epoch": 7.46,
744
+ "learning_rate": 4.446666666666667e-06,
745
+ "loss": 0.0022,
746
+ "step": 3000
747
+ },
748
+ {
749
+ "epoch": 7.46,
750
+ "eval_loss": 0.3631705045700073,
751
+ "eval_runtime": 1107.3108,
752
+ "eval_samples_per_second": 1.532,
753
+ "eval_steps_per_second": 0.096,
754
+ "eval_wer": 11.589895988112927,
755
+ "step": 3000
756
+ },
757
+ {
758
+ "epoch": 7.52,
759
+ "learning_rate": 4.391111111111112e-06,
760
+ "loss": 0.0026,
761
+ "step": 3025
762
+ },
763
+ {
764
+ "epoch": 7.59,
765
+ "learning_rate": 4.3355555555555565e-06,
766
+ "loss": 0.0025,
767
+ "step": 3050
768
+ },
769
+ {
770
+ "epoch": 7.65,
771
+ "learning_rate": 4.2800000000000005e-06,
772
+ "loss": 0.0023,
773
+ "step": 3075
774
+ },
775
+ {
776
+ "epoch": 7.71,
777
+ "learning_rate": 4.2244444444444446e-06,
778
+ "loss": 0.0023,
779
+ "step": 3100
780
+ },
781
+ {
782
+ "epoch": 7.77,
783
+ "learning_rate": 4.168888888888889e-06,
784
+ "loss": 0.0023,
785
+ "step": 3125
786
+ },
787
+ {
788
+ "epoch": 7.84,
789
+ "learning_rate": 4.1133333333333335e-06,
790
+ "loss": 0.0027,
791
+ "step": 3150
792
+ },
793
+ {
794
+ "epoch": 7.9,
795
+ "learning_rate": 4.057777777777778e-06,
796
+ "loss": 0.0022,
797
+ "step": 3175
798
+ },
799
+ {
800
+ "epoch": 7.96,
801
+ "learning_rate": 4.002222222222222e-06,
802
+ "loss": 0.0025,
803
+ "step": 3200
804
+ },
805
+ {
806
+ "epoch": 8.02,
807
+ "learning_rate": 3.946666666666667e-06,
808
+ "loss": 0.0017,
809
+ "step": 3225
810
+ },
811
+ {
812
+ "epoch": 8.08,
813
+ "learning_rate": 3.891111111111111e-06,
814
+ "loss": 0.0016,
815
+ "step": 3250
816
+ },
817
+ {
818
+ "epoch": 8.15,
819
+ "learning_rate": 3.835555555555555e-06,
820
+ "loss": 0.0018,
821
+ "step": 3275
822
+ },
823
+ {
824
+ "epoch": 8.21,
825
+ "learning_rate": 3.7800000000000002e-06,
826
+ "loss": 0.0022,
827
+ "step": 3300
828
+ },
829
+ {
830
+ "epoch": 8.27,
831
+ "learning_rate": 3.724444444444445e-06,
832
+ "loss": 0.002,
833
+ "step": 3325
834
+ },
835
+ {
836
+ "epoch": 8.33,
837
+ "learning_rate": 3.668888888888889e-06,
838
+ "loss": 0.0018,
839
+ "step": 3350
840
+ },
841
+ {
842
+ "epoch": 8.4,
843
+ "learning_rate": 3.6133333333333336e-06,
844
+ "loss": 0.0015,
845
+ "step": 3375
846
+ },
847
+ {
848
+ "epoch": 8.46,
849
+ "learning_rate": 3.5577777777777785e-06,
850
+ "loss": 0.0022,
851
+ "step": 3400
852
+ },
853
+ {
854
+ "epoch": 8.52,
855
+ "learning_rate": 3.5022222222222225e-06,
856
+ "loss": 0.0022,
857
+ "step": 3425
858
+ },
859
+ {
860
+ "epoch": 8.58,
861
+ "learning_rate": 3.446666666666667e-06,
862
+ "loss": 0.0021,
863
+ "step": 3450
864
+ },
865
+ {
866
+ "epoch": 8.64,
867
+ "learning_rate": 3.391111111111111e-06,
868
+ "loss": 0.0022,
869
+ "step": 3475
870
+ },
871
+ {
872
+ "epoch": 8.71,
873
+ "learning_rate": 3.335555555555556e-06,
874
+ "loss": 0.0018,
875
+ "step": 3500
876
+ },
877
+ {
878
+ "epoch": 8.77,
879
+ "learning_rate": 3.2800000000000004e-06,
880
+ "loss": 0.002,
881
+ "step": 3525
882
+ },
883
+ {
884
+ "epoch": 8.83,
885
+ "learning_rate": 3.2244444444444444e-06,
886
+ "loss": 0.0025,
887
+ "step": 3550
888
+ },
889
+ {
890
+ "epoch": 8.89,
891
+ "learning_rate": 3.1688888888888893e-06,
892
+ "loss": 0.0017,
893
+ "step": 3575
894
+ },
895
+ {
896
+ "epoch": 8.96,
897
+ "learning_rate": 3.1133333333333337e-06,
898
+ "loss": 0.002,
899
+ "step": 3600
900
+ },
901
+ {
902
+ "epoch": 9.02,
903
+ "learning_rate": 3.0577777777777778e-06,
904
+ "loss": 0.0016,
905
+ "step": 3625
906
+ },
907
+ {
908
+ "epoch": 9.08,
909
+ "learning_rate": 3.0022222222222227e-06,
910
+ "loss": 0.0015,
911
+ "step": 3650
912
+ },
913
+ {
914
+ "epoch": 9.14,
915
+ "learning_rate": 2.946666666666667e-06,
916
+ "loss": 0.0019,
917
+ "step": 3675
918
+ },
919
+ {
920
+ "epoch": 9.2,
921
+ "learning_rate": 2.891111111111111e-06,
922
+ "loss": 0.0016,
923
+ "step": 3700
924
+ },
925
+ {
926
+ "epoch": 9.27,
927
+ "learning_rate": 2.835555555555556e-06,
928
+ "loss": 0.0016,
929
+ "step": 3725
930
+ },
931
+ {
932
+ "epoch": 9.33,
933
+ "learning_rate": 2.7800000000000005e-06,
934
+ "loss": 0.0018,
935
+ "step": 3750
936
+ },
937
+ {
938
+ "epoch": 9.39,
939
+ "learning_rate": 2.7244444444444445e-06,
940
+ "loss": 0.0014,
941
+ "step": 3775
942
+ },
943
+ {
944
+ "epoch": 9.45,
945
+ "learning_rate": 2.6688888888888894e-06,
946
+ "loss": 0.0013,
947
+ "step": 3800
948
+ },
949
+ {
950
+ "epoch": 9.51,
951
+ "learning_rate": 2.6133333333333334e-06,
952
+ "loss": 0.0015,
953
+ "step": 3825
954
+ },
955
+ {
956
+ "epoch": 9.58,
957
+ "learning_rate": 2.557777777777778e-06,
958
+ "loss": 0.0018,
959
+ "step": 3850
960
+ },
961
+ {
962
+ "epoch": 9.64,
963
+ "learning_rate": 2.5022222222222224e-06,
964
+ "loss": 0.0013,
965
+ "step": 3875
966
+ },
967
+ {
968
+ "epoch": 9.7,
969
+ "learning_rate": 2.446666666666667e-06,
970
+ "loss": 0.0015,
971
+ "step": 3900
972
+ },
973
+ {
974
+ "epoch": 9.76,
975
+ "learning_rate": 2.3911111111111113e-06,
976
+ "loss": 0.0013,
977
+ "step": 3925
978
+ },
979
+ {
980
+ "epoch": 9.83,
981
+ "learning_rate": 2.3355555555555557e-06,
982
+ "loss": 0.0014,
983
+ "step": 3950
984
+ },
985
+ {
986
+ "epoch": 9.89,
987
+ "learning_rate": 2.28e-06,
988
+ "loss": 0.0015,
989
+ "step": 3975
990
+ },
991
+ {
992
+ "epoch": 9.95,
993
+ "learning_rate": 2.2244444444444447e-06,
994
+ "loss": 0.0014,
995
+ "step": 4000
996
+ },
997
+ {
998
+ "epoch": 9.95,
999
+ "eval_loss": 0.37881383299827576,
1000
+ "eval_runtime": 1112.8569,
1001
+ "eval_samples_per_second": 1.524,
1002
+ "eval_steps_per_second": 0.095,
1003
+ "eval_wer": 11.255572065378901,
1004
+ "step": 4000
1005
+ },
1006
+ {
1007
+ "epoch": 10.01,
1008
+ "learning_rate": 2.168888888888889e-06,
1009
+ "loss": 0.0016,
1010
+ "step": 4025
1011
+ },
1012
+ {
1013
+ "epoch": 10.07,
1014
+ "learning_rate": 2.1133333333333336e-06,
1015
+ "loss": 0.0013,
1016
+ "step": 4050
1017
+ },
1018
+ {
1019
+ "epoch": 10.14,
1020
+ "learning_rate": 2.057777777777778e-06,
1021
+ "loss": 0.0014,
1022
+ "step": 4075
1023
+ },
1024
+ {
1025
+ "epoch": 10.2,
1026
+ "learning_rate": 2.0022222222222225e-06,
1027
+ "loss": 0.0013,
1028
+ "step": 4100
1029
+ },
1030
+ {
1031
+ "epoch": 10.26,
1032
+ "learning_rate": 1.9466666666666665e-06,
1033
+ "loss": 0.0011,
1034
+ "step": 4125
1035
+ },
1036
+ {
1037
+ "epoch": 10.32,
1038
+ "learning_rate": 1.8911111111111114e-06,
1039
+ "loss": 0.0012,
1040
+ "step": 4150
1041
+ },
1042
+ {
1043
+ "epoch": 10.39,
1044
+ "learning_rate": 1.8355555555555557e-06,
1045
+ "loss": 0.0016,
1046
+ "step": 4175
1047
+ },
1048
+ {
1049
+ "epoch": 10.45,
1050
+ "learning_rate": 1.7800000000000001e-06,
1051
+ "loss": 0.0014,
1052
+ "step": 4200
1053
+ },
1054
+ {
1055
+ "epoch": 10.51,
1056
+ "learning_rate": 1.7244444444444448e-06,
1057
+ "loss": 0.0015,
1058
+ "step": 4225
1059
+ },
1060
+ {
1061
+ "epoch": 10.57,
1062
+ "learning_rate": 1.668888888888889e-06,
1063
+ "loss": 0.0012,
1064
+ "step": 4250
1065
+ },
1066
+ {
1067
+ "epoch": 10.63,
1068
+ "learning_rate": 1.6133333333333335e-06,
1069
+ "loss": 0.0013,
1070
+ "step": 4275
1071
+ },
1072
+ {
1073
+ "epoch": 10.7,
1074
+ "learning_rate": 1.5577777777777777e-06,
1075
+ "loss": 0.0013,
1076
+ "step": 4300
1077
+ },
1078
+ {
1079
+ "epoch": 10.76,
1080
+ "learning_rate": 1.5022222222222224e-06,
1081
+ "loss": 0.0013,
1082
+ "step": 4325
1083
+ },
1084
+ {
1085
+ "epoch": 10.82,
1086
+ "learning_rate": 1.4466666666666669e-06,
1087
+ "loss": 0.0013,
1088
+ "step": 4350
1089
+ },
1090
+ {
1091
+ "epoch": 10.88,
1092
+ "learning_rate": 1.3911111111111111e-06,
1093
+ "loss": 0.0012,
1094
+ "step": 4375
1095
+ },
1096
+ {
1097
+ "epoch": 10.95,
1098
+ "learning_rate": 1.3355555555555558e-06,
1099
+ "loss": 0.0014,
1100
+ "step": 4400
1101
+ },
1102
+ {
1103
+ "epoch": 11.01,
1104
+ "learning_rate": 1.28e-06,
1105
+ "loss": 0.0014,
1106
+ "step": 4425
1107
+ },
1108
+ {
1109
+ "epoch": 11.07,
1110
+ "learning_rate": 1.2244444444444445e-06,
1111
+ "loss": 0.0014,
1112
+ "step": 4450
1113
+ },
1114
+ {
1115
+ "epoch": 11.13,
1116
+ "learning_rate": 1.168888888888889e-06,
1117
+ "loss": 0.001,
1118
+ "step": 4475
1119
+ },
1120
+ {
1121
+ "epoch": 11.19,
1122
+ "learning_rate": 1.1133333333333334e-06,
1123
+ "loss": 0.0008,
1124
+ "step": 4500
1125
+ },
1126
+ {
1127
+ "epoch": 11.26,
1128
+ "learning_rate": 1.0577777777777779e-06,
1129
+ "loss": 0.0014,
1130
+ "step": 4525
1131
+ },
1132
+ {
1133
+ "epoch": 11.32,
1134
+ "learning_rate": 1.0022222222222223e-06,
1135
+ "loss": 0.0012,
1136
+ "step": 4550
1137
+ },
1138
+ {
1139
+ "epoch": 11.38,
1140
+ "learning_rate": 9.466666666666667e-07,
1141
+ "loss": 0.0011,
1142
+ "step": 4575
1143
+ },
1144
+ {
1145
+ "epoch": 11.44,
1146
+ "learning_rate": 8.911111111111112e-07,
1147
+ "loss": 0.0015,
1148
+ "step": 4600
1149
+ },
1150
+ {
1151
+ "epoch": 11.5,
1152
+ "learning_rate": 8.355555555555556e-07,
1153
+ "loss": 0.0008,
1154
+ "step": 4625
1155
+ },
1156
+ {
1157
+ "epoch": 11.57,
1158
+ "learning_rate": 7.8e-07,
1159
+ "loss": 0.0013,
1160
+ "step": 4650
1161
+ },
1162
+ {
1163
+ "epoch": 11.63,
1164
+ "learning_rate": 7.244444444444446e-07,
1165
+ "loss": 0.0011,
1166
+ "step": 4675
1167
+ },
1168
+ {
1169
+ "epoch": 11.69,
1170
+ "learning_rate": 6.68888888888889e-07,
1171
+ "loss": 0.0012,
1172
+ "step": 4700
1173
+ },
1174
+ {
1175
+ "epoch": 11.75,
1176
+ "learning_rate": 6.133333333333333e-07,
1177
+ "loss": 0.0009,
1178
+ "step": 4725
1179
+ },
1180
+ {
1181
+ "epoch": 11.82,
1182
+ "learning_rate": 5.577777777777779e-07,
1183
+ "loss": 0.0011,
1184
+ "step": 4750
1185
+ },
1186
+ {
1187
+ "epoch": 11.88,
1188
+ "learning_rate": 5.022222222222222e-07,
1189
+ "loss": 0.001,
1190
+ "step": 4775
1191
+ },
1192
+ {
1193
+ "epoch": 11.94,
1194
+ "learning_rate": 4.466666666666667e-07,
1195
+ "loss": 0.0008,
1196
+ "step": 4800
1197
+ },
1198
+ {
1199
+ "epoch": 12.0,
1200
+ "learning_rate": 3.9111111111111115e-07,
1201
+ "loss": 0.0009,
1202
+ "step": 4825
1203
+ },
1204
+ {
1205
+ "epoch": 12.06,
1206
+ "learning_rate": 3.3555555555555556e-07,
1207
+ "loss": 0.0013,
1208
+ "step": 4850
1209
+ },
1210
+ {
1211
+ "epoch": 12.13,
1212
+ "learning_rate": 2.8e-07,
1213
+ "loss": 0.001,
1214
+ "step": 4875
1215
+ },
1216
+ {
1217
+ "epoch": 12.19,
1218
+ "learning_rate": 2.2444444444444445e-07,
1219
+ "loss": 0.0008,
1220
+ "step": 4900
1221
+ },
1222
+ {
1223
+ "epoch": 12.25,
1224
+ "learning_rate": 1.6888888888888888e-07,
1225
+ "loss": 0.0007,
1226
+ "step": 4925
1227
+ },
1228
+ {
1229
+ "epoch": 12.31,
1230
+ "learning_rate": 1.1333333333333336e-07,
1231
+ "loss": 0.0013,
1232
+ "step": 4950
1233
+ },
1234
+ {
1235
+ "epoch": 12.38,
1236
+ "learning_rate": 5.777777777777778e-08,
1237
+ "loss": 0.0011,
1238
+ "step": 4975
1239
+ },
1240
+ {
1241
+ "epoch": 12.44,
1242
+ "learning_rate": 2.2222222222222225e-09,
1243
+ "loss": 0.0008,
1244
+ "step": 5000
1245
+ },
1246
+ {
1247
+ "epoch": 12.44,
1248
+ "eval_loss": 0.38012266159057617,
1249
+ "eval_runtime": 1112.6036,
1250
+ "eval_samples_per_second": 1.524,
1251
+ "eval_steps_per_second": 0.095,
1252
+ "eval_wer": 11.199851411589897,
1253
+ "step": 5000
1254
+ },
1255
+ {
1256
+ "epoch": 12.44,
1257
+ "step": 5000,
1258
+ "total_flos": 1.6322330164985856e+20,
1259
+ "train_loss": 0.020854657278954983,
1260
+ "train_runtime": 27142.1177,
1261
+ "train_samples_per_second": 5.895,
1262
+ "train_steps_per_second": 0.184
1263
+ }
1264
+ ],
1265
+ "max_steps": 5000,
1266
+ "num_train_epochs": 13,
1267
+ "total_flos": 1.6322330164985856e+20,
1268
+ "trial_name": null,
1269
+ "trial_params": null
1270
+ }
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:33ed7c38433905a1365c18790a32e9b80e38b4d99788b3a51c70edb3aecd69ff
3
- size 3643
2
+ oid sha256:a3ebd5ec6b9b2945948080568a87ff00951d03abd4956eab0351149cc37e37fe
3
+ size 3579