End of training

Browse files

Files changed (8) hide show

README.md +22 -23
config.json +1 -1
model.safetensors +1 -1
runs/Jul01_11-57-48_d3192d53a3e1/events.out.tfevents.1719835068.d3192d53a3e1.36114.0 +3 -0
runs/Jul01_12-35-04_d3192d53a3e1/events.out.tfevents.1719837304.d3192d53a3e1.36114.1 +3 -0
runs/Jul01_12-41-52_d3192d53a3e1/events.out.tfevents.1719837712.d3192d53a3e1.36114.2 +3 -0
runs/Jul01_13-13-10_d3192d53a3e1/events.out.tfevents.1719839591.d3192d53a3e1.70638.0 +3 -0
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -1,10 +1,10 @@
 ---
-base_model: openai/whisper-tiny
 license: apache-2.0
-metrics:
-- wer
 tags:
 - generated_from_trainer
 model-index:
 - name: whisper-tinyfinacial
   results: []
@@ -17,8 +17,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 4.1540
-- Wer: 154.4944
 ## Model description
@@ -37,32 +37,31 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 1e-05
-- train_batch_size: 16
-- eval_batch_size: 1
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 100
 - training_steps: 600
-- mixed_precision_training: Native AMP
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Wer      |
-|:-------------:|:-----:|:----:|:---------------:|:--------:|
-| No log        | 25.0  | 50   | 1.4261          | 66.8539  |
-| No log        | 50.0  | 100  | 1.3916          | 86.5169  |
-| No log        | 75.0  | 150  | 1.6553          | 165.1685 |
-| No log        | 100.0 | 200  | 2.6574          | 134.8315 |
-| No log        | 125.0 | 250  | 2.7460          | 142.6966 |
-| No log        | 150.0 | 300  | 3.4242          | 157.3034 |
-| No log        | 175.0 | 350  | 3.7021          | 165.7303 |
-| No log        | 200.0 | 400  | 3.9109          | 168.5393 |
-| No log        | 225.0 | 450  | 4.0157          | 198.8764 |
-| 3.7169        | 250.0 | 500  | 4.1466          | 164.6067 |
-| 3.7169        | 275.0 | 550  | 4.1483          | 152.8090 |
-| 3.7169        | 300.0 | 600  | 4.1540          | 154.4944 |
 ### Framework versions

 ---
 license: apache-2.0
+base_model: openai/whisper-tiny
 tags:
 - generated_from_trainer
+metrics:
+- wer
 model-index:
 - name: whisper-tinyfinacial
   results: []
 This model is a fine-tuned version of [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.5217
+- Wer: 55.6180
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 1.35e-05
+- train_batch_size: 8
+- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 100
 - training_steps: 600
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | Wer     |
+|:-------------:|:-----:|:----:|:---------------:|:-------:|
+| No log        | 0.4   | 50   | 0.9091          | 64.0449 |
+| No log        | 0.8   | 100  | 0.6941          | 52.2472 |
+| No log        | 1.2   | 150  | 0.5615          | 51.6854 |
+| No log        | 1.6   | 200  | 0.5219          | 47.1910 |
+| No log        | 2.0   | 250  | 0.4938          | 47.7528 |
+| No log        | 2.4   | 300  | 0.4970          | 50.0    |
+| No log        | 2.8   | 350  | 0.4999          | 58.4270 |
+| No log        | 3.2   | 400  | 0.5076          | 46.0674 |
+| No log        | 3.6   | 450  | 0.5157          | 52.2472 |
+| 0.3104        | 4.0   | 500  | 0.5277          | 56.1798 |
+| 0.3104        | 4.4   | 550  | 0.5257          | 57.3034 |
+| 0.3104        | 4.8   | 600  | 0.5217          | 55.6180 |
 ### Framework versions

config.json CHANGED Viewed

@@ -19,7 +19,7 @@
   "decoder_layerdrop": 0.0,
   "decoder_layers": 4,
   "decoder_start_token_id": 50258,
-  "dropout": 0.1,
   "encoder_attention_heads": 6,
   "encoder_ffn_dim": 1536,
   "encoder_layerdrop": 0.0,

   "decoder_layerdrop": 0.0,
   "decoder_layers": 4,
   "decoder_start_token_id": 50258,
+  "dropout": 0.0,
   "encoder_attention_heads": 6,
   "encoder_ffn_dim": 1536,
   "encoder_layerdrop": 0.0,

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:dccbb3ae1823521d955ac4425835578b0c7cae1cd90822cb6469bbf0db5ddf41
 size 151061672

 version https://git-lfs.github.com/spec/v1
+oid sha256:545fd800d83b33655596a90517d963eae1bdeb6a248ba9ec6934f85519637ff5
 size 151061672

runs/Jul01_11-57-48_d3192d53a3e1/events.out.tfevents.1719835068.d3192d53a3e1.36114.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:51a4b9885ff84ac842a3347f34dd16ff7c2fd883524f02e007c3042b466d96c3
+size 10015

runs/Jul01_12-35-04_d3192d53a3e1/events.out.tfevents.1719837304.d3192d53a3e1.36114.1 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:481c78dad3afbf808ad47899ac19f181712243accd5839501c99135a9736492d
+size 7544

runs/Jul01_12-41-52_d3192d53a3e1/events.out.tfevents.1719837712.d3192d53a3e1.36114.2 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:5dabee0a5f371c37c812f5c0d1d564a8a811c10e8c060bdab2cbbc7896a0df00
+size 7544

runs/Jul01_13-13-10_d3192d53a3e1/events.out.tfevents.1719839591.d3192d53a3e1.70638.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:c3ec2066ba12b2e9a9d3bdd5ec0fabc373796f8d897005d779968a0ed2e28123
+size 10017

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:fcc91f0ed40141c5a99e9e39a8d092b0dec17243655e5e31cbc72f0eb1be0a94
 size 5240

 version https://git-lfs.github.com/spec/v1
+oid sha256:476a65ee1646f4f0d5d1152f778d6e77d845c3ddaea79bb165055f8485988710
 size 5240