arslanarjumand committed
Commit f35f35f
1 Parent(s): 1fe96c8

arslanarjumand/wav2vec-reptiles

Files changed (4)
  1. README.md +16 -23
  2. config.json +4 -4
  3. model.safetensors +1 -1
  4. training_args.bin +2 -2
README.md CHANGED
@@ -15,11 +15,11 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [arslanarjumand/wav2vec-reptiles](https://huggingface.co/arslanarjumand/wav2vec-reptiles) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 348.5887
- - Pcc Accuracy: 0.3663
- - Pcc Fluency: 0.3919
- - Pcc Total Score: 0.4017
- - Pcc Content: 0.3756
+ - Loss: 180.5618
+ - Pcc Accuracy: 0.7344
+ - Pcc Fluency: 0.7572
+ - Pcc Total Score: 0.7949
+ - Pcc Content: 0.7727
 
  ## Model description
 
@@ -38,15 +38,15 @@ More information needed
  ### Training hyperparameters
 
  The following hyperparameters were used during training:
- - learning_rate: 5e-05
+ - learning_rate: 2.5e-05
  - train_batch_size: 4
  - eval_batch_size: 6
  - seed: 42
- - gradient_accumulation_steps: 2
- - total_train_batch_size: 8
+ - gradient_accumulation_steps: 4
+ - total_train_batch_size: 16
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: cosine
- - lr_scheduler_warmup_ratio: 0.4
+ - lr_scheduler_warmup_ratio: 0.5
  - num_epochs: 15
  - mixed_precision_training: Native AMP
 
@@ -54,20 +54,13 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss | Pcc Accuracy | Pcc Fluency | Pcc Total Score | Pcc Content |
  |:-------------:|:-----:|:----:|:---------------:|:------------:|:-----------:|:---------------:|:-----------:|
- | 268.4239 | 1.07 | 500 | 366.8985 | 0.3003 | 0.3061 | 0.3209 | 0.2949 |
- | 516.4266 | 2.13 | 1000 | 366.3171 | 0.3049 | 0.3112 | 0.3257 | 0.2996 |
- | 285.2714 | 3.2 | 1500 | 367.6445 | 0.3101 | 0.3182 | 0.3322 | 0.3060 |
- | 286.5246 | 4.27 | 2000 | 360.3370 | 0.3225 | 0.3329 | 0.3465 | 0.3196 |
- | 697.7015 | 5.34 | 2500 | 360.7297 | 0.3303 | 0.3430 | 0.3558 | 0.3289 |
- | 219.4269 | 6.4 | 3000 | 358.2635 | 0.3392 | 0.3550 | 0.3671 | 0.3400 |
- | 326.4759 | 7.47 | 3500 | 353.8104 | 0.3475 | 0.3665 | 0.3777 | 0.3506 |
- | 512.1421 | 8.54 | 4000 | 355.2744 | 0.3539 | 0.3748 | 0.3857 | 0.3589 |
- | 296.5867 | 9.61 | 4500 | 351.7932 | 0.3591 | 0.3816 | 0.3921 | 0.3656 |
- | 316.3773 | 10.67 | 5000 | 350.8681 | 0.3622 | 0.3856 | 0.3960 | 0.3696 |
- | 247.4901 | 11.74 | 5500 | 350.1711 | 0.3647 | 0.3893 | 0.3993 | 0.3731 |
- | 262.0258 | 12.81 | 6000 | 348.5538 | 0.3658 | 0.3908 | 0.4007 | 0.3744 |
- | 705.4405 | 13.87 | 6500 | 348.5071 | 0.3663 | 0.3917 | 0.4016 | 0.3754 |
- | 264.0478 | 14.94 | 7000 | 348.5887 | 0.3663 | 0.3919 | 0.4017 | 0.3756 |
+ | 323.2938 | 2.13 | 500 | 333.4772 | 0.4645 | 0.5166 | 0.5181 | 0.4915 |
+ | 274.2192 | 4.27 | 1000 | 259.5493 | 0.5725 | 0.6371 | 0.6430 | 0.6182 |
+ | 287.9362 | 6.4 | 1500 | 291.9187 | 0.6475 | 0.6895 | 0.7121 | 0.6902 |
+ | 273.6328 | 8.54 | 2000 | 229.1164 | 0.6884 | 0.7243 | 0.7522 | 0.7285 |
+ | 211.4504 | 10.67 | 2500 | 223.4485 | 0.7087 | 0.7420 | 0.7727 | 0.7499 |
+ | 162.7622 | 12.81 | 3000 | 180.6950 | 0.7302 | 0.7557 | 0.7918 | 0.7695 |
+ | 194.6916 | 14.94 | 3500 | 180.5618 | 0.7344 | 0.7572 | 0.7949 | 0.7727 |
 
 
  ### Framework versions
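In short, this commit halves the learning rate, doubles gradient accumulation (for an effective batch size of 16), and extends warmup to half the schedule. A minimal sketch of how the updated hyperparameters might map onto `transformers.TrainingArguments` is shown below; it is a reconstruction from the model card, not the training script from this commit, and `output_dir` is an assumed placeholder.

```python
# Hypothetical sketch reconstructed from the updated README hyperparameters;
# not the author's actual training script.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="wav2vec-reptiles",     # assumed output directory
    learning_rate=2.5e-5,
    per_device_train_batch_size=4,
    per_device_eval_batch_size=6,
    gradient_accumulation_steps=4,     # effective train batch size: 4 * 4 = 16
    seed=42,
    adam_beta1=0.9,
    adam_beta2=0.999,
    adam_epsilon=1e-8,
    lr_scheduler_type="cosine",
    warmup_ratio=0.5,
    num_train_epochs=15,
    fp16=True,                         # "Native AMP" mixed precision
)
```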
config.json CHANGED
@@ -45,11 +45,11 @@
  "layerdrop": 0.0005,
  "left_max_position_embeddings": 64,
  "mask_feature_length": 5,
- "mask_feature_min_masks": 2,
- "mask_feature_prob": 0.0075,
+ "mask_feature_min_masks": 5,
+ "mask_feature_prob": 0.0575,
  "mask_time_length": 5,
- "mask_time_min_masks": 2,
- "mask_time_prob": 0.0085,
+ "mask_time_min_masks": 5,
+ "mask_time_prob": 0.0585,
  "max_source_positions": 5000,
  "model_type": "wav2vec2-bert",
  "num_adapter_layers": 1,
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:6e98c8c9f24ddb6f3a745189676a5396bd1041b39da3e08bc8812633a5ff3a2c
+ oid sha256:3b0b673eb880d4c8d4ce1a725874267182d7bc3b1ff32d8b5061035cbe10c10a
  size 2325236000
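The model.safetensors entry is a Git LFS pointer, so only its sha256 and size are versioned here. A small sketch of one way to check a downloaded copy against the new pointer hash, assuming `huggingface_hub` is installed; this verification step is not part of the repository.

```python
# Sketch (assumed workflow): verify a downloaded model.safetensors against the
# sha256 recorded in the LFS pointer of this commit.
import hashlib
from huggingface_hub import hf_hub_download

expected = "3b0b673eb880d4c8d4ce1a725874267182d7bc3b1ff32d8b5061035cbe10c10a"
path = hf_hub_download("arslanarjumand/wav2vec-reptiles", "model.safetensors")

h = hashlib.sha256()
with open(path, "rb") as f:
    for chunk in iter(lambda: f.read(1 << 20), b""):  # hash in 1 MiB chunks
        h.update(chunk)

assert h.hexdigest() == expected, "checksum mismatch"
```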
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:254d0cead7a836a45484bf3e3e264f464b13b4106469c6bcbb66eb67a3eb71bc
- size 4728
+ oid sha256:ad61c98ac9e74083e7bf784e4b8953d284c8a3cf81d10f9c5fd2dfeec8b834da
+ size 4664
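training_args.bin is a pickled `TrainingArguments` object rather than readable text, which is why only its hash and size appear in the diff. A hedged sketch for inspecting it locally is below; it assumes the file was produced by the transformers `Trainer` and that you trust the repository, since loading it requires unpickling.

```python
# Sketch: inspect the serialized training arguments shipped with the model.
# Only unpickle files from sources you trust.
import torch
from huggingface_hub import hf_hub_download

path = hf_hub_download("arslanarjumand/wav2vec-reptiles", "training_args.bin")
args = torch.load(path, weights_only=False)  # weights_only=False allows the pickled object

# Print the fields that changed according to this commit's README.
print(args.learning_rate, args.gradient_accumulation_steps, args.warmup_ratio)
```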