michaelszhu
/

wav2vec2-base-finetuned-radio-ASR-2

@@ -23,7 +23,7 @@ model-index:
     metrics:
     - name: Wer
       type: wer
-      value: 100.0
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -33,8 +33,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [openai/wav2vec2](https://huggingface.co/openai/wav2vec2) on the Radio-Modified Common Voice 11.0 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 382.1391
-- Wer: 100.0
 ## Model description
@@ -53,31 +53,37 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.0001
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
-- training_steps: 6000
 - mixed_precision_training: Native AMP
 ### Training results
-| Training Loss | Epoch  | Step | Validation Loss | Wer   |
-|:-------------:|:------:|:----:|:---------------:|:-----:|
-| 506.6064      | 0.1667 | 1000 | 442.2693        | 100.0 |
-| 435.214       | 0.3333 | 2000 | 379.3636        | 100.0 |
-| 460.0346      | 0.5    | 3000 | 379.4807        | 100.0 |
-| 416.7359      | 0.6667 | 4000 | 382.4834        | 100.0 |
-| 443.2083      | 0.8333 | 5000 | 380.2564        | 100.0 |
-| 407.3617      | 1.0    | 6000 | 382.1391        | 100.0 |
 ### Framework versions
 - Transformers 4.41.2
-- Pytorch 2.3.0+cu121
 - Datasets 2.19.2
 - Tokenizers 0.19.1

     metrics:
     - name: Wer
       type: wer
+      value: 93.31727244160469
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [openai/wav2vec2](https://huggingface.co/openai/wav2vec2) on the Radio-Modified Common Voice 11.0 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 274.6956
+- Wer: 93.3173
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 1e-05
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
+- gradient_accumulation_steps: 2
+- total_train_batch_size: 16
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
+- training_steps: 10000
 - mixed_precision_training: Native AMP
 ### Training results
+| Training Loss | Epoch | Step  | Validation Loss | Wer     |
+|:-------------:|:-----:|:-----:|:---------------:|:-------:|
+| 454.7647      | 0.1   | 1000  | 432.4697        | 96.7955 |
+| 381.9785      | 0.2   | 2000  | 345.0900        | 97.3774 |
+| 346.4287      | 0.3   | 3000  | 345.3813        | 96.7005 |
+| 313.4983      | 0.4   | 4000  | 334.5611        | 95.2668 |
+| 344.4422      | 0.5   | 5000  | 422.9466        | 96.1369 |
+| 349.3033      | 0.6   | 6000  | 337.1495        | 91.9877 |
+| 337.6954      | 0.7   | 7000  | 299.1713        | 95.9529 |
+| 313.1935      | 0.8   | 8000  | 283.5478        | 95.4207 |
+| 348.8207      | 0.9   | 9000  | 275.7858        | 92.5278 |
+| 325.8637      | 1.0   | 10000 | 274.6956        | 93.3173 |
 ### Framework versions
 - Transformers 4.41.2
+- Pytorch 2.3.1+cu121
 - Datasets 2.19.2
 - Tokenizers 0.19.1

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a54e85fd5ed766deadceeecf35c2ba1375d85b121c90f41ab5f527d61c7b7ff3
 size 377611120

 version https://git-lfs.github.com/spec/v1
+oid sha256:8dd7b3b3e95c65bff047a53687889dbe9164db3f93cfd7dad051d200e0a7dc59
 size 377611120

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f15056c56eaaa934b15edc47c28523cfec498d7e58ee9909a7f34e4f81af1296
 size 5048

 version https://git-lfs.github.com/spec/v1
+oid sha256:1b477aae325973fc97f326529b6d51f2ef0286007caefdce8d143ba8e36ecd8b
 size 5048