ouob
/

whisper-hakka-t1

Automatic Speech Recognition

hf-asr-leaderboard

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

ouob commited on Aug 14, 2023

Commit

1d81fb5

•

1 Parent(s): 02ae1bf

End of training

Files changed (4) hide show

README.md +67 -0
generation_config.json +34 -0
pytorch_model.bin +1 -1
training_args.bin +1 -1

README.md ADDED Viewed

	@@ -0,0 +1,67 @@

+---
+language:
+- hi
+license: apache-2.0
+base_model: openai/whisper-base
+tags:
+- hf-asr-leaderboard
+- generated_from_trainer
+datasets:
+- mozilla-foundation/common_voice_11_0
+model-index:
+- name: Whisper base Hi - Sanchit Gandhi
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# Whisper base Hi - Sanchit Gandhi
+This model is a fine-tuned version of [openai/whisper-base](https://huggingface.co/openai/whisper-base) on the Common Voice 11.0 dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.1381
+- Cer: 8.5165
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 1e-05
+- train_batch_size: 16
+- eval_batch_size: 8
+- seed: 42
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 500
+- training_steps: 4000
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Cer     |
+|:-------------:|:-----:|:----:|:---------------:|:-------:|
+| 0.3106        | 0.87  | 1000 | 0.3352          | 16.9784 |
+| 0.1609        | 1.74  | 2000 | 0.1887          | 10.5303 |
+| 0.0889        | 2.6   | 3000 | 0.1510          | 9.2926  |
+| 0.0596        | 3.47  | 4000 | 0.1381          | 8.5165  |
+### Framework versions
+- Transformers 4.32.0.dev0
+- Pytorch 2.0.1+cu117
+- Datasets 2.14.4
+- Tokenizers 0.13.3

generation_config.json CHANGED Viewed

@@ -1,4 +1,38 @@
 {
   "begin_suppress_tokens": [
     220,
     50257

 {
+  "alignment_heads": [
+    [
+      3,
+      1
+    ],
+    [
+      4,
+      2
+    ],
+    [
+      4,
+      3
+    ],
+    [
+      4,
+      7
+    ],
+    [
+      5,
+      1
+    ],
+    [
+      5,
+      2
+    ],
+    [
+      5,
+      4
+    ],
+    [
+      5,
+      6
+    ]
+  ],
   "begin_suppress_tokens": [
     220,
     50257

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9712231be4437ad7749f1958e681093d0c77cb4810077fe74ee9afc13664417c
 size 290458785

 version https://git-lfs.github.com/spec/v1
+oid sha256:46815212c03bee9064490ba29b680d3ae59dd90316317867375af9b83490c64c
 size 290458785

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:dbcd30a580871fb5bcf7d42acb343b8dd49ac7a77ea1d9f7a78028545ce5c97c
 size 4155

 version https://git-lfs.github.com/spec/v1
+oid sha256:134ff8e4b79a1c454c7916c5d9282d4ac5370ec09a7103a0ca4d3c0905af02db
 size 4155