jangmin
/

whisper-medium-ko-normalized-1273h

Automatic Speech Recognition

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

jangmin commited on Jun 8, 2023

Commit

ddfd79a

·

1 Parent(s): bb1931b

Update README.md

Files changed (1) hide show

README.md +4 -3

README.md CHANGED Viewed

@@ -11,14 +11,15 @@ model-index:
 # whisper-small-ko-normalized-1273h
-This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-medium) on a custom dataset for improving Korean speech recognition.
 It achieves the following results on the evaluation set:
 - Loss: 0.1254
 - Wer: 0.0551
 ## Model description
-The model was trained to transcript the Korean audio sources into text.
 ## Intended uses & limitations
@@ -40,7 +41,7 @@ Following indicates the hours information for each dastset.
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
-- train_batch_size: 32
 - eval_batch_size: 32
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08

 # whisper-small-ko-normalized-1273h
+This model is a fine-tuned version of [openai/whisper-medium](https://huggingface.co/openai/whisper-medium) on a custom dataset for improving Korean speech recognition.
 It achieves the following results on the evaluation set:
 - Loss: 0.1254
 - Wer: 0.0551
 ## Model description
+The model was a fine-tuned version of `openai/whisper-medium` transcript the Korean audio sources into text.
+It was trained on GCP's `a2-highgpu-1g` (a100-40G) for 26 hours with about $90.
 ## Intended uses & limitations
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
+- train_batch_size: 24
 - eval_batch_size: 32
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08