kpriyanshu256
/

whisper-large-v2-as-1000-32-1e-05-bn-multi

Automatic Speech Recognition

generated_from_trainer

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

Edit model card

kpriyanshu256/whisper-large-v2-as-600-32-1e-05-bn-Assamese

This model is a fine-tuned version of kpriyanshu256/whisper-large-v2-as-600-32-1e-05-bn on the Common Voice 11.0 and the FLEURS datasets. It achieves the following results on the evaluation set:

Loss: 0.2486
Wer: 17.5600

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 1e-05
train_batch_size: 16
eval_batch_size: 8
seed: 42
distributed_type: multi-GPU
gradient_accumulation_steps: 2
total_train_batch_size: 32
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: linear
lr_scheduler_warmup_steps: 50
training_steps: 1000

Training results

Training Loss	Epoch	Step	Validation Loss	Wer
0.1273	0.1	100	0.1737	20.8988
0.0811	0.2	200	0.1739	19.0038
0.0638	0.3	300	0.1823	18.4804
0.0404	1.05	400	0.1893	17.1810
0.0316	1.15	500	0.2067	17.0186
0.027	1.25	600	0.2081	17.7405
0.025	2.01	700	0.2213	17.7585
0.0213	2.11	800	0.2237	17.8488
0.0176	2.21	900	0.2390	16.7479
0.0184	2.31	1000	0.2486	17.5600

Framework versions

Transformers 4.26.0.dev0
Pytorch 1.13.0+cu117
Datasets 2.7.1.dev0
Tokenizers 0.13.2

Downloads last month: 9

Datasets used to train kpriyanshu256/whisper-large-v2-as-1000-32-1e-05-bn-multi

Evaluation results

Wer on Common Voice 11.0
test set self-reported

17.560
Wer on FLEURS
self-reported

null

View on Papers With Code