gary109's picture
update model card README.md
7825687
metadata
license: apache-2.0
tags:
  - automatic-speech-recognition
  - gary109/AI_Light_Dance
  - generated_from_trainer
model-index:
  - name: ai-light-dance_singing3_ft_wav2vec2-large-xlsr-53-v1
    results: []

ai-light-dance_singing3_ft_wav2vec2-large-xlsr-53-v1

This model is a fine-tuned version of gary109/ai-light-dance_singing3_ft_wav2vec2-large-xlsr-53-v1 on the GARY109/AI_LIGHT_DANCE - ONSET-SINGING3 dataset. It achieves the following results on the evaluation set:

  • Loss: 0.5352
  • Wer: 0.2490

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-06
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • gradient_accumulation_steps: 8
  • total_train_batch_size: 64
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • lr_scheduler_warmup_steps: 50
  • num_epochs: 50.0
  • mixed_precision_training: Native AMP

Training results

Training Loss Epoch Step Validation Loss Wer
0.3818 1.0 72 0.5569 0.2535
0.3686 2.0 144 0.5535 0.2501
0.3562 3.0 216 0.5526 0.2501
0.3506 4.0 288 0.5460 0.2520
0.369 5.0 360 0.5390 0.2484
0.3683 6.0 432 0.5426 0.2474
0.3541 7.0 504 0.5452 0.2495
0.369 8.0 576 0.5468 0.2490
0.358 9.0 648 0.5471 0.2453
0.3765 10.0 720 0.5376 0.2459
0.3654 11.0 792 0.5407 0.2486
0.373 12.0 864 0.5390 0.2475
0.3606 13.0 936 0.5441 0.2472
0.369 14.0 1008 0.5388 0.2490
0.3762 15.0 1080 0.5389 0.2497
0.3821 16.0 1152 0.5352 0.2490
0.3605 17.0 1224 0.5434 0.2492
0.3703 18.0 1296 0.5415 0.2500
0.3667 19.0 1368 0.5499 0.2487
0.3703 20.0 1440 0.5482 0.2494
0.369 21.0 1512 0.5418 0.2504
0.3708 22.0 1584 0.5437 0.2479
0.3609 23.0 1656 0.5453 0.2480
0.3534 24.0 1728 0.5393 0.2484
0.3656 25.0 1800 0.5363 0.2477
0.3713 26.0 1872 0.5406 0.2461
0.3572 27.0 1944 0.5369 0.2465
0.3665 28.0 2016 0.5375 0.2466
0.3748 29.0 2088 0.5367 0.2480
0.3769 30.0 2160 0.5359 0.2459
0.3634 31.0 2232 0.5417 0.2471
0.3627 32.0 2304 0.5398 0.2473
0.3497 33.0 2376 0.5428 0.2483
0.3479 34.0 2448 0.5390 0.2481
0.363 35.0 2520 0.5388 0.2478
0.3622 36.0 2592 0.5396 0.2490
0.3685 37.0 2664 0.5408 0.2479
0.356 38.0 2736 0.5385 0.2459
0.3529 39.0 2808 0.5389 0.2467
0.3702 40.0 2880 0.5392 0.2482
0.3645 41.0 2952 0.5408 0.2467
0.3489 42.0 3024 0.5406 0.2474
0.3556 43.0 3096 0.5402 0.2472
0.356 44.0 3168 0.5386 0.2468
0.3632 45.0 3240 0.5402 0.2463
0.3693 46.0 3312 0.5401 0.2472
0.3593 47.0 3384 0.5390 0.2463
0.3515 48.0 3456 0.5399 0.2468
0.3485 49.0 3528 0.5392 0.2461
0.3591 50.0 3600 0.5403 0.2461

Framework versions

  • Transformers 4.21.0.dev0
  • Pytorch 1.9.1+cu102
  • Datasets 2.3.3.dev0
  • Tokenizers 0.12.1