--- language: - pt license: apache-2.0 tags: - generated_from_trainer - hf-asr-leaderboard - pt - robust-speech-event datasets: - common_voice model-index: - name: sew-tiny-portuguese-cv results: - task: name: Automatic Speech Recognition type: automatic-speech-recognition dataset: name: Common Voice 6 type: common_voice args: pt metrics: - name: Test WER type: wer value: 30.02 - name: Test CER type: cer value: 10.34 - task: name: Automatic Speech Recognition type: automatic-speech-recognition dataset: name: Robust Speech Event - Dev Data type: speech-recognition-community-v2/dev_data args: sv metrics: - name: Test WER type: wer value: 56.46 - name: Test CER type: cer value: 22.94 - task: name: Automatic Speech Recognition type: automatic-speech-recognition dataset: name: Robust Speech Event - Dev Data type: speech-recognition-community-v2/dev_data args: pt metrics: - name: Test WER type: wer value: 57.17 - task: name: Automatic Speech Recognition type: automatic-speech-recognition dataset: name: Robust Speech Event - Test Data type: speech-recognition-community-v2/eval_data args: pt metrics: - name: Test WER type: wer value: 61.3 --- # sew-tiny-portuguese-cv This model is a fine-tuned version of [lgris/sew-tiny-pt](https://huggingface.co/lgris/sew-tiny-pt) on the common_voice dataset. It achieves the following results on the evaluation set: - Loss: 0.5110 - Wer: 0.2842 ## Model description More information needed ## Intended uses & limitations More information needed ## Training and evaluation data More information needed ## Training procedure ### Training hyperparameters The following hyperparameters were used during training: - learning_rate: 0.0001 - train_batch_size: 8 - eval_batch_size: 8 - seed: 42 - gradient_accumulation_steps: 4 - total_train_batch_size: 32 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08 - lr_scheduler_type: linear - lr_scheduler_warmup_steps: 1000 - training_steps: 40000 - mixed_precision_training: Native AMP ### Training results | Training Loss | Epoch | Step | Validation Loss | Wer | |:-------------:|:------:|:-----:|:---------------:|:------:| | No log | 4.92 | 1000 | 0.8468 | 0.6494 | | 3.4638 | 9.85 | 2000 | 0.4978 | 0.3815 | | 3.4638 | 14.78 | 3000 | 0.4734 | 0.3417 | | 0.9904 | 19.7 | 4000 | 0.4577 | 0.3344 | | 0.9904 | 24.63 | 5000 | 0.4376 | 0.3170 | | 0.8849 | 29.55 | 6000 | 0.4225 | 0.3118 | | 0.8849 | 34.48 | 7000 | 0.4354 | 0.3080 | | 0.819 | 39.41 | 8000 | 0.4434 | 0.3004 | | 0.819 | 44.33 | 9000 | 0.4710 | 0.3132 | | 0.7706 | 49.26 | 10000 | 0.4497 | 0.3064 | | 0.7706 | 54.19 | 11000 | 0.4598 | 0.3100 | | 0.7264 | 59.11 | 12000 | 0.4271 | 0.3013 | | 0.7264 | 64.04 | 13000 | 0.4333 | 0.2959 | | 0.6909 | 68.96 | 14000 | 0.4554 | 0.3019 | | 0.6909 | 73.89 | 15000 | 0.4444 | 0.2888 | | 0.6614 | 78.81 | 16000 | 0.4734 | 0.3081 | | 0.6614 | 83.74 | 17000 | 0.4820 | 0.3058 | | 0.6379 | 88.67 | 18000 | 0.4416 | 0.2950 | | 0.6379 | 93.59 | 19000 | 0.4614 | 0.2974 | | 0.6055 | 98.52 | 20000 | 0.4812 | 0.3018 | | 0.6055 | 103.45 | 21000 | 0.4700 | 0.3018 | | 0.5823 | 108.37 | 22000 | 0.4726 | 0.2999 | | 0.5823 | 113.3 | 23000 | 0.4979 | 0.2887 | | 0.5597 | 118.23 | 24000 | 0.4813 | 0.2980 | | 0.5597 | 123.15 | 25000 | 0.4968 | 0.2972 | | 0.542 | 128.08 | 26000 | 0.5331 | 0.3059 | | 0.542 | 133.0 | 27000 | 0.5046 | 0.2978 | | 0.5185 | 137.93 | 28000 | 0.4882 | 0.2922 | | 0.5185 | 142.85 | 29000 | 0.4945 | 0.2938 | | 0.499 | 147.78 | 30000 | 0.4971 | 0.2913 | | 0.499 | 152.71 | 31000 | 0.4948 | 0.2873 | | 0.4811 | 157.63 | 32000 | 0.4924 | 0.2918 | | 0.4811 | 162.56 | 33000 | 0.5128 | 0.2911 | | 0.4679 | 167.49 | 34000 | 0.5098 | 0.2892 | | 0.4679 | 172.41 | 35000 | 0.4966 | 0.2863 | | 0.456 | 177.34 | 36000 | 0.5033 | 0.2839 | | 0.456 | 182.27 | 37000 | 0.5114 | 0.2875 | | 0.4453 | 187.19 | 38000 | 0.5154 | 0.2859 | | 0.4453 | 192.12 | 39000 | 0.5102 | 0.2847 | | 0.4366 | 197.04 | 40000 | 0.5110 | 0.2842 | ### Framework versions - Transformers 4.16.0.dev0 - Pytorch 1.10.1+cu102 - Datasets 1.17.1.dev0 - Tokenizers 0.11.0