alvanlii
/

wav2vec2-BERT-cantonese

Automatic Speech Recognition

Inference Endpoints

Model card Files Files and versions Community

alvanlii commited on Feb 2

Commit

86dd900

•

1 Parent(s): f628665

Update README.md

Files changed (1) hide show

README.md +4 -13

README.md CHANGED Viewed

@@ -19,13 +19,13 @@ model-index:
     metrics:
     - name: Normalized CER
       type: cer
-      value: 20.5
 ---
 # Wav2Vec2-BERT - Alvin
-This model is a fine-tuned version of [facebook/w2v-bert-2.0](https://huggingface.co/facebook/w2v-bert-2.0). This has a CER of 20.5
 ## Training and evaluation data
 For training, three datasets were used:
@@ -65,19 +65,10 @@ predictions = processor.batch_decode(predicted_ids)
 ```
 ## Training Hyperparameters
-- learning_rate: 1e-4
 - train_batch_size: 4 (on 1 3090)
 - eval_batch_size: 1
 - gradient_accumulation_steps: 32
 - total_train_batch_size: 32x4=128
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
-- lr_scheduler_warmup_steps: 500
-## Training Results
-| Training Loss | Step | Validation Loss | CER    |
-|:-------------:|:----:|:---------------:|:------:|
-|2.416|1200|1.615|0.4246
-|1.313|4200|0.9049|0.2745
-|1.090|7200|0.7463|0.2388
-|0.907|9600|0.6820|0.2172

     metrics:
     - name: Normalized CER
       type: cer
+      value: 16.26
 ---
 # Wav2Vec2-BERT - Alvin
+This model is a fine-tuned version of [facebook/w2v-bert-2.0](https://huggingface.co/facebook/w2v-bert-2.0). This has a CER of 16.26
 ## Training and evaluation data
 For training, three datasets were used:
 ```
 ## Training Hyperparameters
+- learning_rate: 5e-5
 - train_batch_size: 4 (on 1 3090)
 - eval_batch_size: 1
 - gradient_accumulation_steps: 32
 - total_train_batch_size: 32x4=128
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_warmup_steps: 1500