omarxadel commited on
Commit
dd47380
1 Parent(s): 22b0fa7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +33 -35
README.md CHANGED
@@ -1,46 +1,44 @@
1
  ---
 
 
2
  tags:
3
- - generated_from_trainer
 
 
 
 
 
 
 
 
 
4
  model-index:
5
- - name: hubert-large-arabic-egyptian
6
- results: []
 
 
 
 
 
 
 
 
 
 
7
  ---
8
 
9
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
10
- should probably proofread and complete it, then remove this comment. -->
11
 
12
- # hubert-large-arabic-egyptian
13
 
14
- This model was trained from scratch on an unknown dataset.
15
 
16
- ## Model description
17
 
18
- More information needed
 
 
19
 
20
- ## Intended uses & limitations
21
 
22
- More information needed
23
-
24
- ## Training and evaluation data
25
-
26
- More information needed
27
-
28
- ## Training procedure
29
-
30
- ### Training hyperparameters
31
-
32
- The following hyperparameters were used during training:
33
- - learning_rate: 5e-05
34
- - train_batch_size: 8
35
- - eval_batch_size: 8
36
- - seed: 42
37
- - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
38
- - lr_scheduler_type: linear
39
- - num_epochs: 3.0
40
-
41
- ### Framework versions
42
-
43
- - Transformers 4.20.1
44
- - Pytorch 1.11.0+cu113
45
- - Datasets 2.3.2
46
- - Tokenizers 0.12.1
 
1
  ---
2
+ language: "ar"
3
+ pipeline_tag: automatic-speech-recognition
4
  tags:
5
+ - CTC
6
+ - Attention
7
+ - pytorch
8
+ - Transformer
9
+ license: "cc-by-nc-4.0"
10
+ datasets:
11
+ - MGB-3
12
+ - egyptian-arabic-conversational-speech-corpus
13
+ metrics:
14
+ - wer
15
  model-index:
16
+ - name: omarxadel/hubert-large-arabic-egyptian
17
+ results:
18
+ - task:
19
+ name: Automatic Speech Recognition
20
+ type: automatic-speech-recognition
21
+ metrics:
22
+ - name: Test WER
23
+ type: wer
24
+ value: 25.9
25
+ - name: Validation WER
26
+ type: wer
27
+ value: 23.5
28
  ---
29
 
30
+ # Arabic Hubert-Large - with CTC fine-tuned on MGB-3 and Egyptian Arabic Conversational Speech Corpus (No LM)
 
31
 
32
+ This model is a fine-tuned version of [Arabic Hubert-Large](https://huggingface.co/asafaya/hubert-large-arabic). We finetuned this model on the MGB-3 and Egyptian Arabic Conversational Speech Corpus datasets, acheiving a state of the art for Egyptian Arabic with WER of `25.9%`.
33
 
34
+ The original model was pre-trained on 2,000 hours of 16kHz sampled Arabic speech audio. When using the model make sure that your speech input is also sampled at 16Khz, see the original [paper](https://arxiv.org/abs/2106.07447) for more details on the model.
35
 
36
+ The performance of the model on CommonVoice Arabic 8.0 is the following:
37
 
38
+ | Valid WER | Test WER |
39
+ |:---------:|:--------:|
40
+ | 23.55 | 25.59 |
41
 
42
+ # Acknowledgement
43
 
44
+ Model fine-tuning and data processing for this work were performed as a part of a Graduation Project from Faculty of Engineering, Alexandria University, CCE Program.