Tahsin-Mayeesha
commited on
Commit
•
ecbc11f
1
Parent(s):
e10defc
Update README.md
Browse files
README.md
CHANGED
@@ -75,4 +75,15 @@ The following hyperparameters were used during training:
|
|
75 |
Note : Training and evaluation script modified from https://huggingface.co/chmanoj/xls-r-300m-te and https://github.com/huggingface/transformers/tree/master/examples/research_projects/robust-speech-event.
|
76 |
Bengali speech data was not available from common voice or librispeech multilingual datasets, so OpenSLR53 has been used.
|
77 |
|
78 |
-
Note 2 : Minimum audio duration of 0.1s has been used to filter the training data which excluded may be 10-20 samples.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
75 |
Note : Training and evaluation script modified from https://huggingface.co/chmanoj/xls-r-300m-te and https://github.com/huggingface/transformers/tree/master/examples/research_projects/robust-speech-event.
|
76 |
Bengali speech data was not available from common voice or librispeech multilingual datasets, so OpenSLR53 has been used.
|
77 |
|
78 |
+
Note 2 : Minimum audio duration of 0.1s has been used to filter the training data which excluded may be 10-20 samples.
|
79 |
+
|
80 |
+
# Citation
|
81 |
+
|
82 |
+
@misc {tahsin_mayeesha_2023,
|
83 |
+
author = { {Tahsin Mayeesha} },
|
84 |
+
title = { wav2vec2-bn-300m (Revision e10defc) },
|
85 |
+
year = 2023,
|
86 |
+
url = { https://huggingface.co/Tahsin-Mayeesha/wav2vec2-bn-300m },
|
87 |
+
doi = { 10.57967/hf/0939 },
|
88 |
+
publisher = { Hugging Face }
|
89 |
+
}
|