ArthurMalajyan
/

seamless-m4t-v2-large-asr-hyw

Automatic Speech Recognition

Western Armenian

seamless_communication

Model card Files Files and versions Community

ArthurMalajyan commited on Jun 1

Commit

2532b19

•

1 Parent(s): 161c5f6

Update README.md

Files changed (1) hide show

README.md +15 -3

README.md CHANGED Viewed

@@ -15,14 +15,14 @@ tags:
 # SeamlessM4T v2 ASR for Western Armenian
-This model is a fine-tuned version of the [facebook/seamless-m4t-v2-large](https://huggingface.co/facebook/seamless-m4t-v2-large). Initially, it was fine-tuned on the Common Voice 16.1 and Google Fleurs datasets. Subsequently, it was further fine-tuned on the [ReRooted](https://github.com/jhdeov/ReRooted-ArmenianCorpus) corpus.
 The model achieves the following results on the test sets:
 - CV_wer: 0.308
 - CV_cer: 0.07
 - GF_wer: 0.311
 - GF_cer: 0.094
-After fine-tuning on Western Armenian data, the model occasionally translates Eastern Armenian speech into Western Armenian.
 ### Training hyperparameters
@@ -37,4 +37,16 @@ The following hyperparameters were used during training:
 ### Framework versions
 - Pytorch 2.1.1
-- fairseq2==0.2.0

 # SeamlessM4T v2 ASR for Western Armenian
+This model is a fine-tuned version of the [facebook/seamless-m4t-v2-large](https://huggingface.co/facebook/seamless-m4t-v2-large) for the ASR task. Initially, it was fine-tuned on the Common Voice 16.1 and Google Fleurs datasets. Subsequently, it was further fine-tuned on the [ReRooted](https://github.com/jhdeov/ReRooted-ArmenianCorpus) corpus.
 The model achieves the following results on the test sets:
 - CV_wer: 0.308
 - CV_cer: 0.07
 - GF_wer: 0.311
 - GF_cer: 0.094
+After fine-tuning on Western Armenian data, the model occasionally translates Eastern Armenian speech into Western Armenian(Colab link: [Test SeamlessM4T v2 ASR for Western Armenian](https://colab.research.google.com/drive/16TyabwvSU7fR54x0xzgkTMhtArCibZ5H?usp=sharing)).
 ### Training hyperparameters
 ### Framework versions
 - Pytorch 2.1.1
+- Fairseq2 0.2.0
+## Citation
+For SeamlessM4T v2, please cite :
+```bibtex
+@inproceedings{seamless2023,
+   title="Seamless: Multilingual Expressive and Streaming Speech Translation",
+   author="{Seamless Communication}, Lo{\"i}c Barrault, Yu-An Chung, Mariano Coria Meglioli, David Dale, Ning Dong, Mark Duppenthaler, Paul-Ambroise Duquenne, Brian Ellis, Hady Elsahar, Justin Haaheim, John Hoffman, Min-Jae Hwang, Hirofumi Inaguma, Christopher Klaiber, Ilia Kulikov, Pengwei Li, Daniel Licht, Jean Maillard, Ruslan Mavlyutov, Alice Rakotoarison, Kaushik Ram Sadagopan, Abinesh Ramakrishnan, Tuan Tran, Guillaume Wenzek, Yilin Yang, Ethan Ye, Ivan Evtimov, Pierre Fernandez, Cynthia Gao, Prangthip Hansanti, Elahe Kalbassi, Amanda Kallet, Artyom Kozhevnikov, Gabriel Mejia, Robin San Roman, Christophe Touret, Corinne Wong, Carleigh Wood, Bokai Yu, Pierre Andrews, Can Balioglu, Peng-Jen Chen, Marta R. Costa-juss{\`a}, Maha Elbayad, Hongyu Gong, Francisco Guzm{\'a}n, Kevin Heffernan, Somya Jain, Justine Kao, Ann Lee, Xutai Ma, Alex Mourachko, Benjamin Peloquin, Juan Pino, Sravya Popuri, Christophe Ropers, Safiyyah Saleem, Holger Schwenk, Anna Sun, Paden Tomasello, Changhan Wang, Jeff Wang, Skyler Wang, Mary Williamson",
+  journal={ArXiv},
+  year={2023}
+}
+```
+[//]: # "https://arxiv.org/abs/2312.05187"