--- license: cc-by-nc-4.0 language: - hyw datasets: - mozilla-foundation/common_voice_16_1 - google/fleurs - ReRooted pipeline_tag: automatic-speech-recognition tags: - speech-to-text - seamless_communication --- # SeamlessM4T v2 ASR for Western Armenian This model is a fine-tuned version of the [facebook/seamless-m4t-v2-large](https://huggingface.co/facebook/seamless-m4t-v2-large) for the ASR task. Initially, it was fine-tuned on the Common Voice 16.1 and Google Fleurs datasets. Subsequently, it was further fine-tuned on the [ReRooted](https://github.com/jhdeov/ReRooted-ArmenianCorpus) corpus. The model achieves the following results on the test sets: - CV_wer: 0.308 - CV_cer: 0.07 - GF_wer: 0.311 - GF_cer: 0.094 After fine-tuning on Western Armenian data, the model occasionally translates Eastern Armenian speech into Western Armenian(Colab link: [Test SeamlessM4T v2 ASR for Western Armenian](https://colab.research.google.com/drive/16TyabwvSU7fR54x0xzgkTMhtArCibZ5H?usp=sharing)). ### Training hyperparameters The following hyperparameters were used during training: - learning_rate: 1e-6 - train_batch_size: 4 - eval_batch_size: 1 - seed: 43 - optimizer: Adam with betas=(0.9, 0.98) and epsilon=1e-08 - lr_scheduler_type: MyleLR - lr_scheduler_warmup_steps: 100 ### Framework versions - Pytorch 2.1.1 - Fairseq2 0.2.0 ## Citation For SeamlessM4T v2, please cite : ```bibtex @inproceedings{seamless2023, title="Seamless: Multilingual Expressive and Streaming Speech Translation", author="{Seamless Communication}, Lo{\"i}c Barrault, Yu-An Chung, Mariano Coria Meglioli, David Dale, Ning Dong, Mark Duppenthaler, Paul-Ambroise Duquenne, Brian Ellis, Hady Elsahar, Justin Haaheim, John Hoffman, Min-Jae Hwang, Hirofumi Inaguma, Christopher Klaiber, Ilia Kulikov, Pengwei Li, Daniel Licht, Jean Maillard, Ruslan Mavlyutov, Alice Rakotoarison, Kaushik Ram Sadagopan, Abinesh Ramakrishnan, Tuan Tran, Guillaume Wenzek, Yilin Yang, Ethan Ye, Ivan Evtimov, Pierre Fernandez, Cynthia Gao, Prangthip Hansanti, Elahe Kalbassi, Amanda Kallet, Artyom Kozhevnikov, Gabriel Mejia, Robin San Roman, Christophe Touret, Corinne Wong, Carleigh Wood, Bokai Yu, Pierre Andrews, Can Balioglu, Peng-Jen Chen, Marta R. Costa-juss{\`a}, Maha Elbayad, Hongyu Gong, Francisco Guzm{\'a}n, Kevin Heffernan, Somya Jain, Justine Kao, Ann Lee, Xutai Ma, Alex Mourachko, Benjamin Peloquin, Juan Pino, Sravya Popuri, Christophe Ropers, Safiyyah Saleem, Holger Schwenk, Anna Sun, Paden Tomasello, Changhan Wang, Jeff Wang, Skyler Wang, Mary Williamson", journal={ArXiv}, year={2023} } ``` [//]: # "https://arxiv.org/abs/2312.05187"