ArthurMalajyan commited on
Commit
2532b19
1 Parent(s): 161c5f6

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +15 -3
README.md CHANGED
@@ -15,14 +15,14 @@ tags:
15
 
16
  # SeamlessM4T v2 ASR for Western Armenian
17
 
18
- This model is a fine-tuned version of the [facebook/seamless-m4t-v2-large](https://huggingface.co/facebook/seamless-m4t-v2-large). Initially, it was fine-tuned on the Common Voice 16.1 and Google Fleurs datasets. Subsequently, it was further fine-tuned on the [ReRooted](https://github.com/jhdeov/ReRooted-ArmenianCorpus) corpus.
19
  The model achieves the following results on the test sets:
20
  - CV_wer: 0.308
21
  - CV_cer: 0.07
22
  - GF_wer: 0.311
23
  - GF_cer: 0.094
24
 
25
- After fine-tuning on Western Armenian data, the model occasionally translates Eastern Armenian speech into Western Armenian.
26
 
27
  ### Training hyperparameters
28
 
@@ -37,4 +37,16 @@ The following hyperparameters were used during training:
37
 
38
  ### Framework versions
39
  - Pytorch 2.1.1
40
- - fairseq2==0.2.0
 
 
 
 
 
 
 
 
 
 
 
 
 
15
 
16
  # SeamlessM4T v2 ASR for Western Armenian
17
 
18
+ This model is a fine-tuned version of the [facebook/seamless-m4t-v2-large](https://huggingface.co/facebook/seamless-m4t-v2-large) for the ASR task. Initially, it was fine-tuned on the Common Voice 16.1 and Google Fleurs datasets. Subsequently, it was further fine-tuned on the [ReRooted](https://github.com/jhdeov/ReRooted-ArmenianCorpus) corpus.
19
  The model achieves the following results on the test sets:
20
  - CV_wer: 0.308
21
  - CV_cer: 0.07
22
  - GF_wer: 0.311
23
  - GF_cer: 0.094
24
 
25
+ After fine-tuning on Western Armenian data, the model occasionally translates Eastern Armenian speech into Western Armenian(Colab link: [Test SeamlessM4T v2 ASR for Western Armenian](https://colab.research.google.com/drive/16TyabwvSU7fR54x0xzgkTMhtArCibZ5H?usp=sharing)).
26
 
27
  ### Training hyperparameters
28
 
 
37
 
38
  ### Framework versions
39
  - Pytorch 2.1.1
40
+ - Fairseq2 0.2.0
41
+
42
+ ## Citation
43
+ For SeamlessM4T v2, please cite :
44
+ ```bibtex
45
+ @inproceedings{seamless2023,
46
+ title="Seamless: Multilingual Expressive and Streaming Speech Translation",
47
+ author="{Seamless Communication}, Lo{\"i}c Barrault, Yu-An Chung, Mariano Coria Meglioli, David Dale, Ning Dong, Mark Duppenthaler, Paul-Ambroise Duquenne, Brian Ellis, Hady Elsahar, Justin Haaheim, John Hoffman, Min-Jae Hwang, Hirofumi Inaguma, Christopher Klaiber, Ilia Kulikov, Pengwei Li, Daniel Licht, Jean Maillard, Ruslan Mavlyutov, Alice Rakotoarison, Kaushik Ram Sadagopan, Abinesh Ramakrishnan, Tuan Tran, Guillaume Wenzek, Yilin Yang, Ethan Ye, Ivan Evtimov, Pierre Fernandez, Cynthia Gao, Prangthip Hansanti, Elahe Kalbassi, Amanda Kallet, Artyom Kozhevnikov, Gabriel Mejia, Robin San Roman, Christophe Touret, Corinne Wong, Carleigh Wood, Bokai Yu, Pierre Andrews, Can Balioglu, Peng-Jen Chen, Marta R. Costa-juss{\`a}, Maha Elbayad, Hongyu Gong, Francisco Guzm{\'a}n, Kevin Heffernan, Somya Jain, Justine Kao, Ann Lee, Xutai Ma, Alex Mourachko, Benjamin Peloquin, Juan Pino, Sravya Popuri, Christophe Ropers, Safiyyah Saleem, Holger Schwenk, Anna Sun, Paden Tomasello, Changhan Wang, Jeff Wang, Skyler Wang, Mary Williamson",
48
+ journal={ArXiv},
49
+ year={2023}
50
+ }
51
+ ```
52
+ [//]: # "https://arxiv.org/abs/2312.05187"