fdelucaf committed on
Commit 77b7923 (1 parent: 8b37d89)

Update README.md

Files changed (1):
  1. README.md +3 -1
@@ -13,7 +13,9 @@ library_name: fairseq
 
 ## Model description
 
-This model was trained from scratch using the Fairseq toolkit on a combination of datasets comprising both Catalan-Italian data sourced from Opus, and additional datasets where synthetic Catalan was generated from the Spanish side of Spanish-Italian corpora using Projecte Aina’s Spanish-Catalan model. This gave a total of approximately 100 million sentence pairs. The model is evaluated on the Flores, NTEU and NTREX evaluation sets.  
+This model was trained from scratch using the Fairseq toolkit on a combination of datasets comprising both Catalan-Italian data sourced from Opus,
+and additional datasets where synthetic Catalan was generated from the Spanish side of Spanish-Italian corpora using [Projecte Aina’s Spanish-Catalan model](https://huggingface.co/projecte-aina/aina-translator-es-ca).
+This gave a total of approximately 100 million sentence pairs. The model is evaluated on the Flores, NTEU and NTREX evaluation sets.  
 
 ## Intended uses and limitations
 