Update README.md
Browse files
README.md
CHANGED
@@ -13,7 +13,9 @@ library_name: fairseq
|
|
13 |
|
14 |
## Model description
|
15 |
|
16 |
-
This model was trained from scratch using the Fairseq toolkit on a combination of datasets comprising both Catalan-Italian data sourced from Opus,
|
|
|
|
|
17 |
|
18 |
## Intended uses and limitations
|
19 |
|
|
|
13 |
|
14 |
## Model description
|
15 |
|
16 |
+
This model was trained from scratch using the Fairseq toolkit on a combination of datasets comprising both Catalan-Italian data sourced from Opus,
|
17 |
+
and additional datasets where synthetic Catalan was generated from the Spanish side of Spanish-Italian corpora using [Projecte Aina’s Spanish-Catalan model](https://huggingface.co/projecte-aina/aina-translator-es-ca).
|
18 |
+
This gave a total of approximately 100 million sentence pairs. The model is evaluated on the Flores, NTEU and NTREX evaluation sets.
|
19 |
|
20 |
## Intended uses and limitations
|
21 |
|