Update README.md
README.md CHANGED

```diff
@@ -26,7 +26,7 @@ license: apache-2.0
 
 ## Model description
 
-This model was trained from scratch using the [Fairseq toolkit](https://fairseq.readthedocs.io/en/latest/) on a combination of Catalan-French datasets, which after filtering and cleaning comprised 18.634.844 sentence pairs. The model is evaluated on the Flores
+This model was trained from scratch using the [Fairseq toolkit](https://fairseq.readthedocs.io/en/latest/) on a combination of Catalan-French datasets, which after filtering and cleaning comprised 18.634.844 sentence pairs. The model is evaluated on the Flores and NTREX evaluation sets.
 
 ## Intended uses and limitations
 
@@ -83,7 +83,7 @@ All datasets are deduplicated and filtered to remove any sentence pairs with a c
 
 #### Tokenization
 
-All data is tokenized using sentencepiece, with 50 thousand token sentencepiece model
+All data is tokenized using sentencepiece, with 50 thousand token sentencepiece model learned from the combination of all filtered training data. This model is included.
 
 #### Hyperparameters
 
```