projecte-aina
/

aina-translator-gl-ca

Model card Files Files and versions Community

fdelucaf commited on Dec 14, 2023

Commit

866d69b

•

1 Parent(s): 2c50863

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -74,7 +74,7 @@ The Galician-Catalan data collected from the web was a combination of the follow
 |Memories Projectes Lliures | 794.631 |
 | **Total**     	| **4.92.275** |
-The datasets were concatentated before filtering to avoid intra-dataset duplicates and the final size was 4.267.995.
 The 5.750.000 sentence pairs of synthetic parallel data were created from a random sampling of the [Projecte Aina ES-CA corpus](https://huggingface.co/projecte-aina/mt-aina-ca-es)
 ### Training procedure

 |Memories Projectes Lliures | 794.631 |
 | **Total**     	| **4.92.275** |
+The datasets were concatenated before filtering to avoid intra-dataset duplicates and the final size was 4.267.995.
 The 5.750.000 sentence pairs of synthetic parallel data were created from a random sampling of the [Projecte Aina ES-CA corpus](https://huggingface.co/projecte-aina/mt-aina-ca-es)
 ### Training procedure