fdelucaf committed on
Commit
fcde9f4
1 Parent(s): 866d69b

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -72,7 +72,7 @@ The Galician-Catalan data collected from the web was a combination of the follow
  |Covost 2 | 263.729 |
  |Gene-Crawling | 38.320 |
  |Memories Projectes Lliures | 794.631 |
- | **Total** | **4.92.275** |
+ | **Total** | **4.952.275** |
 
  The datasets were concatenated before filtering to avoid intra-dataset duplicates and the final size was 4.267.995.
  The 5.750.000 sentence pairs of synthetic parallel data were created from a random sampling of the [Projecte Aina ES-CA corpus](https://huggingface.co/projecte-aina/mt-aina-ca-es)