Fairseq
Catalan
Portuguese
AudreyVM commited on
Commit
1e26999
1 Parent(s): 26dd04b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +0 -3
README.md CHANGED
@@ -60,13 +60,10 @@ The model was trained on a combination of the following datasets:
60
  | WikiMatrix | 358.873 | 317.649 |
61
  | GNOME | 5.211 | 1.752|
62
  | KDE4 | 166.208 | 117.828 |
63
- | QED | 53.635 | 43.736 |
64
- | TED2020 v1 | 48.942 | 41.461 |
65
  | OpenSubtitles | 384.142 | 235.604 |
66
  | GlobalVoices| 4.035 | 3.430|
67
  | Tatoeba | 754 | 723 |
68
  | Europarl | 1.692.106 | 1.631.989 |
69
- | **Total** | **15.391.745** | **6.159.631** |
70
 
71
  All corpora except Europarl were collected from [Opus](https://opus.nlpl.eu/).
72
  The Europarl corpus is a synthetic parallel corpus created from the original Spanish-Catalan corpus by [SoftCatalà](https://github.com/Softcatala/Europarl-catalan).
 
60
  | WikiMatrix | 358.873 | 317.649 |
61
  | GNOME | 5.211 | 1.752|
62
  | KDE4 | 166.208 | 117.828 |
 
 
63
  | OpenSubtitles | 384.142 | 235.604 |
64
  | GlobalVoices| 4.035 | 3.430|
65
  | Tatoeba | 754 | 723 |
66
  | Europarl | 1.692.106 | 1.631.989 |
 
67
 
68
  All corpora except Europarl were collected from [Opus](https://opus.nlpl.eu/).
69
  The Europarl corpus is a synthetic parallel corpus created from the original Spanish-Catalan corpus by [SoftCatalà](https://github.com/Softcatala/Europarl-catalan).