# Model Details

French-to-English Machine Translation model trained by Yasmin Moslem.

This model depends on the Transformer (base) architecture.

The model was originally trained with OpenNMT-py and then converted to the CTranslate2 format for efficient inference.
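Since the released weights are in the CTranslate2 format, inference goes through the CTranslate2 Python API (the conversion itself is typically done with CTranslate2's `ct2-opennmt-py-converter` tool). The sketch below is illustrative only: the model directory name, the example tokens, and the availability guards are assumptions, not the repository's actual filenames.

```python
import os

try:
    import ctranslate2  # pip install ctranslate2
except ImportError:     # keep the sketch importable without the package
    ctranslate2 = None

MODEL_DIR = "fren_ctranslate2"  # hypothetical path to the converted model

def detokenize(pieces):
    # SentencePiece marks word boundaries with U+2581 ("▁").
    return "".join(pieces).replace("\u2581", " ").strip()

if ctranslate2 is not None and os.path.isdir(MODEL_DIR):
    translator = ctranslate2.Translator(MODEL_DIR, device="cpu")
    # translate_batch expects pre-tokenized input (see the Tokenizer section).
    results = translator.translate_batch([["▁Bonjour", "▁le", "▁monde", "."]])
    print(detokenize(results[0].hypotheses[0]))
```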
## Tools

…
This model is trained on the French-to-English portion of the [UN Corpus](https://conferences.unite.un.org/UNCorpus/),
consisting of approx. 20 million segments.
## Tokenizer

The tokenizer was trained using [SentencePiece](https://github.com/google/sentencepiece) on a shared vocabulary.
Hence, a single SentencePiece model can be used to tokenize both the source and target texts.
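In practice, the shared vocabulary means one processor covers both directions: it encodes the French source before translation and decodes the English target after. A minimal sketch, assuming a hypothetical model filename (`shared.model`) and file-existence guards added purely for illustration:

```python
try:
    import sentencepiece as spm  # pip install sentencepiece
except ImportError:              # keep the sketch importable without the package
    spm = None

SPM_MODEL = "shared.model"  # hypothetical filename of the single shared model

def encode_source(sp, text):
    # French input -> SentencePiece pieces for the translator.
    return sp.encode(text, out_type=str)

def decode_target(sp, pieces):
    # Translator output pieces -> plain English text.
    return sp.decode(pieces)

if spm is not None:
    import os
    if os.path.isfile(SPM_MODEL):
        sp = spm.SentencePieceProcessor(model_file=SPM_MODEL)
        pieces = encode_source(sp, "Bonjour le monde.")
        print(decode_target(sp, pieces))  # same model on both sides
```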
## Demo

A demo of this model is available at: https://www.machinetranslation.io/