google/canine-s

@@ -14,7 +14,13 @@ Pretrained CANINE model on English language using a masked language modeling (ML
 What's special about CANINE is that, unlike models such as BERT and RoBERTa, it doesn't require an explicit tokenizer (such as WordPiece or SentencePiece). Instead, it operates directly at the character level: each character is turned into its [Unicode code point](https://en.wikipedia.org/wiki/Code_point#:~:text=For%20Unicode%2C%20the%20particular%20sequence,forming%20a%20self%2Dsynchronizing%20code.).
-This means that input processing is trivial and can typically be accomplished as: `input_ids = [ord(char) for char in text]`, using the built-in ord() function in Python.
 Disclaimer: The team releasing CANINE did not write a model card for this model so this model card has been written by the Hugging Face team.

 What's special about CANINE is that, unlike models such as BERT and RoBERTa, it doesn't require an explicit tokenizer (such as WordPiece or SentencePiece). Instead, it operates directly at the character level: each character is turned into its [Unicode code point](https://en.wikipedia.org/wiki/Code_point#:~:text=For%20Unicode%2C%20the%20particular%20sequence,forming%20a%20self%2Dsynchronizing%20code.).
+This means that input processing is trivial and can typically be accomplished as:
+```
+input_ids = [ord(char) for char in text]
+```
+`ord()` is a Python built-in that returns the Unicode code point of a single character.
 Disclaimer: The team releasing CANINE did not write a model card for this model so this model card has been written by the Hugging Face team.
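
For reviewers, the character-to-code-point conversion described in this change can be sketched as a small round trip. The helper names `text_to_input_ids` and `input_ids_to_text` are hypothetical and exist only for this illustration. The sketch covers only the raw `ord()` mapping; any special tokens, padding, or truncation that a tokenizer shipped with the model may apply are assumed to be out of scope here.

```python
from typing import List


def text_to_input_ids(text: str) -> List[int]:
    """Map each character to its Unicode code point (hypothetical helper)."""
    return [ord(char) for char in text]


def input_ids_to_text(input_ids: List[int]) -> str:
    """Invert the mapping with chr() (hypothetical helper)."""
    return "".join(chr(code_point) for code_point in input_ids)


ids = text_to_input_ids("héllo")
print(ids)                     # [104, 233, 108, 108, 111]
print(input_ids_to_text(ids))  # héllo
```

Because the mapping is one code point per character, non-ASCII characters such as `é` need no special handling, and the sequence length equals the number of characters in the string.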