gchhablani committed on
Commit
8c75388
1 Parent(s): b1c8475

Add README.md

Files changed (1)
  1. README.md +2 -2
README.md CHANGED
@@ -31,11 +31,11 @@ was pretrained with two objectives:
 predict if the two sentences were following each other or not.
 This way, the model learns an inner representation of the English language that can then be used to extract features
 useful for downstream tasks: if you have a dataset of labeled sentences for instance, you can train a standard
-classifier using the features produced by the BERT model as inputs.
+classifier using the features produced by the MultiBERTs model as inputs.
 
 ## Intended uses & limitations
 You can use the raw model for either masked language modeling or next sentence prediction, but it's mostly intended to
-be fine-tuned on a downstream task. See the [model hub](https://huggingface.co/models?filter=bert) to look for
+be fine-tuned on a downstream task. See the [model hub](https://huggingface.co/models?filter=multiberts) to look for
 fine-tuned versions on a task that interests you.
 Note that this model is primarily aimed at being fine-tuned on tasks that use the whole sentence (potentially masked)
 to make decisions, such as sequence classification, token classification or question answering. For tasks such as text
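
The README text in this diff describes training a standard classifier on features produced by the MultiBERTs model. Below is a minimal sketch of that feature-extraction step, assuming the `transformers` library; the checkpoint ID `google/multiberts-seed_0` is an assumption (one of the published MultiBERTs seeds; check the model hub link above for the exact names):

```python
# Minimal sketch: extract MultiBERTs features to use as classifier inputs.
# Assumptions: `transformers` and `torch` are installed, and
# "google/multiberts-seed_0" is a valid MultiBERTs checkpoint ID
# (any seed from the model hub works the same way).
import torch
from transformers import BertModel, BertTokenizer

tokenizer = BertTokenizer.from_pretrained("google/multiberts-seed_0")
model = BertModel.from_pretrained("google/multiberts-seed_0")

text = "Replace me by any text you'd like."
inputs = tokenizer(text, return_tensors="pt")

with torch.no_grad():
    outputs = model(**inputs)

# One feature vector per token; pool these (for example, take the [CLS]
# vector at position 0) and feed the result to a downstream classifier.
features = outputs.last_hidden_state
cls_vector = features[:, 0, :]  # shape: (batch, hidden_size)
```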