AshtonIsNotHere
/

xlm-roberta-long-base-4096

Inference Endpoints

Model card Files Files and versions Community

AshtonIsNotHere commited on Nov 30, 2022

Commit

c240683

•

1 Parent(s): a7580aa

Updated README for clarity

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -12,7 +12,7 @@ datasets:
 ## XLM-R Longformer Model  / XLM-Long
 This is an XLM-RoBERTa longformer model that was pre-trained from the XLM-RoBERTa checkpoint using the Longformer [pre-training scheme](https://github.com/allenai/longformer/blob/master/scripts/convert_model_to_long.ipynb) on the English WikiText-103 corpus.
-This model is identical to [markussagen's xlm-r longformer model,](https://huggingface.co/markussagen/xlm-roberta-longformer-base-4096) the difference being that the weights have been transferred to a Longformer model, in order to enable loading with ```.from_pretrained()```.
 ## How to Use
 The model can be used as expected to fine-tune on a downstream task.

 ## XLM-R Longformer Model  / XLM-Long
 This is an XLM-RoBERTa longformer model that was pre-trained from the XLM-RoBERTa checkpoint using the Longformer [pre-training scheme](https://github.com/allenai/longformer/blob/master/scripts/convert_model_to_long.ipynb) on the English WikiText-103 corpus.
+This model is identical to [markussagen's xlm-r longformer model,](https://huggingface.co/markussagen/xlm-roberta-longformer-base-4096) the difference being that the weights have been transferred to a Longformer model, in order to enable loading with ```AutoModel.from_pretrained()``` without the need for external libraries.
 ## How to Use
 The model can be used as expected to fine-tune on a downstream task.