- text: Quito es la capital de <mask>
---

# Longformer Encoder-Decoder Spanish (LEDO) (base-sized model)

LEDO is based on [BARTO](https://huggingface.co/vgaraujov/bart-base-spanish) and was introduced in the paper [Sequence-to-Sequence Spanish Pre-trained Language Models](https://arxiv.org/abs/2309.11259).

## Model description

LEDO is a BART-based model (transformer encoder-decoder) with a bidirectional (BERT-like) encoder and an autoregressive (GPT-like) decoder. BART is pre-trained by (1) corrupting text with an arbitrary noising function and (2) learning a model to reconstruct the original text.

To process 16K tokens, BARTO's position embedding matrix was simply copied 16 times.
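The position-embedding extension described above can be sketched as a simple tiling of the learned matrix. This is a minimal illustration with stand-in shapes (1024 positions, hidden size 768 are assumptions for the sketch), not the actual conversion script used for LEDO:

```python
import numpy as np

# Minimal sketch, assuming BARTO has 1024 learned positions with hidden size 768
# (illustrative stand-in values; a real conversion would load the checkpoint weights).
hidden_size = 768
barto_positions = 1024
barto_pos_emb = np.random.randn(barto_positions, hidden_size)

# Copy the position embedding matrix 16 times along the position axis,
# so positions up to 16 * 1024 = 16384 tokens can be indexed.
ledo_pos_emb = np.tile(barto_pos_emb, (16, 1))
print(ledo_pos_emb.shape)  # (16384, 768)
```

Each 1024-row block of the tiled matrix is an exact copy of the original embeddings, so the extended model starts from position representations it has already learned.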