typo: encoder-encoder -> encoder-decoder
Browse filesBART is an encoder-decoder, not an encoder-encoder.
> It uses a standard Tranformer-based neural machine translation architecture which, despite its simplicity, can be seen as generalizing BERT (due to the bidirectional encoder), GPT (with the left-to-right decoder), and many other more recent pretraining schemes.
(https://arxiv.org/abs/1910.13461)
README.md
CHANGED
@@ -11,7 +11,7 @@ Disclaimer: The team releasing BART did not write a model card for this model so
|
|
11 |
|
12 |
## Model description
|
13 |
|
14 |
-
BART is a transformer encoder-
|
15 |
|
16 |
BART is particularly effective when fine-tuned for text generation (e.g. summarization, translation) but also works well for comprehension tasks (e.g. text classification, question answering).
|
17 |
|
|
|
11 |
|
12 |
## Model description
|
13 |
|
14 |
+
BART is a transformer encoder-decoder (seq2seq) model with a bidirectional (BERT-like) encoder and an autoregressive (GPT-like) decoder. BART is pre-trained by (1) corrupting text with an arbitrary noising function, and (2) learning a model to reconstruct the original text.
|
15 |
|
16 |
BART is particularly effective when fine-tuned for text generation (e.g. summarization, translation) but also works well for comprehension tasks (e.g. text classification, question answering).
|
17 |
|