ccdv/lsg-bart-base-16384-arxiv

text2text-generation


ccdv committed on May 9, 2022

Commit 46085ee · 1 Parent(s): b274270

readme

Files changed (1)

README.md +2 -2


@@ -21,14 +21,14 @@ This model is a fine-tuned version of [ccdv/lsg-bart-base-4096-arxiv](https://hu
 It achieves the following results on the test set:
 | Length | Global tokens | Sparse Type | Block Size | Sparsity | Connexions | R1    | R2    | RL    | RLsum |
-|:------ |:--------- |:---------- |:-------- | :--------- |:----- |:----- |:----- |:----- |
+|:------ |:------------- |:----------- |:---------- |:-------- | :--------- |:----- |:----- |:----- |:----- |
 | 16384  | 64            | -           | 256        | 0        | 768        | 48.55 | 20.76 | 28.39 | 44.03 |
 ## Model description
 The model has about ~145 millions parameters (6 encoder layers - 6 decoder layers). \
-The model is warm started from BART-base, converted to handle long sequences (encoder only) and fine tuned.
+The model is warm started from [ccdv/lsg-bart-base-4096-arxiv](https://huggingface.co/ccdv/lsg-bart-base-4096-arxiv), converted to handle long sequences (encoder only) and fine tuned.
 ## Intended uses & limitations
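
The first change in this diff touches only the table's separator row. Counting cells suggests why: the old separator appears to have one cell fewer than the header, which many Markdown renderers refuse to treat as a table. The sketch below is a rough check, not part of the repository; `count_cells` is a hypothetical helper and it assumes cells never contain escaped pipes.

```python
def count_cells(row: str) -> int:
    """Count the cells in a pipe-delimited Markdown table row.

    Assumption: cells contain no escaped pipes (\\|), which holds for the rows in this diff.
    """
    # Drop surrounding whitespace and the outer pipes, then split on the inner pipes.
    return len(row.strip().strip("|").split("|"))


# Rows copied verbatim from the diff.
header = "| Length | Global tokens | Sparse Type | Block Size | Sparsity | Connexions | R1    | R2    | RL    | RLsum |"
old_separator = "|:------ |:--------- |:---------- |:-------- | :--------- |:----- |:----- |:----- |:----- |"
new_separator = "|:------ |:------------- |:----------- |:---------- |:-------- | :--------- |:----- |:----- |:----- |:----- |"

print(count_cells(header), count_cells(old_separator), count_cells(new_separator))
# prints: 10 9 10
```

With the new separator the cell counts agree, so the results row should render as a table.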