ccdv
/

lsg-bart-base-4096-wcep

text2text-generation

Model card Files Files and versions Community

ccdv commited on May 25, 2022

Commit

81f9e70

•

1 Parent(s): 59f8fad

readme

Files changed (2) hide show

README.md +1 -1
attn.png +0 -0

README.md CHANGED Viewed

@@ -60,7 +60,7 @@ The model relies on Local-Sparse-Global attention to handle long sequences:
 ![attn](attn.png)
 The model has about ~145 millions parameters (6 encoder layers - 6 decoder layers). \
-The model is warm started from BART-base, converted to handle long sequences (encoder only) and fine tuned. \
 ## Intended uses & limitations

 ![attn](attn.png)
 The model has about ~145 millions parameters (6 encoder layers - 6 decoder layers). \
+The model is warm started from BART-base, converted to handle long sequences (encoder only) and fine tuned.
 ## Intended uses & limitations

attn.png ADDED Viewed