readme
Browse files
README.md
CHANGED
@@ -15,6 +15,9 @@ model-index:
|
|
15 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
16 |
should probably proofread and complete it, then remove this comment. -->
|
17 |
|
|
|
|
|
|
|
18 |
# ccdv/lsg-bart-base-16384-arxiv
|
19 |
|
20 |
This model is a fine-tuned version of [ccdv/lsg-bart-base-4096-arxiv](https://huggingface.co/ccdv/lsg-bart-base-4096-arxiv) on the scientific_papers arxiv dataset. \
|
@@ -31,8 +34,6 @@ The model relies on Local-Sparse-Global attention to handle long sequences:
|
|
31 |
|
32 |
The model has about ~145 millions parameters (6 encoder layers - 6 decoder layers). \
|
33 |
The model is warm started from [ccdv/lsg-bart-base-4096-arxiv](https://huggingface.co/ccdv/lsg-bart-base-4096-arxiv), converted to handle long sequences (encoder only) and fine tuned. \
|
34 |
-
**This model relies on a custom modeling file, you need to add trust_remote_code=True**\
|
35 |
-
**See [\#13467](https://github.com/huggingface/transformers/pull/13467)**
|
36 |
|
37 |
## Intended uses & limitations
|
38 |
|
|
|
15 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
16 |
should probably proofread and complete it, then remove this comment. -->
|
17 |
|
18 |
+
**This model relies on a custom modeling file, you need to add trust_remote_code=True**\
|
19 |
+
**See [\#13467](https://github.com/huggingface/transformers/pull/13467)**
|
20 |
+
|
21 |
# ccdv/lsg-bart-base-16384-arxiv
|
22 |
|
23 |
This model is a fine-tuned version of [ccdv/lsg-bart-base-4096-arxiv](https://huggingface.co/ccdv/lsg-bart-base-4096-arxiv) on the scientific_papers arxiv dataset. \
|
|
|
34 |
|
35 |
The model has about ~145 millions parameters (6 encoder layers - 6 decoder layers). \
|
36 |
The model is warm started from [ccdv/lsg-bart-base-4096-arxiv](https://huggingface.co/ccdv/lsg-bart-base-4096-arxiv), converted to handle long sequences (encoder only) and fine tuned. \
|
|
|
|
|
37 |
|
38 |
## Intended uses & limitations
|
39 |
|