readme
Browse files
README.md
CHANGED
@@ -21,14 +21,14 @@ This model is a fine-tuned version of [ccdv/lsg-bart-base-4096-arxiv](https://hu
|
|
21 |
It achieves the following results on the test set:
|
22 |
|
23 |
| Length | Global tokens | Sparse Type | Block Size | Sparsity | Connexions | R1 | R2 | RL | RLsum |
|
24 |
-
|:------
|
25 |
| 16384 | 64 | - | 256 | 0 | 768 | 48.55 | 20.76 | 28.39 | 44.03 |
|
26 |
|
27 |
|
28 |
## Model description
|
29 |
|
30 |
The model has about ~145 millions parameters (6 encoder layers - 6 decoder layers). \
|
31 |
-
The model is warm started from
|
32 |
|
33 |
## Intended uses & limitations
|
34 |
|
|
|
21 |
It achieves the following results on the test set:
|
22 |
|
23 |
| Length | Global tokens | Sparse Type | Block Size | Sparsity | Connexions | R1 | R2 | RL | RLsum |
|
24 |
+
|:------ |:------------- |:----------- |:---------- |:-------- | :--------- |:----- |:----- |:----- |:----- |
|
25 |
| 16384 | 64 | - | 256 | 0 | 768 | 48.55 | 20.76 | 28.39 | 44.03 |
|
26 |
|
27 |
|
28 |
## Model description
|
29 |
|
30 |
The model has about ~145 millions parameters (6 encoder layers - 6 decoder layers). \
|
31 |
+
The model is warm started from [ccdv/lsg-bart-base-4096-arxiv](https://huggingface.co/ccdv/lsg-bart-base-4096-arxiv), converted to handle long sequences (encoder only) and fine tuned.
|
32 |
|
33 |
## Intended uses & limitations
|
34 |
|