ccdv commited on
Commit
46085ee
1 Parent(s): b274270
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -21,14 +21,14 @@ This model is a fine-tuned version of [ccdv/lsg-bart-base-4096-arxiv](https://hu
21
  It achieves the following results on the test set:
22
 
23
  | Length | Global tokens | Sparse Type | Block Size | Sparsity | Connexions | R1 | R2 | RL | RLsum |
24
- |:------ |:--------- |:---------- |:-------- | :--------- |:----- |:----- |:----- |:----- |
25
  | 16384 | 64 | - | 256 | 0 | 768 | 48.55 | 20.76 | 28.39 | 44.03 |
26
 
27
 
28
  ## Model description
29
 
30
  The model has about ~145 millions parameters (6 encoder layers - 6 decoder layers). \
31
- The model is warm started from BART-base, converted to handle long sequences (encoder only) and fine tuned.
32
 
33
  ## Intended uses & limitations
34
 
 
21
  It achieves the following results on the test set:
22
 
23
  | Length | Global tokens | Sparse Type | Block Size | Sparsity | Connexions | R1 | R2 | RL | RLsum |
24
+ |:------ |:------------- |:----------- |:---------- |:-------- | :--------- |:----- |:----- |:----- |:----- |
25
  | 16384 | 64 | - | 256 | 0 | 768 | 48.55 | 20.76 | 28.39 | 44.03 |
26
 
27
 
28
  ## Model description
29
 
30
  The model has about ~145 millions parameters (6 encoder layers - 6 decoder layers). \
31
+ The model is warm started from [ccdv/lsg-bart-base-4096-arxiv](https://huggingface.co/ccdv/lsg-bart-base-4096-arxiv), converted to handle long sequences (encoder only) and fine tuned.
32
 
33
  ## Intended uses & limitations
34