---
language: en
license: apache-2.0
tags:
- summarization
datasets: arxiv-summarization
model-index:
- name: ArtifactAI/led_large_16384_arxiv_summarization
  results:
  - task:
      type: summarization
      name: Summarization
    dataset:
      name: ccdv/arxiv-summarization
      type: ccdv/arxiv-summarization
      config: section
      split: test
    metrics:
    - type: rouge
      value: 37.9472
      name: ROUGE-1
      verified: true
      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiZDFkMzY4YTk0NGUyNDJjYzc2MWFiMGJlNWUyYTM2YjlmNjlkY2VkYmVhMDk2YjIxMjE3MjE4M2ZkOTAwODE2ZSIsInZlcnNpb24iOjF9.t2x5mqi0xM9Q0K9MscHZ6v_5pc-MOw8KieFTvFMqh5K4UAvvvcVGOGfGQi_Qb57gQa2DkrW0cNrJADY0VA1tAQ
    - type: rouge
      value: 11.3138
      name: ROUGE-2
      verified: true
      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNjdlYmQ4ZmRkNzc3YzE0NGQ2MTRhNDE4YTExNDYwYmNjODFhYjdmYTJlZWE4OTRhYWRiZmNmODZkMDZjMWY3NSIsInZlcnNpb24iOjF9.RPWY5CZMjaFaQ1vRQPoHyZxPD67dQdbXYL0UlJ53b_q1dMczXb7HtE_UmDNPi6F7thciVt6xWIzsckVmp9ZJCw
    - type: rouge
      value: 20.5557
      name: ROUGE-L
      verified: true
      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiYWEwNTQ5MWViZTYwM2EyNzI0OWEyZDNlY2ExOTJiMjI3MmNjM2I4YmJjMzljYTQ3NjhkNjAzYzM5MDQzYjVkOCIsInZlcnNpb24iOjF9.ZgSkTbiUDaQRJGBIXjlTZKbtKmrIljEJ6btwhyfBsaz5oS0qmI76-b_vDRswnx96OcGTqdxICIjma6jgNbKiBA
    - type: rouge
      value: 33.8336
      name: ROUGE-LSUM
      verified: true
      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiY2EzNzNhMWVmYjM5ZWUwOTZkYjU0MGZjMWQ0YTQ1NzA1NWQ4MjBjNjNhM2FmMmE3MmM3NzQwMzVkN2QzMzQxZiIsInZlcnNpb24iOjF9.bhxtgWXjCEv5ZFY3F7Mp-r4EHrIU8BNZ8X2zhpjSoyVLmjbfdFB-lnJdoH3PfVZEa14T96SJqMSHa6yzlqGEAQ
    - type: loss
      value: 2.8064792156219482
      name: loss
      verified: true
      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiYzBhMTE0ZTdhOTRmYWE1Mjk5ZmViYjZiMjBmNzc2YzQ4YmNhYWM3NzRjYWUwYTEyZjU1NGI5MjVhODQwOTBlNCIsInZlcnNpb24iOjF9.l0nIJCcjoFyPF9M7MHiQxBQ3wtyk6jXURY0ZF6Xny3_DpkDh5YHs9kF494GJp5eYj6XG5HRGCgqhfmU7-fywAw
    - type: gen_len
      value: 157.4174
      name: gen_len
      verified: true
      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNDY0ZmE4M2VmOTU1NWY5M2I4YTYxNjM3NTkxNWU4NDY3N2Y0MTM1YWNlNmNjMGQ4N2UzM2ZkZWJhZTVmMjQ2OCIsInZlcnNpb24iOjF9.sAp6g7nt1tKTdGfOlGm3fdxzH1jxjNOZO65BNnVJkxDhu86j8QP3ZvNPv7PpD2sK4p6yM_HlHPPeX4bgmDi2BQ
---

## Introduction

A led-large-16384 model fine-tuned to summarize arXiv papers. The input is the text of a paper (its abstract or the full document), and the output is a summary of the paper.

The model is based on [AllenAI's Longformer Encoder-Decoder (LED)](https://github.com/allenai/longformer#longformer).

As described in [Longformer: The Long-Document Transformer](https://arxiv.org/pdf/2004.05150.pdf) by Iz Beltagy, Matthew E. Peters, and Arman Cohan, *led-base-16384* was initialized from [*bart-base*](https://huggingface.co/facebook/bart-base), since both models share the exact same architecture. To process 16K tokens, *bart-base*'s position embedding matrix was simply copied 16 times.
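
The position-embedding copy described above can be illustrated with a toy sketch. This is not the actual conversion code used for LED: the helper `extend_position_embeddings` and the tiny 4-row table are hypothetical, plain Python lists stand in for tensors, and details of the real embedding table (such as any padding offset) are ignored. It only shows the idea of tiling a learned table 16 times, so that 1,024 positions become 16,384.

```python
def extend_position_embeddings(pos_emb, copies):
    """Return a new table made of `copies` back-to-back copies of `pos_emb`.

    Hypothetical helper for illustration only; `pos_emb` is a list of rows,
    one row (list of floats) per position.
    """
    extended = []
    for _ in range(copies):
        # Copy each row so the extended table does not alias the original rows.
        extended.extend(list(row) for row in pos_emb)
    return extended


# Toy stand-in for a learned table: 4 positions, 2 dimensions each
# (made-up values, not real model weights).
toy_table = [[0.0, 0.1], [1.0, 1.1], [2.0, 2.1], [3.0, 3.1]]
toy_extended = extend_position_embeddings(toy_table, copies=16)

# The same arithmetic at real scale: BART supports 1,024 positions,
# so 16 copies give the 16,384 positions in the model name.
BART_MAX_POSITIONS = 1024
LED_MAX_POSITIONS = BART_MAX_POSITIONS * 16

print(len(toy_extended), LED_MAX_POSITIONS)
```

In the toy table, position 4 of the extended table reuses the embedding of position 0, position 5 reuses position 1, and so on; fine-tuning can then adapt the copied rows to their new positions.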