lewtun HF staff committed on
Commit
b85c159
1 Parent(s): 4a68718

Add evaluation results on scientific_papers dataset

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the [scientific_papers](https://huggingface.co/datasets/scientific_papers) dataset by @lewtun, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-staging-eval-project-d47ba8c2-7654948).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=scientific_papers).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=scientific_papers).

Files changed (1)
  1. README.md +36 -1
README.md CHANGED
@@ -1,11 +1,46 @@
 ---
-
 language: en
 license: apache-2.0
 datasets:
 - scientific_papers
 tags:
 - summarization
+model-index:
+- name: google/bigbird-pegasus-large-arxiv
+  results:
+  - task:
+      type: summarization
+      name: Summarization
+    dataset:
+      name: scientific_papers
+      type: scientific_papers
+      config: arxiv
+      split: test
+    metrics:
+    - name: ROUGE-1
+      type: rouge
+      value: 43.4702
+      verified: true
+    - name: ROUGE-2
+      type: rouge
+      value: 17.4297
+      verified: true
+    - name: ROUGE-L
+      type: rouge
+      value: 26.2587
+      verified: true
+    - name: ROUGE-LSUM
+      type: rouge
+      value: 35.5587
+      verified: true
+    - name: loss
+      type: loss
+      value: 2.1113228797912598
+      verified: true
+    - name: gen_len
+      type: gen_len
+      value: 183.3702
+      verified: true
 ---
 
 # BigBirdPegasus model (large)
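
The `model-index` block added above pairs each metric `name` with a `value`. As a rough illustration of how those pairs could be read back out of the front matter, here is a minimal standard-library sketch. The helper `read_metrics` and the constant `FRONT_MATTER` are hypothetical names introduced for this example, the nesting shown is assumed from the usual `model-index` layout, and only a subset of the metrics is reproduced. A real YAML parser would be the more robust choice in practice.

```python
# Subset of the front matter added by this commit. The indentation is an
# assumption based on the usual model-index layout; the parser below does
# not depend on it.
FRONT_MATTER = """\
model-index:
- name: google/bigbird-pegasus-large-arxiv
  results:
  - task:
      type: summarization
      name: Summarization
    dataset:
      name: scientific_papers
      type: scientific_papers
      config: arxiv
      split: test
    metrics:
    - name: ROUGE-1
      type: rouge
      value: 43.4702
      verified: true
    - name: ROUGE-2
      type: rouge
      value: 17.4297
      verified: true
    - name: gen_len
      type: gen_len
      value: 183.3702
      verified: true
"""


def read_metrics(front_matter: str) -> dict[str, float]:
    """Collect {metric name: value} pairs that follow the `metrics:` key.

    Hypothetical helper: a line-based scan, not a general YAML parser.
    """
    metrics: dict[str, float] = {}
    in_metrics = False  # becomes True once the `metrics:` key is seen
    current = None      # name of the metric entry currently being read
    for line in front_matter.splitlines():
        stripped = line.strip()
        if stripped == "metrics:":
            in_metrics = True
        elif in_metrics and stripped.startswith("- name:"):
            current = stripped.split(":", 1)[1].strip()
        elif in_metrics and stripped.startswith("value:") and current is not None:
            metrics[current] = float(stripped.split(":", 1)[1])
            current = None
    return metrics


if __name__ == "__main__":
    for name, value in read_metrics(FRONT_MATTER).items():
        print(f"{name}: {value}")
```

The scan only starts collecting after `metrics:`, so the `name` fields under `task` and `dataset` are not mistaken for metric names.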