lewtun HF staff committed on
Commit
ef5cd17
1 Parent(s): cc82caa

Add evaluation results on scientific_papers dataset


Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the [scientific_papers](https://huggingface.co/datasets/scientific_papers) dataset by @lewtun, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-staging-eval-project-d47ba8c2-7654949).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=scientific_papers).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=scientific_papers).

Files changed (1)
  1. README.md +33 -0
README.md CHANGED
@@ -45,6 +45,39 @@ model-index:
       type: gen_len
       value: 221.2531
       verified: true
+  - task:
+      type: summarization
+      name: Summarization
+    dataset:
+      name: scientific_papers
+      type: scientific_papers
+      config: arxiv
+      split: test
+    metrics:
+    - name: ROUGE-1
+      type: rouge
+      value: 40.3815
+      verified: true
+    - name: ROUGE-2
+      type: rouge
+      value: 14.374
+      verified: true
+    - name: ROUGE-L
+      type: rouge
+      value: 23.4773
+      verified: true
+    - name: ROUGE-LSUM
+      type: rouge
+      value: 33.772
+      verified: true
+    - name: loss
+      type: loss
+      value: 3.235051393508911
+      verified: true
+    - name: gen_len
+      type: gen_len
+      value: 186.2003
+      verified: true
 ---
 
 # BigBirdPegasus model (large)
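
To use the numbers added by this commit programmatically, the following is a minimal sketch. It assumes the new `model-index` entry has already been parsed into a plain Python dict (for example by a YAML parser; that step is not shown). The dict name `added_result` and the helper `verified_metrics` are hypothetical and not part of any Hugging Face library; the values are copied from the diff above.

```python
# Hypothetical in-memory copy of the result entry added by this commit.
# The nesting mirrors the YAML keys in the diff; nothing here is fetched
# from the Hub.
added_result = {
    "task": {"type": "summarization", "name": "Summarization"},
    "dataset": {
        "name": "scientific_papers",
        "type": "scientific_papers",
        "config": "arxiv",
        "split": "test",
    },
    "metrics": [
        {"name": "ROUGE-1", "type": "rouge", "value": 40.3815, "verified": True},
        {"name": "ROUGE-2", "type": "rouge", "value": 14.374, "verified": True},
        {"name": "ROUGE-L", "type": "rouge", "value": 23.4773, "verified": True},
        {"name": "ROUGE-LSUM", "type": "rouge", "value": 33.772, "verified": True},
        {"name": "loss", "type": "loss", "value": 3.235051393508911, "verified": True},
        {"name": "gen_len", "type": "gen_len", "value": 186.2003, "verified": True},
    ],
}


def verified_metrics(result):
    """Return {metric name: value} for every metric flagged as verified."""
    return {
        metric["name"]: metric["value"]
        for metric in result.get("metrics", [])
        if metric.get("verified")
    }


if __name__ == "__main__":
    dataset = added_result["dataset"]
    print(f"{dataset['name']} ({dataset['config']}, {dataset['split']} split)")
    for name, value in verified_metrics(added_result).items():
        print(f"  {name}: {value}")
```

Keying the result by metric name keeps lookups such as `verified_metrics(added_result)["ROUGE-1"]` short. Note that all four ROUGE variants share `type: rouge`, so the `name` field is what distinguishes them.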