lewtun HF staff committed on
Commit 8f66b46
1 Parent(s): d07f228

Add evaluation results on the plain_text config and validation split of launch/gov_report

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the plain_text config and validation split of the [launch/gov_report](https://huggingface.co/datasets/launch/gov_report) dataset by @nonchalant-nagavalli, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-staging-eval-launch__gov_report-plain_text-cd8e90-16116212).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=launch/gov_report).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=launch/gov_report).

Files changed (1)
  1. README.md +33 -0
@@ -384,6 +384,39 @@ model-index:
       type: gen_len
       value: 71.6372
       verified: true
+  - task:
+      type: summarization
+      name: Summarization
+    dataset:
+      name: launch/gov_report
+      type: launch/gov_report
+      config: plain_text
+      split: validation
+    metrics:
+    - name: ROUGE-1
+      type: rouge
+      value: 23.5925
+      verified: true
+    - name: ROUGE-2
+      type: rouge
+      value: 5.6762
+      verified: true
+    - name: ROUGE-L
+      type: rouge
+      value: 13.8108
+      verified: true
+    - name: ROUGE-LSUM
+      type: rouge
+      value: 20.2437
+      verified: true
+    - name: loss
+      type: loss
+      value: 2.6377077102661133
+      verified: true
+    - name: gen_len
+      type: gen_len
+      value: 64.1807
+      verified: true
 ---
 
 # README - long-t5-tglobal-base-16384-booksum-V11-big_patent-V2
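
For readers interpreting the ROUGE-1 and ROUGE-2 values added in this diff, the following is a minimal sketch of ROUGE-N as an F-measure over clipped n-gram overlap, scaled to 0-100 to match how the values above appear to be reported. It is not the evaluator's implementation: the function names `ngrams` and `rouge_n` are hypothetical, the example sentences are invented, and the lowercase whitespace tokenization is a simplifying assumption (real ROUGE implementations typically normalize punctuation and may apply stemming), so scores from this sketch will not reproduce the numbers in the diff.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return a multiset (Counter) of the n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def rouge_n(prediction, reference, n=1):
    """F1 of clipped n-gram overlap between two strings, scaled to 0-100.

    Assumption: tokens are lowercase whitespace-separated words.
    """
    pred = ngrams(prediction.lower().split(), n)
    ref = ngrams(reference.lower().split(), n)
    # Counter intersection keeps the minimum count of each shared n-gram,
    # which is the "clipped" overlap ROUGE-N is based on.
    overlap = sum((pred & ref).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(pred.values())
    recall = overlap / sum(ref.values())
    return 100 * 2 * precision * recall / (precision + recall)


# Invented example pair, for illustration only.
prediction = "the agency issued a report"
reference = "the agency released a new report"
print(round(rouge_n(prediction, reference, n=1), 2))  # 72.73
print(round(rouge_n(prediction, reference, n=2), 2))  # 22.22
```

ROUGE-2 drops sharply relative to ROUGE-1 in this toy pair because a single substituted word breaks two bigrams, which is consistent with the gap between the ROUGE-1 and ROUGE-2 values in the diff.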