autoevaluator HF staff commited on
Commit
ba45c6e
1 Parent(s): 136181d

Add evaluation results on the default config and test split of xsum

Browse files

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the default config and test split of the [xsum](https://huggingface.co/datasets/xsum) dataset by

@zuzannad1

, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-eval-xsum-default-8e4fa8-60494145409).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=xsum).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=xsum).

Files changed (1) hide show
  1. README.md +10 -10
README.md CHANGED
@@ -17,30 +17,30 @@ model-index:
17
  split: test
18
  metrics:
19
  - type: rouge
20
- value: 38.643
21
  name: ROUGE-1
22
  verified: true
23
- verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiY2E1MTczZDhmODNjNjY2NzU5ZTJlYjM0ZWQ4YmUzN2NhYTExOGYxZTU5YmU5YThjM2FiZmVhMzU5OGE2NGZhNSIsInZlcnNpb24iOjF9.jdp1DkzoLLLpNknLrIC8oxcOKt0si9iK7r3qMuh2UVzSeHr8aG3kMNjpybMw3C9hhb2ebXzUpWok2ILvRSZTBw
24
  - type: rouge
25
- value: 17.7546
26
  name: ROUGE-2
27
  verified: true
28
- verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiMDViMTgzYTQxNzFkZTg0NWY2MjgyNGQ0MmVhMDBkMjMyMTllZWM0YzE5ZjQyMmEwY2QxYTViMWMzMDQwNzdiNiIsInZlcnNpb24iOjF9.ja641EZwDrll0akyPOATo9Rqj1uaCpAftziHd0mi5ZuLqCUZsh8H0OLfjvZLNK1JwtkMi3n_P_8UYvmG1tuiAQ
29
  - type: rouge
30
- value: 32.2114
31
  name: ROUGE-L
32
  verified: true
33
- verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiMWZhN2YxZTI5NGQ1MzRiYTEzZmI4YTRmYzY5NTMxYWVjZTRhODFjZTJlN2VkZWRkNzllOTE5ZTA5OTY2OGNiYiIsInZlcnNpb24iOjF9.hs2yl3ArmJuDo_N87MUWqcJ034sCjD8borR4kE_D91z0aL3NilFdpDk2iuyynE9pCn4JttetiGRLngpMvKekDw
34
  - type: rouge
35
- value: 32.2207
36
  name: ROUGE-LSUM
37
  verified: true
38
- verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNWJkMTdmYmJlMGY3ZDg0ZGU4NTk3ZDRiMTg2ODUxZjU3ODJiMGNmZGZiOGFmNDhhZmRkMTE5MTM1YTMwNDI3NiIsInZlcnNpb24iOjF9.nTXRauPJTCmm1Ed4mp4LyIaWKd0OXhK94OAZEnIpN549pMZ19ufrNTuBeXQj6vLQAsaugbrPotBXBPe-Pbp3Dg
39
  - type: loss
40
- value: 1.8224396705627441
41
  name: loss
42
  verified: true
43
- verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiZDlhYTY5MzFkOGRlNGFjOGZiYzUyYmQ3ZWViNmY2ODZiYzhjNzYyYWZlN2ViZTJiZWEyMDFlYjE0YmIyODY2YyIsInZlcnNpb24iOjF9.96OPA94rxBtQpSiEEk7hBffOa30pe1TslYE9cpZiiwQb7GOCNGeUqjxWmzE0-R1_QluMN527k0dFL1G2KWQwAA
44
  - type: gen_len
45
  value: 19.7028
46
  name: gen_len
 
17
  split: test
18
  metrics:
19
  - type: rouge
20
+ value: 38.6513
21
  name: ROUGE-1
22
  verified: true
23
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNmIyOTlhYjQ1NTgyMDZiMThjN2NhOTU2N2JiOWRlNmYzMzIyYzQ5NmY1NjdmZWM1ZTBmNDkwYWNlNGJmMjI5ZiIsInZlcnNpb24iOjF9.JW9NwjJOl1i41VQNH12NhOrOgRTTKwm4dirFYxUKDw-QQ9qbNkGe1iut9MAsiS3RVfxzrEM9r_e-craaN06KBA
24
  - type: rouge
25
+ value: 17.7585
26
  name: ROUGE-2
27
  verified: true
28
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiMzcwMTZhODVjMDI4NjBiMDI1ZDNiZjlhZjc2Nzk3ZmQxNmZhYzlkNDUwMWE2NDNlOGE0Y2I4YzdlZWFlYmY1YSIsInZlcnNpb24iOjF9.GAt22--gTCqKOOeUMGtbtkcsjlS6rONCxiU9YXbPz78lwpdmXsLf1d-bo4JNmMLSd8v8n9UMzGPtUYpSKTiDCQ
29
  - type: rouge
30
+ value: 32.2033
31
  name: ROUGE-L
32
  verified: true
33
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiZGJlOWVhY2M4YjNhZmQzMjRhZDk0NGMyN2U3MDI2ZDYxY2I4ZDA5M2RjZjg5MmFlZTgwYTdjNDM0ZjUxN2NmZiIsInZlcnNpb24iOjF9.FZSlI-y-wBrX6SrPjpLAC28oJSCWrRLnFtWnsFQfDZJVOuFFkI4_-R-XSFBrWuC3EzgM2WnuSCmRfzZn3iHWBw
34
  - type: rouge
35
+ value: 32.2064
36
  name: ROUGE-LSUM
37
  verified: true
38
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNGFlMWIzZmI4OTRjMDZlOTk2MGNlNTQzZDA1MWZkNjhiYWM2ZTFjZGQ0MDYwNGQ0ZjA3Yzk5OTU5ODEwNmFiNCIsInZlcnNpb24iOjF9.K6Ww1AvjfhEAh2msrBhO9SK2TL9szwTJ04S1F_ejLolrHM_YgoeTx38dlAnibKSHKyYEM2DlJt0qmS7nBKKRCA
39
  - type: loss
40
+ value: 1.822434902191162
41
  name: loss
42
  verified: true
43
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNDAyMjBlZTI1MTI4YjRjZmY0ZjliNmNkYTJmY2ZkMjViZGM4MGE1Y2E2MjM0NTNlNDM3MTk1ZGQ0ZWNiZGI3MSIsInZlcnNpb24iOjF9.tA38SX6sMrSATaKdnZbSLxYKDqKIiKseq7yT37gg-6WaU62qw72ij3BZmF-UJWWYCFdNSa-F5FAYkzwL5peGBw
44
  - type: gen_len
45
  value: 19.7028
46
  name: gen_len