autoevaluator HF staff commited on
Commit
f1f6227
1 Parent(s): 0e0e10f

Add evaluation results on the de-en config and test split of wmt16

Browse files

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the de-en config and test split of the [wmt16](https://huggingface.co/datasets/wmt16) dataset by

@DarkSourceOfCode
, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-eval-wmt16-de-en-bfa340-42157145094).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=wmt16).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=wmt16).

Files changed (1) hide show
  1. README.md +30 -6
README.md CHANGED
@@ -24,18 +24,42 @@ model-index:
24
  config: ro-en
25
  split: test
26
  metrics:
27
- - name: BLEU
28
- type: bleu
29
  value: 6.0012
 
30
  verified: true
31
- - name: loss
32
- type: loss
33
  value: 1.7407585382461548
 
34
  verified: true
35
- - name: gen_len
36
- type: gen_len
37
  value: 18.2281
 
38
  verified: true
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
39
  ---
40
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
41
  should probably proofread and complete it, then remove this comment. -->
 
24
  config: ro-en
25
  split: test
26
  metrics:
27
+ - type: bleu
 
28
  value: 6.0012
29
+ name: BLEU
30
  verified: true
31
+ - type: loss
 
32
  value: 1.7407585382461548
33
+ name: loss
34
  verified: true
35
+ - type: gen_len
 
36
  value: 18.2281
37
+ name: gen_len
38
  verified: true
39
+ - task:
40
+ type: translation
41
+ name: Translation
42
+ dataset:
43
+ name: wmt16
44
+ type: wmt16
45
+ config: de-en
46
+ split: test
47
+ metrics:
48
+ - type: bleu
49
+ value: 0.7585
50
+ name: BLEU
51
+ verified: true
52
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNjcyOTc4YzFlZWE1NmJjNjIxZDljOWQxNzc4MDhjMGM5OWVlOTIxZjJiNmM5NmFlMmY3YjBkYmIzZjhkY2Y3NSIsInZlcnNpb24iOjF9.Uf2P537ulJ0oI4m2xkSoR_L-5J2qUuu09Q0jr5LFYXe28-PRLZAUizXlS3AZ4jclGjbvUmTmIDsroOGgiqpSCQ
53
+ - type: loss
54
+ value: 9.197447776794434
55
+ name: loss
56
+ verified: true
57
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNmNhZDZlNTk4NTg4OWFiNjkwOWQxOWViNDExNTVjYzhkMGUwYmU1M2MwMjlmZDczYThkMTUwNzFkMjk5NTRhOCIsInZlcnNpb24iOjF9.xvfeeznVmNb22l3YO-fOJYnWeia-y-Oelwn24qyvMShP4VgDOOpufvJiIG6sYWGeoACz7L7AfKlRU6GUpvNnAg
58
+ - type: gen_len
59
+ value: 18.2718
60
+ name: gen_len
61
+ verified: true
62
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNzJlYzRkNTBhZGNhZWE4ZGQ1ZGNhZWVmZWRhZjZkNTc3NDI0NDhhMTQwZjViNGQ2Zjk5MzhlNzM4YzY3MDM4YiIsInZlcnNpb24iOjF9.qjeO8wg_wURGTH0USvF3xwCbSuNgVfGyngQnfbqqmPpOPUwyz08_-ktjmOoE_IrLSIFIZ8XLMqr1rOOMkX9_Cg
63
  ---
64
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
65
  should probably proofread and complete it, then remove this comment. -->