Sebastian autoevaluator HF staff committed on
Commit 4c0f212
1 Parent(s): 551211e

Add verifyToken field to verify evaluation results are produced by Hugging Face's automatic model evaluator (#4)

- Add verifyToken field to verify evaluation results are produced by Hugging Face's automatic model evaluator (a65041a008a2a69fd77546902b845cd3fbc55c11)
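
The `verifyToken` values added in this commit have the three-segment shape of a JSON Web Token (header, payload, signature, separated by dots). As a minimal sketch of how to inspect one, the snippet below base64url-decodes the header and payload of the Exact Match token from this diff. It does not verify the signature: the evaluator's public key is not part of this commit, so nothing here proves the token is authentic. The helper names `b64url_decode` and `decode_jwt_unverified` are illustrative, not part of any Hugging Face API.

```python
import base64
import json


def b64url_decode(segment: str) -> bytes:
    """Decode a base64url segment, restoring the padding that JWTs strip."""
    padding = "=" * (-len(segment) % 4)
    return base64.urlsafe_b64decode(segment + padding)


def decode_jwt_unverified(token: str):
    """Split a JWT into (header, payload, signature) WITHOUT checking the signature."""
    header_b64, payload_b64, signature_b64 = token.split(".")
    header = json.loads(b64url_decode(header_b64))
    payload = json.loads(b64url_decode(payload_b64))
    signature = b64url_decode(signature_b64)
    return header, payload, signature


# Token copied verbatim from the exact_match metric in this commit.
TOKEN = (
    "eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9."
    "eyJoYXNoIjoiNzVhZDE2NTg5NmUwOWRkMmI2MGUxYjFlZjIzNmMyNDQ2MDY2MDNhYzE0ZjY5YTkyY2U4ODc3ODFiZjQxZWQ2YSIsInZlcnNpb24iOjF9."
    "f_rN3WPMAdv-OBPz0T7N7lOxYz9f1nEr_P-vwKhi3jNdRKp_JTy18MYR9eyJM2riKHC6_ge-8XwfyrUf51DSDA"
)

header, payload, signature = decode_jwt_unverified(TOKEN)
print(header)   # algorithm and token type declared by the issuer
print(payload)  # contains a "hash" string and a "version" number
```

The decoded header declares the `EdDSA` algorithm, and the payload carries only a `hash` and a `version` field. What exactly the hash covers is not stated in this commit, so treat any interpretation of it as an assumption.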


Co-authored-by: Evaluation Bot <autoevaluator@users.noreply.huggingface.co>

Files changed (1)
  1. README.md +7 -5
README.md CHANGED
@@ -1,10 +1,10 @@
  ---
  language: multilingual
+ license: cc-by-4.0
  tags:
  - question-answering
  datasets:
  - squad_v2
- license: cc-by-4.0
  model-index:
  - name: deepset/xlm-roberta-large-squad2
  results:
@@ -17,14 +17,16 @@ model-index:
  config: squad_v2
  split: validation
  metrics:
- - name: Exact Match
- type: exact_match
+ - type: exact_match
  value: 81.8281
+ name: Exact Match
  verified: true
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNzVhZDE2NTg5NmUwOWRkMmI2MGUxYjFlZjIzNmMyNDQ2MDY2MDNhYzE0ZjY5YTkyY2U4ODc3ODFiZjQxZWQ2YSIsInZlcnNpb24iOjF9.f_rN3WPMAdv-OBPz0T7N7lOxYz9f1nEr_P-vwKhi3jNdRKp_JTy18MYR9eyJM2riKHC6_ge-8XwfyrUf51DSDA
- - name: F1
- type: f1
+ - type: f1
  value: 84.8886
+ name: F1
  verified: true
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiZGE5MWJmZGUxMGMwNWFhYzVhZjQwZGEwOWQ4N2Q2Yjg5NzdjNDFiNDhiYTQ1Y2E5ZWJkOTFhYmI1Y2Q2ZGYwOCIsInZlcnNpb24iOjF9.TIdH-tOx3kEMDs5wK1r6iwZqqSjNGlBrpawrsE917j1F3UFJVnQ7wJwaj0OIgmC4iw8OQeLZL56ucBcLApa-AQ
  ---

  # Multilingual XLM-RoBERTa large for QA on various languages