autoevaluator HF staff commited on
Commit
19f956c
1 Parent(s): 057ef0d

Add evaluation results on the alex-apostolo--filtered-cuad config and test split of alex-apostolo/filtered-cuad

Browse files

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the alex-apostolo--filtered-cuad config and test split of the [alex-apostolo/filtered-cuad](https://huggingface.co/datasets/alex-apostolo/filtered-cuad) dataset by

@pankajm

, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-eval-alex-apostolo__filtered-cuad-alex-apostolo__filtered-cu-fd7768-3096988009).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=alex-apostolo/filtered-cuad).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=alex-apostolo/filtered-cuad).

Files changed (1) hide show
  1. README.md +25 -1
README.md CHANGED
@@ -6,7 +6,31 @@ datasets:
6
  - alex-apostolo/filtered-cuad
7
  model-index:
8
  - name: roberta-base-filtered-cuad
9
- results: []
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
10
  ---
11
 
12
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
6
  - alex-apostolo/filtered-cuad
7
  model-index:
8
  - name: roberta-base-filtered-cuad
9
+ results:
10
+ - task:
11
+ type: question-answering
12
+ name: Question Answering
13
+ dataset:
14
+ name: alex-apostolo/filtered-cuad
15
+ type: alex-apostolo/filtered-cuad
16
+ config: alex-apostolo--filtered-cuad
17
+ split: test
18
+ metrics:
19
+ - type: f1
20
+ value: 71.4517
21
+ name: F1
22
+ verified: true
23
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiZWEzOTZkOTUwNzcxOTA2MTkxZmE2YzJlMDgyNzljOTgyMzAwZWM4MzVkYjkxZjYxYTRkNDQyMmYzNjk5MzU2ZSIsInZlcnNpb24iOjF9.qSR7O9rUJ0gpaPrVhX9UmlrQGYVLyCDjEUvOVCu59OeaGuqsvY7-bppTl702UAiXT6eP9RSN5SGtxMVlmgU_CA
24
+ - type: exact
25
+ value: 69.1239
26
+ name: Exact Match
27
+ verified: true
28
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiYmJhYzcxOWJjNWI4NjNiYjAyY2NhNjIzNTNhMmIwOTA2MGNmNTVhZjZmNDU2MDYwNDZmYjM2MTY5YWI4NDQ4ZCIsInZlcnNpb24iOjF9.vPsieQjxMN7QQk9mLtFCGOCFMqziBRWlf0_KhZp3wFTOSpA_U88ifDQRV4uedLs9-IzEAz3I_NOMijMrpW_AAw
29
+ - type: loss
30
+ value: 0.054761599749326706
31
+ name: loss
32
+ verified: true
33
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiYjBhOTFkYTBjOGU1NjlmOGExNWViYjYzMTAyZTQ0MGRkMGQ2MTkwODkwY2I1NTkzMjI5OGI4NWFlYzdmYjJjNiIsInZlcnNpb24iOjF9.nEMGXH-CvWl1RUNIny3_IyjHyPDI9hHQNd2hRMAwjb8iV_73ie48I6h0iVnRRWwnKcYzvamt-LzKheVVIN3ICQ
34
  ---
35
 
36
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You