Add evaluation results on the alex-apostolo--filtered-cuad config and test split of alex-apostolo/filtered-cuad

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the alex-apostolo--filtered-cuad config and test split of the [alex-apostolo/filtered-cuad](https://huggingface.co/datasets/alex-apostolo/filtered-cuad) dataset by

@pankajm

, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-eval-alex-apostolo__filtered-cuad-alex-apostolo__filtered-cu-fd7768-3096988009).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=alex-apostolo/filtered-cuad).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=alex-apostolo/filtered-cuad).

Files changed (1) hide show

README.md +25 -1

README.md CHANGED Viewed

@@ -6,7 +6,31 @@ datasets:
 - alex-apostolo/filtered-cuad
 model-index:
 - name: roberta-base-filtered-cuad
-  results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You

 - alex-apostolo/filtered-cuad
 model-index:
 - name: roberta-base-filtered-cuad
+  results:
+  - task:
+      type: question-answering
+      name: Question Answering
+    dataset:
+      name: alex-apostolo/filtered-cuad
+      type: alex-apostolo/filtered-cuad
+      config: alex-apostolo--filtered-cuad
+      split: test
+    metrics:
+    - type: f1
+      value: 71.4517
+      name: F1
+      verified: true
+      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiZWEzOTZkOTUwNzcxOTA2MTkxZmE2YzJlMDgyNzljOTgyMzAwZWM4MzVkYjkxZjYxYTRkNDQyMmYzNjk5MzU2ZSIsInZlcnNpb24iOjF9.qSR7O9rUJ0gpaPrVhX9UmlrQGYVLyCDjEUvOVCu59OeaGuqsvY7-bppTl702UAiXT6eP9RSN5SGtxMVlmgU_CA
+    - type: exact
+      value: 69.1239
+      name: Exact Match
+      verified: true
+      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiYmJhYzcxOWJjNWI4NjNiYjAyY2NhNjIzNTNhMmIwOTA2MGNmNTVhZjZmNDU2MDYwNDZmYjM2MTY5YWI4NDQ4ZCIsInZlcnNpb24iOjF9.vPsieQjxMN7QQk9mLtFCGOCFMqziBRWlf0_KhZp3wFTOSpA_U88ifDQRV4uedLs9-IzEAz3I_NOMijMrpW_AAw
+    - type: loss
+      value: 0.054761599749326706
+      name: loss
+      verified: true
+      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiYjBhOTFkYTBjOGU1NjlmOGExNWViYjYzMTAyZTQ0MGRkMGQ2MTkwODkwY2I1NTkzMjI5OGI4NWFlYzdmYjJjNiIsInZlcnNpb24iOjF9.nEMGXH-CvWl1RUNIny3_IyjHyPDI9hHQNd2hRMAwjb8iV_73ie48I6h0iVnRRWwnKcYzvamt-LzKheVVIN3ICQ
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You