research-backup
/

roberta-large-semeval2012-mask-prompt-a-nce-classification

@@ -14,7 +14,7 @@ model-index:
     metrics:
     - name: Accuracy
       type: accuracy
-      value: None
   - task:
       name: Analogy Questions (SAT full)
       type: multiple-choice-qa
@@ -25,7 +25,7 @@ model-index:
     metrics:
     - name: Accuracy
       type: accuracy
-      value: None
   - task:
       name: Analogy Questions (SAT)
       type: multiple-choice-qa
@@ -36,7 +36,7 @@ model-index:
     metrics:
     - name: Accuracy
       type: accuracy
-      value: None
   - task:
       name: Analogy Questions (BATS)
       type: multiple-choice-qa
@@ -47,7 +47,7 @@ model-index:
     metrics:
     - name: Accuracy
       type: accuracy
-      value: None
   - task:
       name: Analogy Questions (Google)
       type: multiple-choice-qa
@@ -58,7 +58,7 @@ model-index:
     metrics:
     - name: Accuracy
       type: accuracy
-      value: None
   - task:
       name: Analogy Questions (U2)
       type: multiple-choice-qa
@@ -69,7 +69,7 @@ model-index:
     metrics:
     - name: Accuracy
       type: accuracy
-      value: None
   - task:
       name: Analogy Questions (U4)
       type: multiple-choice-qa
@@ -80,7 +80,7 @@ model-index:
     metrics:
     - name: Accuracy
       type: accuracy
-      value: None
   - task:
       name: Lexical Relation Classification (BLESS)
       type: classification
@@ -91,10 +91,10 @@ model-index:
     metrics:
     - name: F1
       type: f1
-      value: None
     - name: F1 (macro)
       type: f1_macro
-      value: None
   - task:
       name: Lexical Relation Classification (CogALexV)
       type: classification
@@ -105,10 +105,10 @@ model-index:
     metrics:
     - name: F1
       type: f1
-      value: None
     - name: F1 (macro)
       type: f1_macro
-      value: None
   - task:
       name: Lexical Relation Classification (EVALution)
       type: classification
@@ -119,10 +119,10 @@ model-index:
     metrics:
     - name: F1
       type: f1
-      value: None
     - name: F1 (macro)
       type: f1_macro
-      value: None
   - task:
       name: Lexical Relation Classification (K&H+N)
       type: classification
@@ -133,10 +133,10 @@ model-index:
     metrics:
     - name: F1
       type: f1
-      value: None
     - name: F1 (macro)
       type: f1_macro
-      value: None
   - task:
       name: Lexical Relation Classification (ROOT09)
       type: classification
@@ -147,10 +147,10 @@ model-index:
     metrics:
     - name: F1
       type: f1
-      value: None
     - name: F1 (macro)
       type: f1_macro
-      value: None
 ---
 # relbert/roberta-large-semeval2012-mask-prompt-a-nce-classification
@@ -160,20 +160,20 @@ RelBERT fine-tuned from [roberta-large](https://huggingface.co/roberta-large) on
 Fine-tuning is done via [RelBERT](https://github.com/asahi417/relbert) library (see the repository for more detail).
 It achieves the following results on the relation understanding tasks:
 - Analogy Question ([dataset](https://huggingface.co/datasets/relbert/analogy_questions), [full result](https://huggingface.co/relbert/roberta-large-semeval2012-mask-prompt-a-nce-classification/raw/main/analogy.json)):
-    - Accuracy on SAT (full): None
-    - Accuracy on SAT: None
-    - Accuracy on BATS: None
-    - Accuracy on U2: None
-    - Accuracy on U4: None
-    - Accuracy on Google: None
 - Lexical Relation Classification ([dataset](https://huggingface.co/datasets/relbert/lexical_relation_classification), [full result](https://huggingface.co/relbert/roberta-large-semeval2012-mask-prompt-a-nce-classification/raw/main/classification.json)):
-    - Micro F1 score on BLESS: None
-    - Micro F1 score on CogALexV: None
-    - Micro F1 score on EVALution: None
-    - Micro F1 score on K&H+N: None
-    - Micro F1 score on ROOT09: None
 - Relation Mapping ([dataset](https://huggingface.co/datasets/relbert/relation_mapping), [full result](https://huggingface.co/relbert/roberta-large-semeval2012-mask-prompt-a-nce-classification/raw/main/relation_mapping.json)):
-    - Accuracy on Relation Mapping: None
 ### Usage

     metrics:
     - name: Accuracy
       type: accuracy
+      value: 0.755734126984127
   - task:
       name: Analogy Questions (SAT full)
       type: multiple-choice-qa
     metrics:
     - name: Accuracy
       type: accuracy
+      value: 0.6229946524064172
   - task:
       name: Analogy Questions (SAT)
       type: multiple-choice-qa
     metrics:
     - name: Accuracy
       type: accuracy
+      value: 0.6320474777448071
   - task:
       name: Analogy Questions (BATS)
       type: multiple-choice-qa
     metrics:
     - name: Accuracy
       type: accuracy
+      value: 0.725958866036687
   - task:
       name: Analogy Questions (Google)
       type: multiple-choice-qa
     metrics:
     - name: Accuracy
       type: accuracy
+      value: 0.892
   - task:
       name: Analogy Questions (U2)
       type: multiple-choice-qa
     metrics:
     - name: Accuracy
       type: accuracy
+      value: 0.5789473684210527
   - task:
       name: Analogy Questions (U4)
       type: multiple-choice-qa
     metrics:
     - name: Accuracy
       type: accuracy
+      value: 0.5972222222222222
   - task:
       name: Lexical Relation Classification (BLESS)
       type: classification
     metrics:
     - name: F1
       type: f1
+      value: 0.9287328612324846
     - name: F1 (macro)
       type: f1_macro
+      value: 0.9262077386649067
   - task:
       name: Lexical Relation Classification (CogALexV)
       type: classification
     metrics:
     - name: F1
       type: f1
+      value: 0.8723004694835681
     - name: F1 (macro)
       type: f1_macro
+      value: 0.7217088913797018
   - task:
       name: Lexical Relation Classification (EVALution)
       type: classification
     metrics:
     - name: F1
       type: f1
+      value: 0.6966413867822319
     - name: F1 (macro)
       type: f1_macro
+      value: 0.6911312709181459
   - task:
       name: Lexical Relation Classification (K&H+N)
       type: classification
     metrics:
     - name: F1
       type: f1
+      value: 0.9625095638867636
     - name: F1 (macro)
       type: f1_macro
+      value: 0.8787519473070131
   - task:
       name: Lexical Relation Classification (ROOT09)
       type: classification
     metrics:
     - name: F1
       type: f1
+      value: 0.9113130680037606
     - name: F1 (macro)
       type: f1_macro
+      value: 0.909356876425773
 ---
 # relbert/roberta-large-semeval2012-mask-prompt-a-nce-classification
 Fine-tuning is done via [RelBERT](https://github.com/asahi417/relbert) library (see the repository for more detail).
 It achieves the following results on the relation understanding tasks:
 - Analogy Question ([dataset](https://huggingface.co/datasets/relbert/analogy_questions), [full result](https://huggingface.co/relbert/roberta-large-semeval2012-mask-prompt-a-nce-classification/raw/main/analogy.json)):
+    - Accuracy on SAT (full): 0.6229946524064172
+    - Accuracy on SAT: 0.6320474777448071
+    - Accuracy on BATS: 0.725958866036687
+    - Accuracy on U2: 0.5789473684210527
+    - Accuracy on U4: 0.5972222222222222
+    - Accuracy on Google: 0.892
 - Lexical Relation Classification ([dataset](https://huggingface.co/datasets/relbert/lexical_relation_classification), [full result](https://huggingface.co/relbert/roberta-large-semeval2012-mask-prompt-a-nce-classification/raw/main/classification.json)):
+    - Micro F1 score on BLESS: 0.9287328612324846
+    - Micro F1 score on CogALexV: 0.8723004694835681
+    - Micro F1 score on EVALution: 0.6966413867822319
+    - Micro F1 score on K&H+N: 0.9625095638867636
+    - Micro F1 score on ROOT09: 0.9113130680037606
 - Relation Mapping ([dataset](https://huggingface.co/datasets/relbert/relation_mapping), [full result](https://huggingface.co/relbert/roberta-large-semeval2012-mask-prompt-a-nce-classification/raw/main/relation_mapping.json)):
+    - Accuracy on Relation Mapping: 0.755734126984127
 ### Usage