research-backup
/

roberta-large-semeval2012-average-no-mask-prompt-c-nce-classification

@@ -14,7 +14,7 @@ model-index:
     metrics:
     - name: Accuracy
       type: accuracy
-      value: None
   - task:
       name: Analogy Questions (SAT full)
       type: multiple-choice-qa
@@ -25,7 +25,7 @@ model-index:
     metrics:
     - name: Accuracy
       type: accuracy
-      value: None
   - task:
       name: Analogy Questions (SAT)
       type: multiple-choice-qa
@@ -36,7 +36,7 @@ model-index:
     metrics:
     - name: Accuracy
       type: accuracy
-      value: None
   - task:
       name: Analogy Questions (BATS)
       type: multiple-choice-qa
@@ -47,7 +47,7 @@ model-index:
     metrics:
     - name: Accuracy
       type: accuracy
-      value: None
   - task:
       name: Analogy Questions (Google)
       type: multiple-choice-qa
@@ -58,7 +58,7 @@ model-index:
     metrics:
     - name: Accuracy
       type: accuracy
-      value: None
   - task:
       name: Analogy Questions (U2)
       type: multiple-choice-qa
@@ -69,7 +69,7 @@ model-index:
     metrics:
     - name: Accuracy
       type: accuracy
-      value: None
   - task:
       name: Analogy Questions (U4)
       type: multiple-choice-qa
@@ -80,7 +80,7 @@ model-index:
     metrics:
     - name: Accuracy
       type: accuracy
-      value: None
   - task:
       name: Lexical Relation Classification (BLESS)
       type: classification
@@ -91,10 +91,10 @@ model-index:
     metrics:
     - name: F1
       type: f1
-      value: None
     - name: F1 (macro)
       type: f1_macro
-      value: None
   - task:
       name: Lexical Relation Classification (CogALexV)
       type: classification
@@ -105,10 +105,10 @@ model-index:
     metrics:
     - name: F1
       type: f1
-      value: None
     - name: F1 (macro)
       type: f1_macro
-      value: None
   - task:
       name: Lexical Relation Classification (EVALution)
       type: classification
@@ -119,10 +119,10 @@ model-index:
     metrics:
     - name: F1
       type: f1
-      value: None
     - name: F1 (macro)
       type: f1_macro
-      value: None
   - task:
       name: Lexical Relation Classification (K&H+N)
       type: classification
@@ -133,10 +133,10 @@ model-index:
     metrics:
     - name: F1
       type: f1
-      value: None
     - name: F1 (macro)
       type: f1_macro
-      value: None
   - task:
       name: Lexical Relation Classification (ROOT09)
       type: classification
@@ -147,10 +147,10 @@ model-index:
     metrics:
     - name: F1
       type: f1
-      value: None
     - name: F1 (macro)
       type: f1_macro
-      value: None
 ---
 # relbert/roberta-large-semeval2012-average-no-mask-prompt-c-nce-classification
@@ -160,20 +160,20 @@ RelBERT fine-tuned from [roberta-large](https://huggingface.co/roberta-large) on
 Fine-tuning is done via [RelBERT](https://github.com/asahi417/relbert) library (see the repository for more detail).
 It achieves the following results on the relation understanding tasks:
 - Analogy Question ([dataset](https://huggingface.co/datasets/relbert/analogy_questions), [full result](https://huggingface.co/relbert/roberta-large-semeval2012-average-no-mask-prompt-c-nce-classification/raw/main/analogy.json)):
-    - Accuracy on SAT (full): None
-    - Accuracy on SAT: None
-    - Accuracy on BATS: None
-    - Accuracy on U2: None
-    - Accuracy on U4: None
-    - Accuracy on Google: None
 - Lexical Relation Classification ([dataset](https://huggingface.co/datasets/relbert/lexical_relation_classification), [full result](https://huggingface.co/relbert/roberta-large-semeval2012-average-no-mask-prompt-c-nce-classification/raw/main/classification.json)):
-    - Micro F1 score on BLESS: None
-    - Micro F1 score on CogALexV: None
-    - Micro F1 score on EVALution: None
-    - Micro F1 score on K&H+N: None
-    - Micro F1 score on ROOT09: None
 - Relation Mapping ([dataset](https://huggingface.co/datasets/relbert/relation_mapping), [full result](https://huggingface.co/relbert/roberta-large-semeval2012-average-no-mask-prompt-c-nce-classification/raw/main/relation_mapping.json)):
-    - Accuracy on Relation Mapping: None
 ### Usage

     metrics:
     - name: Accuracy
       type: accuracy
+      value: 0.7127976190476191
   - task:
       name: Analogy Questions (SAT full)
       type: multiple-choice-qa
     metrics:
     - name: Accuracy
       type: accuracy
+      value: 0.29411764705882354
   - task:
       name: Analogy Questions (SAT)
       type: multiple-choice-qa
     metrics:
     - name: Accuracy
       type: accuracy
+      value: 0.29080118694362017
   - task:
       name: Analogy Questions (BATS)
       type: multiple-choice-qa
     metrics:
     - name: Accuracy
       type: accuracy
+      value: 0.4641467481934408
   - task:
       name: Analogy Questions (Google)
       type: multiple-choice-qa
     metrics:
     - name: Accuracy
       type: accuracy
+      value: 0.614
   - task:
       name: Analogy Questions (U2)
       type: multiple-choice-qa
     metrics:
     - name: Accuracy
       type: accuracy
+      value: 0.32456140350877194
   - task:
       name: Analogy Questions (U4)
       type: multiple-choice-qa
     metrics:
     - name: Accuracy
       type: accuracy
+      value: 0.3449074074074074
   - task:
       name: Lexical Relation Classification (BLESS)
       type: classification
     metrics:
     - name: F1
       type: f1
+      value: 0.8862437848425494
     - name: F1 (macro)
       type: f1_macro
+      value: 0.8781526549150734
   - task:
       name: Lexical Relation Classification (CogALexV)
       type: classification
     metrics:
     - name: F1
       type: f1
+      value: 0.8370892018779342
     - name: F1 (macro)
       type: f1_macro
+      value: 0.6286516686265566
   - task:
       name: Lexical Relation Classification (EVALution)
       type: classification
     metrics:
     - name: F1
       type: f1
+      value: 0.5384615384615384
     - name: F1 (macro)
       type: f1_macro
+      value: 0.5368027921312294
   - task:
       name: Lexical Relation Classification (K&H+N)
       type: classification
     metrics:
     - name: F1
       type: f1
+      value: 0.9659177853516032
     - name: F1 (macro)
       type: f1_macro
+      value: 0.8925325170399768
   - task:
       name: Lexical Relation Classification (ROOT09)
       type: classification
     metrics:
     - name: F1
       type: f1
+      value: 0.8567847069884049
     - name: F1 (macro)
       type: f1_macro
+      value: 0.8346603805121989
 ---
 # relbert/roberta-large-semeval2012-average-no-mask-prompt-c-nce-classification
 Fine-tuning is done via [RelBERT](https://github.com/asahi417/relbert) library (see the repository for more detail).
 It achieves the following results on the relation understanding tasks:
 - Analogy Question ([dataset](https://huggingface.co/datasets/relbert/analogy_questions), [full result](https://huggingface.co/relbert/roberta-large-semeval2012-average-no-mask-prompt-c-nce-classification/raw/main/analogy.json)):
+    - Accuracy on SAT (full): 0.29411764705882354
+    - Accuracy on SAT: 0.29080118694362017
+    - Accuracy on BATS: 0.4641467481934408
+    - Accuracy on U2: 0.32456140350877194
+    - Accuracy on U4: 0.3449074074074074
+    - Accuracy on Google: 0.614
 - Lexical Relation Classification ([dataset](https://huggingface.co/datasets/relbert/lexical_relation_classification), [full result](https://huggingface.co/relbert/roberta-large-semeval2012-average-no-mask-prompt-c-nce-classification/raw/main/classification.json)):
+    - Micro F1 score on BLESS: 0.8862437848425494
+    - Micro F1 score on CogALexV: 0.8370892018779342
+    - Micro F1 score on EVALution: 0.5384615384615384
+    - Micro F1 score on K&H+N: 0.9659177853516032
+    - Micro F1 score on ROOT09: 0.8567847069884049
 - Relation Mapping ([dataset](https://huggingface.co/datasets/relbert/relation_mapping), [full result](https://huggingface.co/relbert/roberta-large-semeval2012-average-no-mask-prompt-c-nce-classification/raw/main/relation_mapping.json)):
+    - Accuracy on Relation Mapping: 0.7127976190476191
 ### Usage