relbert/roberta-large-semeval2012-average-no-mask-prompt-b-nce-classification-conceptnet-validated
RelBERT fine-tuned from roberta-large on
relbert/semeval2012_relational_similarity.
Fine-tuning is done via the RelBERT library (see the repository for more detail).
It achieves the following results on relation understanding tasks:
- Analogy Question (dataset, full result; scoring sketched after this list):
- Accuracy on SAT (full): 0.516042780748663
- Accuracy on SAT: 0.5281899109792285
- Accuracy on BATS: 0.632017787659811
- Accuracy on U2: 0.4342105263157895
- Accuracy on U4: 0.5069444444444444
- Accuracy on Google: 0.724
- Lexical Relation Classification (dataset, full result):
- Micro F1 score on BLESS: 0.9034202199789061
- Micro F1 score on CogALexV: 0.8342723004694835
- Micro F1 score on EVALution: 0.6581798483206934
- Micro F1 score on K&H+N: 0.9604228976838005
- Micro F1 score on ROOT09: 0.8909432779692886
- Relation Mapping (dataset, full result):
- Accuracy on Relation Mapping: 0.8167460317460318
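On the analogy questions, accuracy is obtained by embedding the query pair and every candidate pair, then predicting the candidate whose relation embedding is closest to the query's. Below is a minimal sketch of that scoring, assuming cosine similarity as the comparison and using illustrative word pairs (installation instructions are in the Usage section below):

from relbert import RelBERT
import numpy as np

model = RelBERT("relbert/roberta-large-semeval2012-average-no-mask-prompt-b-nce-classification-conceptnet-validated")

def cosine(a, b):
    a, b = np.asarray(a), np.asarray(b)
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

# Illustrative question: which candidate holds the same relation as the query pair?
query = model.get_embedding(['word', 'language'])
candidates = [['note', 'music'], ['wheel', 'car'], ['page', 'book']]
scores = [cosine(query, model.get_embedding(pair)) for pair in candidates]
print(candidates[int(np.argmax(scores))])  # the highest-similarity candidate is the prediction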
Usage
This model can be used through the relbert library. Install the library via pip
pip install relbert
and activate the model as below.
from relbert import RelBERT
model = RelBERT("relbert/roberta-large-semeval2012-average-no-mask-prompt-b-nce-classification-conceptnet-validated")
vector = model.get_embedding(['Tokyo', 'Japan']) # shape of (1024, )
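get_embedding also accepts a batch of word pairs, returning one 1024-dimensional vector per pair. Continuing from the snippet above (the pairs here are illustrative):

# Embed several word pairs in one call; one vector per pair.
vectors = model.get_embedding([['Tokyo', 'Japan'], ['Paris', 'France'], ['Berlin', 'Germany']])
assert len(vectors) == 3 and len(vectors[0]) == 1024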
Training hyperparameters
The following hyperparameters were used during training:
- model: roberta-large
- max_length: 64
- mode: average_no_mask
- data: relbert/semeval2012_relational_similarity
- split: train
- data_eval: relbert/conceptnet_high_confidence
- split_eval: full
- template_mode: manual
- template: Today, I finally discovered the relation between <subj> and <obj> : <obj> is <subj>'s <mask>
- loss_function: nce_logout (an NCE-style contrastive objective; sketched after this list)
- classification_loss: True
- temperature_nce_constant: 0.05
- temperature_nce_rank: {'min': 0.01, 'max': 0.05, 'type': 'linear'}
- epoch: 30
- batch: 128
- lr: 5e-06
- lr_decay: False
- lr_warmup: 1
- weight_decay: 0
- random_seed: 0
- exclude_relation: None
- exclude_relation_eval: None
- n_sample: 640
- gradient_accumulation: 8
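The exact nce_logout objective and the rank-dependent temperature schedule are implemented in the RelBERT library; the sketch below is only a generic InfoNCE-style loss using the constant temperature above, and all tensor names are illustrative:

import torch
import torch.nn.functional as F

def info_nce(anchor, positives, negatives, temperature=0.05):
    # Cosine similarities between the anchor pair's embedding and the others.
    anchor = F.normalize(anchor, dim=-1)
    pos = F.normalize(positives, dim=-1) @ anchor / temperature
    neg = F.normalize(negatives, dim=-1) @ anchor / temperature
    # -log p(positive | positive and all negatives), averaged over positives.
    return (torch.log(torch.exp(pos) + torch.exp(neg).sum()) - pos).mean()

# Example call: 4 positive and 32 negative pairs with 1024-dimensional embeddings.
loss = info_nce(torch.randn(1024), torch.randn(4, 1024), torch.randn(32, 1024))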
The full configuration can be found in the fine-tuning parameter file.
Reference
If you use any resource from RelBERT, please consider citing our paper.
@inproceedings{ushio-etal-2021-distilling-relation-embeddings,
title = "{D}istilling {R}elation {E}mbeddings from {P}re-trained {L}anguage {M}odels",
author = "Ushio, Asahi and
Schockaert, Steven and
Camacho-Collados, Jose",
booktitle = "Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing",
year = "2021",
address = "Online and Punta Cana, Dominican Republic",
publisher = "Association for Computational Linguistics",
}