research-backup/relbert-roberta-base-semeval2012-v6-mask-prompt-d-nce-0

asahi417 committed on Nov 22, 2022

Commit 6cfd363 • 1 Parent(s): 3187797

model update

Files changed (1)

README.md +22 -23

@@ -191,29 +191,28 @@ vector = model.get_embedding(['Tokyo', 'Japan'])  # shape of (1024, )
 ### Training hyperparameters
 The following hyperparameters were used during training:
- - model: "roberta-base"
- - max_length: "64"
- - mode: "mask"
- - data: "relbert/semeval2012_relational_similarity_v6"
- - split: "train"
- - split_eval: "validation"
- - template_mode: "manual"
- - template: "I wasn’t aware of this relationship, but I just read in the encyclopedia that <subj> is the <mask> of <obj>"
- - loss_function: "nce_logout"
- - classification_loss: "False"
- - temperature_nce_constant: "0.05"
- - temperature_nce_rank: "{'min': 0.01, 'max': 0.05, 'type': 'linear'}"
- - epoch: "10"
- - batch: "128"
- - lr: "5e-06"
- - lr_decay: "False"
- - lr_warmup: "1"
- - weight_decay: "0"
- - random_seed: "0"
- - exclude_relation: "None"
- - n_sample: "320"
- - gradient_accumulation: "8"
- - relation_level: "None"
 The full configuration can be found at [fine-tuning parameter file](https://huggingface.co/relbert/relbert-roberta-base-semeval2012-v6-mask-prompt-d-nce-0/raw/main/trainer_config.json).

 ### Training hyperparameters
 The following hyperparameters were used during training:
+ - model: roberta-base
+ - max_length: 64
+ - mode: mask
+ - data: relbert/semeval2012_relational_similarity_v6
+ - split: train
+ - split_eval: validation
+ - template_mode: manual
+ - loss_function: nce_logout
+ - classification_loss: False
+ - temperature_nce_constant: 0.05
+ - temperature_nce_rank: {'min': 0.01, 'max': 0.05, 'type': 'linear'}
+ - epoch: 10
+ - batch: 128
+ - lr: 5e-06
+ - lr_decay: False
+ - lr_warmup: 1
+ - weight_decay: 0
+ - random_seed: 0
+ - exclude_relation: None
+ - n_sample: 320
+ - gradient_accumulation: 8
+ - relation_level: None
 The full configuration can be found at [fine-tuning parameter file](https://huggingface.co/relbert/relbert-roberta-base-semeval2012-v6-mask-prompt-d-nce-0/raw/main/trainer_config.json).
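
The updated list writes each value without surrounding quotes, so the values read as literals (`64`, `False`, `5e-06`) where the old list showed strings. As a rough illustration that is not part of this commit, bullets in this style could be read back into typed Python values along the following lines. The helper name `parse_hyperparameters` is hypothetical, and using `ast.literal_eval` to interpret the values is an assumption about how a reader might want to consume them, not something the repository does.

```python
import ast

# A few bullets copied from the updated README section.
README_LINES = """\
- model: roberta-base
- max_length: 64
- classification_loss: False
- temperature_nce_rank: {'min': 0.01, 'max': 0.05, 'type': 'linear'}
- lr: 5e-06
- exclude_relation: None
"""


def parse_hyperparameters(text):
    """Hypothetical helper: turn '- key: value' bullets into a typed dict."""
    params = {}
    for line in text.splitlines():
        line = line.strip()
        if not line.startswith("- "):
            continue  # skip anything that is not a bullet
        # Split on the first ': ' only, so dict values keep their own colons.
        key, _, raw = line[2:].partition(": ")
        try:
            # Numbers, booleans, None and dict literals become Python objects.
            params[key] = ast.literal_eval(raw)
        except (ValueError, SyntaxError):
            # Bare words such as 'roberta-base' are kept as plain strings.
            params[key] = raw
    return params


params = parse_hyperparameters(README_LINES)
```

For authoritative values, the linked `trainer_config.json` remains the better source; the sketch only shows that the unquoted bullets are straightforward to interpret.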