model update
README.md
CHANGED
@@ -191,29 +191,28 @@ vector = model.get_embedding(['Tokyo', 'Japan']) # shape of (1024, )
 ### Training hyperparameters

 The following hyperparameters were used during training:
-- model:
-- max_length:
-- mode:
-- data:
-- split:
-- split_eval:
-- template_mode:
--
--
--
--
--
--
--
--
--
--
--
--
--
--
--
-- relation_level: "None"
+- model: roberta-base
+- max_length: 64
+- mode: mask
+- data: relbert/semeval2012_relational_similarity_v6
+- split: train
+- split_eval: validation
+- template_mode: manual
+- loss_function: nce_logout
+- classification_loss: False
+- temperature_nce_constant: 0.05
+- temperature_nce_rank: {'min': 0.01, 'max': 0.05, 'type': 'linear'}
+- epoch: 10
+- batch: 128
+- lr: 5e-06
+- lr_decay: False
+- lr_warmup: 1
+- weight_decay: 0
+- random_seed: 0
+- exclude_relation: None
+- n_sample: 320
+- gradient_accumulation: 8
+- relation_level: None

 The full configuration can be found at [fine-tuning parameter file](https://huggingface.co/relbert/relbert-roberta-base-semeval2012-v6-mask-prompt-d-nce-0/raw/main/trainer_config.json).

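The updated hyperparameter list is plain `- key: value` text, so it may be handy to read it into a dictionary when comparing runs. The following is a minimal sketch, not part of the RelBERT library: `parse_hyperparameters` and `README_LINES` are hypothetical names introduced here, and the typing rule (try `ast.literal_eval`, otherwise keep the raw string) is an assumption about how the values should be interpreted.

```python
import ast

# Hyperparameter lines copied from the "+" side of the README.md change.
README_LINES = """\
- model: roberta-base
- max_length: 64
- mode: mask
- data: relbert/semeval2012_relational_similarity_v6
- split: train
- split_eval: validation
- template_mode: manual
- loss_function: nce_logout
- classification_loss: False
- temperature_nce_constant: 0.05
- temperature_nce_rank: {'min': 0.01, 'max': 0.05, 'type': 'linear'}
- epoch: 10
- batch: 128
- lr: 5e-06
- lr_decay: False
- lr_warmup: 1
- weight_decay: 0
- random_seed: 0
- exclude_relation: None
- n_sample: 320
- gradient_accumulation: 8
- relation_level: None
"""


def parse_hyperparameters(text):
    """Hypothetical helper: turn '- key: value' lines into a dict."""
    params = {}
    for line in text.splitlines():
        line = line.strip()
        if not line.startswith("- "):
            continue  # skip anything that is not a list item
        # Split on the first colon only, so dict-valued entries stay intact.
        key, _, raw = line[2:].partition(":")
        raw = raw.strip()
        try:
            # Numbers, booleans, None, and dict literals become Python values.
            value = ast.literal_eval(raw)
        except (ValueError, SyntaxError):
            # Bare identifiers such as 'roberta-base' are kept as strings.
            value = raw
        params[key.strip()] = value
    return params


params = parse_hyperparameters(README_LINES)
print(params["model"], params["lr"], params["temperature_nce_rank"]["type"])
```

The authoritative values remain those in the linked `trainer_config.json`; this sketch only mirrors what the README lists.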