metadata
license: mit
tags:
- generated_from_trainer
model-index:
- name: predict-perception-xlmr-focus-victim
results: []
predict-perception-xlmr-focus-victim
This model is a fine-tuned version of xlm-roberta-base on an unknown dataset. It achieves the following results on the evaluation set:
- Loss: 0.2546
- Rmse: 0.6301
- Rmse Focus::a Sulla vittima: 0.6301
- Mae: 0.5441
- Mae Focus::a Sulla vittima: 0.5441
- R2: 0.7205
- R2 Focus::a Sulla vittima: 0.7205
- Cos: 0.8261
- Pair: 0.0
- Rank: 0.5
- Neighbors: 0.7802
- Rsa: nan
Model description
More information needed
Intended uses & limitations
More information needed
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 1e-05
- train_batch_size: 20
- eval_batch_size: 8
- seed: 1996
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 30
Training results
Training Loss | Epoch | Step | Validation Loss | Rmse | Rmse Focus::a Sulla vittima | Mae | Mae Focus::a Sulla vittima | R2 | R2 Focus::a Sulla vittima | Cos | Pair | Rank | Neighbors | Rsa |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1.0607 | 1.0 | 15 | 0.9261 | 1.2017 | 1.2017 | 0.9557 | 0.9557 | -0.0166 | -0.0166 | 0.4783 | 0.0 | 0.5 | 0.6332 | nan |
1.0107 | 2.0 | 30 | 0.9481 | 1.2159 | 1.2159 | 0.9861 | 0.9861 | -0.0408 | -0.0408 | 0.4783 | 0.0 | 0.5 | 0.6332 | nan |
0.9921 | 3.0 | 45 | 0.9068 | 1.1892 | 1.1892 | 0.9548 | 0.9548 | 0.0045 | 0.0045 | 0.4783 | 0.0 | 0.5 | 0.6332 | nan |
0.7769 | 4.0 | 60 | 0.5014 | 0.8842 | 0.8842 | 0.7121 | 0.7121 | 0.4496 | 0.4496 | 0.7391 | 0.0 | 0.5 | 0.6232 | nan |
0.5763 | 5.0 | 75 | 0.4019 | 0.7917 | 0.7917 | 0.6737 | 0.6737 | 0.5588 | 0.5588 | 0.8261 | 0.0 | 0.5 | 0.8155 | nan |
0.4378 | 6.0 | 90 | 0.3594 | 0.7486 | 0.7486 | 0.5957 | 0.5957 | 0.6055 | 0.6055 | 0.7391 | 0.0 | 0.5 | 0.4442 | nan |
0.3595 | 7.0 | 105 | 0.3452 | 0.7337 | 0.7337 | 0.6333 | 0.6333 | 0.6210 | 0.6210 | 0.5652 | 0.0 | 0.5 | 0.2649 | nan |
0.3192 | 8.0 | 120 | 0.3275 | 0.7147 | 0.7147 | 0.6205 | 0.6205 | 0.6405 | 0.6405 | 0.7391 | 0.0 | 0.5 | 0.6561 | nan |
0.2482 | 9.0 | 135 | 0.2978 | 0.6815 | 0.6815 | 0.5754 | 0.5754 | 0.6731 | 0.6731 | 0.7391 | 0.0 | 0.5 | 0.6715 | nan |
0.2416 | 10.0 | 150 | 0.3018 | 0.6860 | 0.6860 | 0.5954 | 0.5954 | 0.6687 | 0.6687 | 0.5652 | 0.0 | 0.5 | 0.2553 | nan |
0.2292 | 11.0 | 165 | 0.2764 | 0.6565 | 0.6565 | 0.5522 | 0.5522 | 0.6966 | 0.6966 | 0.9130 | 0.0 | 0.5 | 0.8408 | nan |
0.1752 | 12.0 | 180 | 0.3070 | 0.6920 | 0.6920 | 0.5680 | 0.5680 | 0.6629 | 0.6629 | 0.7391 | 0.0 | 0.5 | 0.6715 | nan |
0.1956 | 13.0 | 195 | 0.2923 | 0.6752 | 0.6752 | 0.5499 | 0.5499 | 0.6791 | 0.6791 | 0.8261 | 0.0 | 0.5 | 0.7843 | nan |
0.1424 | 14.0 | 210 | 0.3163 | 0.7023 | 0.7023 | 0.6060 | 0.6060 | 0.6528 | 0.6528 | 0.9130 | 0.0 | 0.5 | 0.8408 | nan |
0.152 | 15.0 | 225 | 0.2436 | 0.6164 | 0.6164 | 0.5127 | 0.5127 | 0.7326 | 0.7326 | 0.9130 | 0.0 | 0.5 | 0.8408 | nan |
0.1277 | 16.0 | 240 | 0.2471 | 0.6208 | 0.6208 | 0.5367 | 0.5367 | 0.7287 | 0.7287 | 0.8261 | 0.0 | 0.5 | 0.7802 | nan |
0.1269 | 17.0 | 255 | 0.2573 | 0.6334 | 0.6334 | 0.5329 | 0.5329 | 0.7175 | 0.7175 | 0.8261 | 0.0 | 0.5 | 0.7802 | nan |
0.1058 | 18.0 | 270 | 0.2538 | 0.6291 | 0.6291 | 0.5530 | 0.5530 | 0.7214 | 0.7214 | 0.7391 | 0.0 | 0.5 | 0.2347 | nan |
0.107 | 19.0 | 285 | 0.2568 | 0.6328 | 0.6328 | 0.5464 | 0.5464 | 0.7181 | 0.7181 | 0.8261 | 0.0 | 0.5 | 0.7802 | nan |
0.1185 | 20.0 | 300 | 0.2452 | 0.6183 | 0.6183 | 0.5317 | 0.5317 | 0.7309 | 0.7309 | 0.7391 | 0.0 | 0.5 | 0.2347 | nan |
0.1029 | 21.0 | 315 | 0.2419 | 0.6142 | 0.6142 | 0.5415 | 0.5415 | 0.7344 | 0.7344 | 0.7391 | 0.0 | 0.5 | 0.2347 | nan |
0.0908 | 22.0 | 330 | 0.2462 | 0.6196 | 0.6196 | 0.5261 | 0.5261 | 0.7297 | 0.7297 | 0.8261 | 0.0 | 0.5 | 0.7802 | nan |
0.0901 | 23.0 | 345 | 0.2528 | 0.6279 | 0.6279 | 0.5330 | 0.5330 | 0.7225 | 0.7225 | 0.8261 | 0.0 | 0.5 | 0.7802 | nan |
0.0979 | 24.0 | 360 | 0.2800 | 0.6607 | 0.6607 | 0.5682 | 0.5682 | 0.6927 | 0.6927 | 0.9130 | 0.0 | 0.5 | 0.8408 | nan |
0.0992 | 25.0 | 375 | 0.2502 | 0.6246 | 0.6246 | 0.5517 | 0.5517 | 0.7254 | 0.7254 | 0.6522 | 0.0 | 0.5 | 0.2372 | nan |
0.0846 | 26.0 | 390 | 0.2570 | 0.6331 | 0.6331 | 0.5524 | 0.5524 | 0.7178 | 0.7178 | 0.8261 | 0.0 | 0.5 | 0.7802 | nan |
0.0717 | 27.0 | 405 | 0.2562 | 0.6321 | 0.6321 | 0.5456 | 0.5456 | 0.7187 | 0.7187 | 0.8261 | 0.0 | 0.5 | 0.7802 | nan |
0.0739 | 28.0 | 420 | 0.2570 | 0.6330 | 0.6330 | 0.5471 | 0.5471 | 0.7179 | 0.7179 | 0.8261 | 0.0 | 0.5 | 0.7802 | nan |
0.0828 | 29.0 | 435 | 0.2553 | 0.6309 | 0.6309 | 0.5446 | 0.5446 | 0.7198 | 0.7198 | 0.8261 | 0.0 | 0.5 | 0.7802 | nan |
0.086 | 30.0 | 450 | 0.2546 | 0.6301 | 0.6301 | 0.5441 | 0.5441 | 0.7205 | 0.7205 | 0.8261 | 0.0 | 0.5 | 0.7802 | nan |
Framework versions
- Transformers 4.16.2
- Pytorch 1.10.2+cu113
- Datasets 1.18.3
- Tokenizers 0.11.0