RoBERTa_Combined_Generated_v2_2000_Fold1
This model is a fine-tuned version of ICT2214Team7/RoBERTa_Test_Training on an unknown dataset. It achieves the following results on the evaluation set:
- Loss: 0.0542
- Precision: 0.8803
- Recall: 0.9358
- F1: 0.9072
- Accuracy: 0.9861
- Report: {'AGE': {'precision': 0.9491525423728814, 'recall': 0.9911504424778761, 'f1-score': 0.9696969696969698, 'support': 113}, 'LOC': {'precision': 0.775, 'recall': 0.915129151291513, 'f1-score': 0.8392554991539763, 'support': 271}, 'NAT': {'precision': 0.9176470588235294, 'recall': 0.9512195121951219, 'f1-score': 0.9341317365269461, 'support': 164}, 'ORG': {'precision': 0.9230769230769231, 'recall': 0.9230769230769231, 'f1-score': 0.9230769230769231, 'support': 130}, 'PER': {'precision': 0.967948717948718, 'recall': 0.9263803680981595, 'f1-score': 0.9467084639498432, 'support': 163}, 'micro avg': {'precision': 0.8803131991051454, 'recall': 0.9357907253269917, 'f1-score': 0.9072046109510086, 'support': 841}, 'macro avg': {'precision': 0.9065650484444104, 'recall': 0.9413912794279188, 'f1-score': 0.9225739184809317, 'support': 841}, 'weighted avg': {'precision': 0.8865029678487937, 'recall': 0.9357907253269917, 'f1-score': 0.9090666852089522, 'support': 841}}
Model description
More information needed
Intended uses & limitations
More information needed
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 5e-05
- train_batch_size: 8
- eval_batch_size: 8
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 3
Training results
Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1 | Accuracy | Report |
---|---|---|---|---|---|---|---|---|
No log | 1.0 | 160 | 0.0557 | 0.8449 | 0.9132 | 0.8777 | 0.9828 | {'AGE': {'precision': 0.9411764705882353, 'recall': 0.9911504424778761, 'f1-score': 0.9655172413793104, 'support': 113}, 'LOC': {'precision': 0.7467948717948718, 'recall': 0.8597785977859779, 'f1-score': 0.7993138936535162, 'support': 271}, 'NAT': {'precision': 0.8603351955307262, 'recall': 0.9390243902439024, 'f1-score': 0.8979591836734694, 'support': 164}, 'ORG': {'precision': 0.8613138686131386, 'recall': 0.9076923076923077, 'f1-score': 0.8838951310861423, 'support': 130}, 'PER': {'precision': 0.9320987654320988, 'recall': 0.9263803680981595, 'f1-score': 0.9292307692307692, 'support': 163}, 'micro avg': {'precision': 0.8448844884488449, 'recall': 0.9131985731272295, 'f1-score': 0.8777142857142858, 'support': 841}, 'macro avg': {'precision': 0.8683438343918141, 'recall': 0.9248052212596447, 'f1-score': 0.8951832438046414, 'support': 841}, 'weighted avg': {'precision': 0.8486708979608324, 'recall': 0.9131985731272295, 'f1-score': 0.8791365065448608, 'support': 841}} |
No log | 2.0 | 320 | 0.0581 | 0.8686 | 0.9429 | 0.9042 | 0.9847 | {'AGE': {'precision': 0.9411764705882353, 'recall': 0.9911504424778761, 'f1-score': 0.9655172413793104, 'support': 113}, 'LOC': {'precision': 0.7832817337461301, 'recall': 0.933579335793358, 'f1-score': 0.8518518518518519, 'support': 271}, 'NAT': {'precision': 0.8516483516483516, 'recall': 0.9451219512195121, 'f1-score': 0.8959537572254336, 'support': 164}, 'ORG': {'precision': 0.9166666666666666, 'recall': 0.9307692307692308, 'f1-score': 0.9236641221374045, 'support': 130}, 'PER': {'precision': 0.9681528662420382, 'recall': 0.9325153374233128, 'f1-score': 0.9500000000000001, 'support': 163}, 'micro avg': {'precision': 0.8685651697699891, 'recall': 0.9429250891795482, 'f1-score': 0.9042189281641961, 'support': 841}, 'macro avg': {'precision': 0.8921852177782844, 'recall': 0.9466272595366579, 'f1-score': 0.9173973945188001, 'support': 841}, 'weighted avg': {'precision': 0.8742784834198815, 'recall': 0.9429250891795482, 'f1-score': 0.9058478622955382, 'support': 841}} |
No log | 3.0 | 480 | 0.0542 | 0.8803 | 0.9358 | 0.9072 | 0.9861 | {'AGE': {'precision': 0.9491525423728814, 'recall': 0.9911504424778761, 'f1-score': 0.9696969696969698, 'support': 113}, 'LOC': {'precision': 0.775, 'recall': 0.915129151291513, 'f1-score': 0.8392554991539763, 'support': 271}, 'NAT': {'precision': 0.9176470588235294, 'recall': 0.9512195121951219, 'f1-score': 0.9341317365269461, 'support': 164}, 'ORG': {'precision': 0.9230769230769231, 'recall': 0.9230769230769231, 'f1-score': 0.9230769230769231, 'support': 130}, 'PER': {'precision': 0.967948717948718, 'recall': 0.9263803680981595, 'f1-score': 0.9467084639498432, 'support': 163}, 'micro avg': {'precision': 0.8803131991051454, 'recall': 0.9357907253269917, 'f1-score': 0.9072046109510086, 'support': 841}, 'macro avg': {'precision': 0.9065650484444104, 'recall': 0.9413912794279188, 'f1-score': 0.9225739184809317, 'support': 841}, 'weighted avg': {'precision': 0.8865029678487937, 'recall': 0.9357907253269917, 'f1-score': 0.9090666852089522, 'support': 841}} |
Framework versions
- Transformers 4.40.2
- Pytorch 2.3.0+cu121
- Datasets 2.19.1
- Tokenizers 0.19.1
- Downloads last month
- 3
Model tree for ICT2214Team7/RoBERTa_Combined_Generated_v2_2000_Fold1
Base model
distilbert/distilroberta-base
Finetuned
ICT2214Team7/RoBERTa_Test_Training