Edit model card

scenario-kd-pre-ner-full-mdeberta_data-univner_half44

This model is a fine-tuned version of microsoft/mdeberta-v3-base on the None dataset. It achieves the following results on the evaluation set:

  • Loss: 61.1919
  • Precision: 0.7756
  • Recall: 0.7755
  • F1: 0.7756
  • Accuracy: 0.9777

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 3e-05
  • train_batch_size: 8
  • eval_batch_size: 32
  • seed: 44
  • gradient_accumulation_steps: 4
  • total_train_batch_size: 32
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 10

Training results

Training Loss Epoch Step Validation Loss Precision Recall F1 Accuracy
151.0754 0.5828 500 109.7110 0.6040 0.2672 0.3705 0.9352
98.7113 1.1655 1000 90.7571 0.7117 0.5442 0.6168 0.9633
84.9101 1.7483 1500 83.2285 0.7420 0.6780 0.7085 0.9711
78.0914 2.3310 2000 78.4685 0.7315 0.7673 0.7490 0.9751
73.3274 2.9138 2500 75.3025 0.7529 0.7253 0.7388 0.9749
69.5578 3.4965 3000 72.3677 0.7494 0.7768 0.7629 0.9764
66.2785 4.0793 3500 70.2815 0.7497 0.7674 0.7584 0.9765
63.6717 4.6620 4000 68.0548 0.7654 0.7690 0.7672 0.9767
61.7684 5.2448 4500 66.4630 0.7745 0.7606 0.7675 0.9770
60.0429 5.8275 5000 65.2917 0.7689 0.7667 0.7678 0.9770
58.5111 6.4103 5500 64.1732 0.7711 0.7749 0.7730 0.9777
57.5843 6.9930 6000 63.2243 0.7810 0.7729 0.7769 0.9774
56.5783 7.5758 6500 62.6324 0.7680 0.7830 0.7755 0.9773
55.6773 8.1585 7000 61.9595 0.7722 0.7814 0.7768 0.9777
55.3077 8.7413 7500 61.5984 0.7856 0.7728 0.7791 0.9780
54.8003 9.3240 8000 61.3256 0.7754 0.7765 0.7760 0.9778
54.5737 9.9068 8500 61.1919 0.7756 0.7755 0.7756 0.9777

Framework versions

  • Transformers 4.44.2
  • Pytorch 2.1.1+cu121
  • Datasets 2.14.5
  • Tokenizers 0.19.1
Downloads last month
3
Safetensors
Model size
236M params
Tensor type
F32
·
Inference API
Unable to determine this model’s pipeline type. Check the docs .

Model tree for haryoaw/scenario-kd-pre-ner-full-mdeberta_data-univner_half44

Finetuned
(206)
this model