Edit model card

EElayoutlmv3_jordyvl_rvl_cdip_100_examples_per_class_2023-09-02_txt_vis_concat_enc_9_ramp

This model is a fine-tuned version of microsoft/layoutlmv3-base on an unknown dataset. It achieves the following results on the evaluation set:

  • Loss: 1.2989
  • Accuracy: 0.7625
  • Exit 0 Accuracy: 0.0775
  • Exit 1 Accuracy: 0.755

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 2
  • eval_batch_size: 1
  • seed: 42
  • gradient_accumulation_steps: 24
  • total_train_batch_size: 48
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 60

Training results

Training Loss Epoch Step Validation Loss Accuracy Exit 0 Accuracy Exit 1 Accuracy
No log 0.96 16 2.6928 0.1225 0.0525 0.0625
No log 1.98 33 2.5722 0.23 0.035 0.0625
No log 3.0 50 2.3786 0.2925 0.0425 0.0625
No log 3.96 66 2.1686 0.39 0.05 0.0625
No log 4.98 83 1.9395 0.505 0.045 0.0625
No log 6.0 100 1.6538 0.5875 0.0475 0.0625
No log 6.96 116 1.4783 0.6425 0.0475 0.0625
No log 7.98 133 1.2744 0.7175 0.0525 0.0625
No log 9.0 150 1.1395 0.7325 0.06 0.0625
No log 9.96 166 1.0234 0.7525 0.065 0.0625
No log 10.98 183 0.9838 0.75 0.0675 0.0625
No log 12.0 200 0.9310 0.7475 0.065 0.0625
No log 12.96 216 0.9234 0.755 0.065 0.0625
No log 13.98 233 0.9256 0.7675 0.065 0.0625
No log 15.0 250 0.9318 0.7725 0.0675 0.0625
No log 15.96 266 0.9192 0.7475 0.0675 0.0875
No log 16.98 283 0.9302 0.7625 0.065 0.4775
No log 18.0 300 0.9552 0.76 0.065 0.685
No log 18.96 316 1.0063 0.775 0.065 0.715
No log 19.98 333 1.0117 0.7675 0.065 0.7425
No log 21.0 350 0.9867 0.775 0.065 0.77
No log 21.96 366 1.0445 0.7725 0.0675 0.775
No log 22.98 383 1.0835 0.765 0.07 0.7725
No log 24.0 400 1.0637 0.775 0.0725 0.7725
No log 24.96 416 1.1717 0.765 0.0725 0.75
No log 25.98 433 1.0935 0.7675 0.0675 0.77
No log 27.0 450 1.2155 0.7625 0.0675 0.7675
No log 27.96 466 1.1269 0.7675 0.0675 0.765
No log 28.98 483 1.1821 0.775 0.0675 0.755
1.4929 30.0 500 1.1562 0.7775 0.0725 0.7675
1.4929 30.96 516 1.1784 0.7625 0.07 0.7525
1.4929 31.98 533 1.1937 0.76 0.0725 0.7625
1.4929 33.0 550 1.2074 0.76 0.0775 0.7575
1.4929 33.96 566 1.2167 0.76 0.0775 0.7525
1.4929 34.98 583 1.2324 0.7575 0.0725 0.755
1.4929 36.0 600 1.2309 0.7525 0.075 0.755
1.4929 36.96 616 1.2377 0.76 0.0725 0.755
1.4929 37.98 633 1.2426 0.765 0.075 0.7525
1.4929 39.0 650 1.2471 0.76 0.075 0.7525
1.4929 39.96 666 1.2536 0.7625 0.075 0.7525
1.4929 40.98 683 1.2556 0.7575 0.0775 0.755
1.4929 42.0 700 1.2607 0.7625 0.0725 0.755
1.4929 42.96 716 1.2662 0.765 0.075 0.755
1.4929 43.98 733 1.2702 0.765 0.075 0.755
1.4929 45.0 750 1.2757 0.7675 0.075 0.7575
1.4929 45.96 766 1.2773 0.7675 0.0775 0.755
1.4929 46.98 783 1.2805 0.7675 0.0775 0.755
1.4929 48.0 800 1.2809 0.765 0.075 0.7525
1.4929 48.96 816 1.2841 0.76 0.0775 0.7525
1.4929 49.98 833 1.2902 0.76 0.075 0.7525
1.4929 51.0 850 1.2930 0.765 0.0775 0.755
1.4929 51.96 866 1.2963 0.7625 0.0775 0.7575
1.4929 52.98 883 1.2968 0.76 0.0775 0.7575
1.4929 54.0 900 1.2977 0.7625 0.0775 0.7575
1.4929 54.96 916 1.2978 0.765 0.0775 0.755
1.4929 55.98 933 1.2986 0.765 0.0775 0.755
1.4929 57.0 950 1.2988 0.765 0.0775 0.755
1.4929 57.6 960 1.2989 0.7625 0.0775 0.755

Framework versions

  • Transformers 4.31.0
  • Pytorch 2.0.1+cu117
  • Datasets 2.13.1
  • Tokenizers 0.13.3
Downloads last month
2
Inference API
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.