Edit model card

nl+no_processing

This model is a fine-tuned version of facebook/mbart-large-cc25 on the None dataset. It achieves the following results on the evaluation set:

  • Loss: 0.6038
  • Smatch Precision: 73.7
  • Smatch Recall: 76.48
  • Smatch Fscore: 75.06
  • Smatch Unparsable: 0
  • Percent Not Recoverable: 0.2323

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 5e-05
  • train_batch_size: 2
  • eval_batch_size: 2
  • seed: 42
  • gradient_accumulation_steps: 8
  • total_train_batch_size: 16
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 25

Training results

Training Loss Epoch Step Validation Loss Smatch Precision Smatch Recall Smatch Fscore Smatch Unparsable Percent Not Recoverable
0.8025 1.0 3477 1.3793 18.51 65.71 28.88 0 0.0
0.13 2.0 6954 0.9377 27.0 71.3 39.16 0 0.1161
0.0953 3.0 10431 0.7509 34.09 72.74 46.42 0 0.1161
0.1386 4.0 13908 0.8524 33.38 73.32 45.87 2 0.0
0.0974 5.0 17385 0.6957 41.69 73.92 53.31 0 0.0
0.0705 6.0 20862 0.6145 47.98 75.12 58.55 0 0.0
0.2265 7.0 24339 0.6439 47.06 75.53 57.99 0 0.0
0.0506 8.0 27817 0.5974 53.0 76.95 62.77 0 0.0
0.064 9.0 31294 0.6387 51.83 77.47 62.11 0 0.0
0.0112 10.0 34771 0.6066 54.82 76.98 64.03 0 0.0
0.047 11.0 38248 0.5970 60.36 77.04 67.69 0 0.0
0.0134 12.0 41725 0.5675 61.72 77.15 68.58 0 0.0
0.0656 13.0 45202 0.6210 62.8 76.92 69.15 0 0.0581
0.015 14.0 48679 0.6257 62.8 77.32 69.31 0 0.0
0.0134 15.0 52156 0.5635 66.7 77.34 71.63 0 0.1161
0.0265 16.0 55634 0.5839 67.61 76.76 71.89 0 0.0581
0.0219 17.0 59111 0.5894 68.66 77.43 72.78 0 0.1161
0.0008 18.0 62588 0.5981 68.44 77.57 72.72 0 0.0
0.0157 19.0 66065 0.6184 69.88 77.42 73.46 0 0.0581
0.0334 20.0 69542 0.6026 70.76 77.37 73.92 0 0.2323
0.0619 21.0 73019 0.6021 72.03 77.0 74.44 0 0.1742
0.0075 22.0 76496 0.6166 72.33 76.74 74.47 0 0.0581
0.0164 23.0 79973 0.6100 72.75 77.03 74.83 0 0.2323
0.0011 24.0 83451 0.6037 73.7 76.51 75.08 0 0.2323
0.0865 25.0 86925 0.6038 73.7 76.48 75.06 0 0.2323

Framework versions

  • Transformers 4.34.0.dev0
  • Pytorch 2.0.1+cu117
  • Datasets 2.14.2
  • Tokenizers 0.13.3
Downloads last month
0
Safetensors
Model size
614M params
Tensor type
F32
·

Finetuned from