NASES-clara-med / README.md
joheras's picture
update model card README.md
407d3bd
metadata
tags:
  - simplification
  - generated_from_trainer
metrics:
  - rouge
model-index:
  - name: NASES-clara-med
    results: []

NASES-clara-med

This model is a fine-tuned version of ELiRF/NASES on the None dataset. It achieves the following results on the evaluation set:

  • Loss: 3.3274
  • Rouge1: 43.5242
  • Rouge2: 25.5006
  • Rougel: 37.6746
  • Rougelsum: 37.9494

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 5.6e-05
  • train_batch_size: 16
  • eval_batch_size: 16
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 30

Training results

Training Loss Epoch Step Validation Loss Rouge1 Rouge2 Rougel Rougelsum
No log 1.0 190 2.2062 42.852 24.8838 37.1452 37.3371
No log 2.0 380 2.3578 37.9691 19.9011 32.0423 32.1643
1.9348 3.0 570 2.3038 41.0743 22.9938 35.4133 35.7045
1.9348 4.0 760 2.2657 43.452 25.738 37.9271 38.184
0.8021 5.0 950 2.3763 43.6414 25.6831 37.8838 38.1249
0.8021 6.0 1140 2.5113 43.0882 25.4964 37.5245 37.7459
0.8021 7.0 1330 2.6203 43.164 25.1229 37.396 37.7116
0.2689 8.0 1520 2.7020 43.7588 25.7127 37.9395 38.1406
0.2689 9.0 1710 2.7533 43.3676 25.2676 37.5607 37.8348
0.1474 10.0 1900 2.8033 43.7734 25.7325 37.8845 38.161
0.1474 11.0 2090 2.8847 43.1775 25.4384 37.4397 37.7056
0.1474 12.0 2280 2.9204 43.6149 25.6053 37.8689 38.121
0.0924 13.0 2470 2.9404 43.355 25.458 37.4775 37.7391
0.0924 14.0 2660 2.9606 43.7408 25.7361 38.0951 38.3446
0.0666 15.0 2850 2.9954 43.4987 25.5428 37.5166 37.7468
0.0666 16.0 3040 3.0491 43.8303 25.4726 37.8056 38.0368
0.0666 17.0 3230 3.0872 43.8177 25.9315 37.9765 38.2842
0.0485 18.0 3420 3.0743 43.0783 25.1678 37.3844 37.6768
0.0485 19.0 3610 3.1571 43.5241 25.5695 37.7233 37.9743
0.0328 20.0 3800 3.1866 43.8114 25.6402 37.7608 37.9713
0.0328 21.0 3990 3.1502 43.7322 25.6384 37.9642 38.1954
0.0328 22.0 4180 3.1723 43.5597 25.291 37.5834 37.8289
0.0264 23.0 4370 3.2214 43.251 24.8926 37.0472 37.2819
0.0264 24.0 4560 3.2764 43.469 25.2663 37.3452 37.5562
0.0169 25.0 4750 3.3104 44.0058 25.8218 38.0374 38.2664
0.0169 26.0 4940 3.2820 43.4012 25.3515 37.5514 37.7375
0.0169 27.0 5130 3.3169 43.3498 25.2935 37.3567 37.5987
0.013 28.0 5320 3.3005 43.8029 25.6506 38.053 38.2852
0.013 29.0 5510 3.3266 43.711 25.5171 37.6826 37.9781
0.0109 30.0 5700 3.3274 43.5242 25.5006 37.6746 37.9494

Framework versions

  • Transformers 4.25.1
  • Pytorch 1.13.0
  • Datasets 2.8.0
  • Tokenizers 0.12.1