Zekunli's picture
update model card README.md
b886e01
|
raw
history blame
No virus
2.29 kB
metadata
license: apache-2.0
tags:
  - generated_from_trainer
model-index:
  - name: flan-t5-base-extraction-cnndm_2000-all-hint_precision-ep50
    results: []

flan-t5-base-extraction-cnndm_2000-all-hint_precision-ep50

This model is a fine-tuned version of google/flan-t5-base on the None dataset. It achieves the following results on the evaluation set:

  • Loss: 1.8409
  • Hint Hit Num: 2.3443
  • Hint Precision: 0.4272
  • Num: 5.472
  • Gen Len: 18.9975

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 60
  • eval_batch_size: 400
  • seed: 1799
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 50

Training results

Training Loss Epoch Step Validation Loss Hint Hit Num Hint Precision Num Gen Len
2.2521 5.88 200 1.8616 2.2495 0.4197 5.3613 18.9832
1.9538 11.76 400 1.8307 2.2661 0.4202 5.3879 18.996
1.8538 17.65 600 1.8238 2.3191 0.4251 5.446 18.9979
1.7799 23.53 800 1.8277 2.3338 0.4259 5.4634 18.9975
1.7242 29.41 1000 1.8360 2.3131 0.4243 5.4378 18.9982
1.6819 35.29 1200 1.8409 2.3443 0.4272 5.472 18.9975
1.6535 41.18 1400 1.8451 2.3363 0.4257 5.4672 18.9976
1.6397 47.06 1600 1.8472 2.3414 0.4257 5.4776 18.9982

Framework versions

  • Transformers 4.18.0
  • Pytorch 1.10.0+cu111
  • Datasets 2.5.1
  • Tokenizers 0.12.1