Edit model card

GUE_mouse_2-seqsight_65536_512_47M-L32_all

This model is a fine-tuned version of mahdibaghbanzadeh/seqsight_65536_512_47M on the mahdibaghbanzadeh/GUE_mouse_2 dataset. It achieves the following results on the evaluation set:

  • Loss: 1.4923
  • F1 Score: 0.8199
  • Accuracy: 0.8201

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 0.0005
  • train_batch_size: 2048
  • eval_batch_size: 2048
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • training_steps: 10000

Training results

Training Loss Epoch Step Validation Loss F1 Score Accuracy
0.4054 100.0 200 0.7253 0.7405 0.7409
0.1346 200.0 400 1.0998 0.7193 0.7195
0.0689 300.0 600 1.2385 0.7530 0.7530
0.0433 400.0 800 1.4264 0.7408 0.7409
0.0303 500.0 1000 1.5952 0.7439 0.7439
0.0237 600.0 1200 1.5753 0.7341 0.7348
0.0196 700.0 1400 1.6173 0.7497 0.75
0.0167 800.0 1600 1.7331 0.7529 0.7530
0.014 900.0 1800 1.8038 0.7499 0.75
0.013 1000.0 2000 1.8529 0.7469 0.7470
0.0114 1100.0 2200 1.7976 0.7561 0.7561
0.0106 1200.0 2400 1.8884 0.7529 0.7530
0.0101 1300.0 2600 1.9043 0.7528 0.7530
0.0097 1400.0 2800 1.9710 0.7528 0.7530
0.0083 1500.0 3000 1.8886 0.7591 0.7591
0.0081 1600.0 3200 1.8498 0.7530 0.7530
0.0076 1700.0 3400 1.9551 0.7591 0.7591
0.0077 1800.0 3600 1.9208 0.7561 0.7561
0.0066 1900.0 3800 1.8954 0.7589 0.7591
0.0069 2000.0 4000 1.8680 0.7407 0.7409
0.0064 2100.0 4200 2.0258 0.7683 0.7683
0.0059 2200.0 4400 1.9716 0.7587 0.7591
0.0068 2300.0 4600 2.0603 0.7713 0.7713
0.0059 2400.0 4800 2.0135 0.7651 0.7652
0.0061 2500.0 5000 1.9758 0.7621 0.7622
0.005 2600.0 5200 2.1556 0.7652 0.7652
0.0053 2700.0 5400 2.0520 0.7498 0.75
0.0054 2800.0 5600 2.2497 0.7560 0.7561
0.0047 2900.0 5800 2.0620 0.7559 0.7561
0.005 3000.0 6000 1.9706 0.7618 0.7622
0.0045 3100.0 6200 2.1524 0.7587 0.7591
0.0042 3200.0 6400 2.2165 0.7561 0.7561
0.0049 3300.0 6600 1.9786 0.7589 0.7591
0.0039 3400.0 6800 2.2495 0.7713 0.7713
0.004 3500.0 7000 2.3557 0.7591 0.7591
0.0039 3600.0 7200 2.1475 0.7621 0.7622
0.0038 3700.0 7400 2.1291 0.7591 0.7591
0.0038 3800.0 7600 2.2240 0.7591 0.7591
0.0036 3900.0 7800 2.2950 0.7683 0.7683
0.0039 4000.0 8000 2.1987 0.7591 0.7591
0.0035 4100.0 8200 2.2783 0.7621 0.7622
0.0036 4200.0 8400 2.2651 0.7591 0.7591
0.0032 4300.0 8600 2.2795 0.7591 0.7591
0.003 4400.0 8800 2.3454 0.7591 0.7591
0.0032 4500.0 9000 2.3081 0.7591 0.7591
0.003 4600.0 9200 2.2963 0.7652 0.7652
0.0027 4700.0 9400 2.3278 0.7622 0.7622
0.0028 4800.0 9600 2.3769 0.7622 0.7622
0.0026 4900.0 9800 2.3410 0.7622 0.7622
0.003 5000.0 10000 2.3142 0.7622 0.7622

Framework versions

  • PEFT 0.9.0
  • Transformers 4.38.2
  • Pytorch 2.2.0+cu121
  • Datasets 2.17.1
  • Tokenizers 0.15.2
Downloads last month
0
Unable to determine this model’s pipeline type. Check the docs .
Invalid base_model specified in model card metadata. Needs to be a model id from hf.co/models.