GUE_virus_covid-seqsight_65536_512_47M-L32_all
This model is a fine-tuned version of mahdibaghbanzadeh/seqsight_65536_512_47M on the mahdibaghbanzadeh/GUE_virus_covid dataset. It achieves the following results on the evaluation set:
- Loss: 0.8911
- F1 Score: 0.6614
- Accuracy: 0.6627
Model description
More information needed
Intended uses & limitations
More information needed
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 0.0005
- train_batch_size: 2048
- eval_batch_size: 2048
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- training_steps: 10000
Training results
Training Loss | Epoch | Step | Validation Loss | F1 Score | Accuracy |
---|---|---|---|---|---|
2.1502 | 5.56 | 200 | 1.9532 | 0.2539 | 0.2794 |
1.7911 | 11.11 | 400 | 1.5566 | 0.4198 | 0.4216 |
1.577 | 16.67 | 600 | 1.3911 | 0.4866 | 0.4820 |
1.4645 | 22.22 | 800 | 1.3114 | 0.5226 | 0.5159 |
1.3956 | 27.78 | 1000 | 1.2558 | 0.5337 | 0.5296 |
1.3406 | 33.33 | 1200 | 1.2095 | 0.5505 | 0.5470 |
1.3009 | 38.89 | 1400 | 1.1875 | 0.5572 | 0.5532 |
1.27 | 44.44 | 1600 | 1.1603 | 0.5650 | 0.5633 |
1.2405 | 50.0 | 1800 | 1.1324 | 0.5758 | 0.5770 |
1.2126 | 55.56 | 2000 | 1.1088 | 0.5902 | 0.5898 |
1.1864 | 61.11 | 2200 | 1.0813 | 0.5981 | 0.5990 |
1.1546 | 66.67 | 2400 | 1.0571 | 0.6077 | 0.6058 |
1.1303 | 72.22 | 2600 | 1.0436 | 0.6140 | 0.6124 |
1.1097 | 77.78 | 2800 | 1.0236 | 0.6233 | 0.6218 |
1.0861 | 83.33 | 3000 | 1.0071 | 0.6278 | 0.6251 |
1.0663 | 88.89 | 3200 | 0.9930 | 0.6346 | 0.6321 |
1.0468 | 94.44 | 3400 | 0.9837 | 0.6383 | 0.6353 |
1.0312 | 100.0 | 3600 | 0.9733 | 0.6379 | 0.6363 |
1.014 | 105.56 | 3800 | 0.9593 | 0.6442 | 0.6408 |
1.0005 | 111.11 | 4000 | 0.9502 | 0.6450 | 0.6444 |
0.9896 | 116.67 | 4200 | 0.9446 | 0.6461 | 0.6442 |
0.9794 | 122.22 | 4400 | 0.9395 | 0.6483 | 0.6471 |
0.9704 | 127.78 | 4600 | 0.9294 | 0.6521 | 0.6506 |
0.9614 | 133.33 | 4800 | 0.9301 | 0.6520 | 0.6514 |
0.9544 | 138.89 | 5000 | 0.9255 | 0.6534 | 0.6520 |
0.9478 | 144.44 | 5200 | 0.9251 | 0.6539 | 0.6537 |
0.9407 | 150.0 | 5400 | 0.9191 | 0.6544 | 0.6532 |
0.9353 | 155.56 | 5600 | 0.9162 | 0.6570 | 0.6559 |
0.9304 | 161.11 | 5800 | 0.9141 | 0.6588 | 0.6575 |
0.9254 | 166.67 | 6000 | 0.9104 | 0.6605 | 0.6597 |
0.9214 | 172.22 | 6200 | 0.9093 | 0.6612 | 0.6600 |
0.9178 | 177.78 | 6400 | 0.9099 | 0.6612 | 0.6606 |
0.9108 | 183.33 | 6600 | 0.9074 | 0.6610 | 0.6604 |
0.9092 | 188.89 | 6800 | 0.9057 | 0.6650 | 0.6644 |
0.9037 | 194.44 | 7000 | 0.9055 | 0.6633 | 0.6628 |
0.9021 | 200.0 | 7200 | 0.9023 | 0.6660 | 0.6653 |
0.8966 | 205.56 | 7400 | 0.8984 | 0.6672 | 0.6666 |
0.8946 | 211.11 | 7600 | 0.8970 | 0.6630 | 0.6634 |
0.8907 | 216.67 | 7800 | 0.8968 | 0.6666 | 0.6665 |
0.8878 | 222.22 | 8000 | 0.8948 | 0.6670 | 0.6666 |
0.8846 | 227.78 | 8200 | 0.8934 | 0.6652 | 0.6650 |
0.882 | 233.33 | 8400 | 0.8934 | 0.6676 | 0.6677 |
0.8814 | 238.89 | 8600 | 0.8919 | 0.6666 | 0.6665 |
0.8799 | 244.44 | 8800 | 0.8908 | 0.6679 | 0.6677 |
0.8765 | 250.0 | 9000 | 0.8911 | 0.6670 | 0.6672 |
0.8765 | 255.56 | 9200 | 0.8907 | 0.6664 | 0.6667 |
0.875 | 261.11 | 9400 | 0.8906 | 0.6675 | 0.6675 |
0.8743 | 266.67 | 9600 | 0.8909 | 0.6680 | 0.6679 |
0.8736 | 272.22 | 9800 | 0.8907 | 0.6671 | 0.6671 |
0.8731 | 277.78 | 10000 | 0.8906 | 0.6676 | 0.6676 |
Framework versions
- PEFT 0.9.0
- Transformers 4.38.2
- Pytorch 2.2.0+cu121
- Datasets 2.17.1
- Tokenizers 0.15.2
- Downloads last month
- 0
Unable to determine this model’s pipeline type. Check the
docs
.