scenario-KD-PO-CDF-EN-FROM-EN-D2_data-en-cardiff_eng_only_alpha-jason

This model is a fine-tuned version of haryoaw/scenario-TCR_data-en-cardiff_eng_only2; the training dataset is not specified. Evaluation-set results are reported per step in the table below.

Model description

More information needed

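The training hyperparameters are not listed. Evaluation ran every 100 steps and training loss was logged every 500 steps ("No log" marks evaluations before the first logged training loss):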

Training Loss	Epoch	Step	Validation Loss	Accuracy	F1
No log	1.72	100	21.3190	0.3576	0.3272
No log	3.45	200	21.9669	0.3765	0.3322
No log	5.17	300	21.3943	0.3880	0.3770
No log	6.90	400	22.4346	0.3937	0.3679
21.6763	8.62	500	22.0732	0.3898	0.3832
21.6763	10.34	600	22.3971	0.3915	0.3870
21.6763	12.07	700	22.5065	0.3946	0.3879
21.6763	13.79	800	22.9460	0.3942	0.3906
21.6763	15.52	900	22.6589	0.4123	0.4073
15.4435	17.24	1000	23.3626	0.4039	0.3969
15.4435	18.97	1100	23.8309	0.3990	0.3925
15.4435	20.69	1200	24.5910	0.3955	0.3914
15.4435	22.41	1300	24.9435	0.3893	0.3778
15.4435	24.14	1400	25.2576	0.3920	0.3836
11.0440	25.86	1500	25.6593	0.3951	0.3861
11.0440	27.59	1600	25.6976	0.3973	0.3902
11.0440	29.31	1700	25.9311	0.4056	0.3983
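
The table above can also be read programmatically. The sketch below copies the evaluation rows into a list, picks the row with the highest F1, and estimates the number of steps per epoch from the last row. The names `ROWS`, `best_by_f1`, and `steps_per_epoch` are illustrative and not part of the original training code. Whether the best-scoring checkpoint was the one saved is not stated in this card.

```python
# Evaluation rows copied from the table above:
# (step, epoch, validation_loss, accuracy, f1)
ROWS = [
    (100, 1.72, 21.3190, 0.3576, 0.3272),
    (200, 3.45, 21.9669, 0.3765, 0.3322),
    (300, 5.17, 21.3943, 0.3880, 0.3770),
    (400, 6.90, 22.4346, 0.3937, 0.3679),
    (500, 8.62, 22.0732, 0.3898, 0.3832),
    (600, 10.34, 22.3971, 0.3915, 0.3870),
    (700, 12.07, 22.5065, 0.3946, 0.3879),
    (800, 13.79, 22.9460, 0.3942, 0.3906),
    (900, 15.52, 22.6589, 0.4123, 0.4073),
    (1000, 17.24, 23.3626, 0.4039, 0.3969),
    (1100, 18.97, 23.8309, 0.3990, 0.3925),
    (1200, 20.69, 24.5910, 0.3955, 0.3914),
    (1300, 22.41, 24.9435, 0.3893, 0.3778),
    (1400, 24.14, 25.2576, 0.3920, 0.3836),
    (1500, 25.86, 25.6593, 0.3951, 0.3861),
    (1600, 27.59, 25.6976, 0.3973, 0.3902),
    (1700, 29.31, 25.9311, 0.4056, 0.3983),
]


def best_by_f1(rows):
    """Return the evaluation row with the highest F1 score."""
    return max(rows, key=lambda row: row[4])


def steps_per_epoch(rows):
    """Estimate optimizer steps per epoch from the last logged row."""
    step, epoch = rows[-1][0], rows[-1][1]
    return round(step / epoch)


best = best_by_f1(ROWS)
print(f"best F1 {best[4]:.4f} at step {best[0]} (epoch {best[1]})")
print(f"approx. steps per epoch: {steps_per_epoch(ROWS)}")
```

In these rows, validation loss generally rises after the first few hundred steps while training loss falls, so a selection rule based on F1 or accuracy may pick an earlier checkpoint than the final one.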