scenario-KD-PR-MSV-D2_data-cl-cardiff_cl_only_gamma-jason

This model is a fine-tuned version of FacebookAI/xlm-roberta-base on the None dataset. It achieves the following results on the evaluation set:

Model description

More information needed

More information needed

More information needed

The following hyperparameters were used during training:

Training Loss	Epoch	Step	Validation Loss	Accuracy	F1
No log	1.09	250	12.0692	0.3727	0.3489
14.0169	2.17	500	10.9338	0.3819	0.3797
14.0169	3.26	750	10.7424	0.4059	0.4032
11.6579	4.35	1000	11.5422	0.3927	0.3806
11.6579	5.43	1250	11.1316	0.4012	0.4005
9.9619	6.52	1500	11.8448	0.4105	0.4082
9.9619	7.61	1750	12.5910	0.4174	0.4151
8.6208	8.7	2000	11.4658	0.3989	0.3971
8.6208	9.78	2250	12.6395	0.4066	0.4067
7.4098	10.87	2500	12.9836	0.4035	0.3977
7.4098	11.96	2750	13.6894	0.3966	0.3929
6.5302	13.04	3000	15.4482	0.3997	0.3936
6.5302	14.13	3250	14.9618	0.3688	0.3596
5.6406	15.22	3500	14.4216	0.3943	0.3905
5.6406	16.3	3750	15.1277	0.4035	0.4023
5.0685	17.39	4000	15.6837	0.3920	0.3883
5.0685	18.48	4250	16.1391	0.4020	0.4015
4.3243	19.57	4500	16.0750	0.3773	0.3744
4.3243	20.65	4750	16.0997	0.3989	0.3951
4.0088	21.74	5000	16.2234	0.3966	0.3907
4.0088	22.83	5250	16.6659	0.3850	0.3833
3.5813	23.91	5500	16.1725	0.4136	0.4133
3.5813	25.0	5750	16.7571	0.3873	0.3850
3.2165	26.09	6000	17.0675	0.3951	0.3943
3.2165	27.17	6250	16.7777	0.3951	0.3934
3.0739	28.26	6500	17.1236	0.4043	0.4036
3.0739	29.35	6750	17.2063	0.3889	0.3875