electra-case-16

This model is a fine-tuned version of amr8ta/electra-case-16 on the None dataset. It achieves the following results on the evaluation set:

Model description

More information needed

More information needed

More information needed

The following hyperparameters were used during training:

Training Loss	Epoch	Step	Validation Loss	Accuracy
No log	1.0	44	0.3311	0.8733
No log	2.0	88	0.3448	0.8667
No log	3.0	132	0.3250	0.9
No log	4.0	176	0.3456	0.9
No log	5.0	220	0.3695	0.9067
No log	6.0	264	0.4012	0.8933
No log	7.0	308	0.3983	0.8933
No log	8.0	352	0.4132	0.8933
No log	9.0	396	0.4336	0.9067
No log	10.0	440	0.4263	0.8933
No log	11.0	484	0.4124	0.9
0.0318	12.0	528	0.4278	0.8933
0.0318	13.0	572	0.4593	0.8933
0.0318	14.0	616	0.4477	0.8933
0.0318	15.0	660	0.4505	0.8933
0.0318	16.0	704	0.4525	0.8933