collapse_gemma-2-2b_hs2_accumulatesubsample_iter14_sftsd2

This model is a fine-tuned version of google/gemma-2-2b on an unknown dataset. It achieves the following results on the evaluation set:

Model description

More information needed

More information needed

More information needed

The following hyperparameters were used during training:

Training Loss	Epoch	Step	Validation Loss	Input Tokens Seen
No log	0	0	1.3909	0
1.4664	0.0531	5	1.2779	265768
1.0297	0.1062	10	1.2239	526776
0.9672	0.1594	15	1.2051	794288
0.9285	0.2125	20	1.2391	1063824
0.7632	0.2656	25	1.2306	1332408
0.7406	0.3187	30	1.2478	1595464
0.6883	0.3718	35	1.2507	1871024
0.5929	0.4250	40	1.2429	2133560
0.4589	0.4781	45	1.2391	2394480
0.6095	0.5312	50	1.2221	2663544
0.5181	0.5843	55	1.2246	2930064
0.4917	0.6375	60	1.2135	3199536
0.5105	0.6906	65	1.2249	3465264
0.4253	0.7437	70	1.2138	3727952
0.4506	0.7968	75	1.2148	3991304
0.4301	0.8499	80	1.2095	4255664
0.432	0.9031	85	1.2015	4523456
0.3698	0.9562	90	1.2208	4781552