danish-legal-lm-base / train_results.json
kiddothe2b
500k training steps with 128 tokens
5443746
{
"epoch": 53.58,
"train_loss": 1.193239736328125,
"train_runtime": 102111.9968,
"train_samples_per_second": 1253.526,
"train_steps_per_second": 4.897
}