Qwen2.5-Coder-3B-Instruct-sft / train_results.json
edbeeching's picture
edbeeching HF staff
Model save
17b4164 verified
{
"total_flos": 38010460569600.0,
"train_loss": 0.0,
"train_runtime": 2.1351,
"train_samples": 1000,
"train_samples_per_second": 2851.346,
"train_steps_per_second": 44.962
}