unsup-wc-s64-bs128-lr6 / train_results.json
Zhuoxu Huang
init
dedc425
{
  "epoch": 1.0,
  "train_loss": 8.470750703670477,
  "train_runtime": 4458.002,
  "train_samples": 974010,
  "train_samples_per_second": 218.214,
  "train_steps_per_second": 1.705
}