flintlock_3B_v0.1 / README.md
ambrosfitz's picture
Update README.md
071281f verified
|
raw
history blame
963 Bytes
metadata
license: apache-2.0
datasets:
  - ambrosfitz/ps_data_v2.2

Run history:

train/epoch β–β–β–‚β–‚β–‚β–ƒβ–ƒβ–„β–„β–„β–…β–…β–…β–†β–†β–‡β–‡β–‡β–ˆβ–ˆβ–ˆ
train/global_step β–β–β–‚β–‚β–‚β–ƒβ–ƒβ–„β–„β–„β–…β–…β–…β–†β–†β–‡β–‡β–‡β–ˆβ–ˆβ–ˆ
train/grad_norm β–ˆβ–ˆβ–„β–…β–„β–…β–ƒβ–‚β–‚β–„β–‚β–‚β–‚β–‚β–β–β–β–β–β–
train/learning_rate β–‚β–‚β–ƒβ–„β–…β–…β–†β–‡β–‡β–ˆβ–‡β–‡β–†β–…β–…β–„β–ƒβ–‚β–‚β–
train/loss β–‡β–ˆβ–‡β–†β–…β–…β–„β–„β–ƒβ–ƒβ–‚β–‚β–‚β–β–‚β–β–β–β–β–
train/total_flos ▁
train/train_loss ▁
train/train_runtime ▁
train/train_samples_per_second ▁
train/train_steps_per_second ▁

Run summary:

train/epoch 2.0
train/global_step 20
train/grad_norm 0.13779
train/learning_rate 0.0
train/loss 1.1365
train/total_flos 4.579249185376512e+16
train/train_loss 1.29891
train/train_runtime 1552.5749
train/train_samples_per_second 1.649
train/train_steps_per_second 0.013