rAIfle's picture
Update README.md
649cc8c verified
|
raw
history blame
No virus
336 Bytes
metadata
library_name: transformers
tags: []

experiment_2_8b-fp16

Another experimental train w/ unsloth. This time, roughly 0.6 epochs of the cleaned c2-logs. My metaparams are probably bad, since the loss-value was super weird at the end. Also uploaded another version in the checkpoint-3500-branch that may mitigate some of that.