opt-peter-1.3B / merges.txt
pszemraj
add new checkpoint trained for a hundred steps with smaller max grad norm and weight decay
7a20e92
456 kB