appvoid 's Collections

mini-experiment

this represent an effort to discover optimal improvements on slms using curriculum learning as n+10k samples on a 250m parameters danube model