mini-experiment Collection this represent an effort to discover optimal improvements on slms using curriculum learning as n+10k samples on a 250m parameters danube model • 6 items • Updated 12 days ago