---
license: afl-3.0
---

A wider BabyBERTa model trained using curriculum learning and layer stacking for the BabyLM Challenge Strict-Small track.
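
This README does not describe the training recipe, so the following is only a rough sketch of what the two named techniques generally mean, not this model's actual code. It assumes curriculum learning orders examples from easy to hard (using word count as a stand-in difficulty measure, which is an assumption) and that layer stacking grows a deeper model by copying already-trained layers. The names `curriculum_order` and `stack_layers` and the dict-based "layers" are hypothetical placeholders.

```python
import copy


def curriculum_order(sentences):
    """Sort training sentences from 'easy' to 'hard'.

    Difficulty is approximated by word count (an assumption for
    illustration; the real curriculum criterion is not stated).
    """
    return sorted(sentences, key=lambda s: len(s.split()))


def stack_layers(layers):
    """Double the depth by appending independent copies of the trained layers.

    The copies start from the trained weights but can diverge later,
    because deepcopy gives each new layer its own parameters.
    """
    return layers + [copy.deepcopy(layer) for layer in layers]


# Toy data standing in for a training corpus (hypothetical).
sentences = ["the dog barked at the mail carrier", "hi", "look at that"]
ordered = curriculum_order(sentences)

# Toy "layers" standing in for trained transformer blocks (hypothetical).
shallow = [
    {"name": "layer0", "weights": [0.1, 0.2]},
    {"name": "layer1", "weights": [0.3, 0.4]},
]
deep = stack_layers(shallow)
```

In a real setup the shallow model would typically be trained first, stacked, and then trained further at the new depth.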