Prabhāsa-b_ss 0.2 — BabyLM-2026 Strict-Small (10M)

ELC encoder, pure-MLM + RoPE + N-hot + structured masking, Muon optimizer (scale-dependent: Muon wins at 10M).

Official BabyLM-2026 scorer: BLiMP 62.74 (BLiMP supplement, EWoK, and COMPS results are in the repo).

  • +3.3 pp over v0.1 (qbz506/prabhasa-b_ss-0.1, 59.46); 2.3 pp below the GPT-2 baseline (65.08). BabyLM-compliant (≤10 epochs, 10M budget).
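
The percentage-point figures above follow directly from the three BLiMP scores quoted in this card. A minimal sketch of the arithmetic, where the dictionary keys and the `delta_pp` helper are illustrative names and not part of the official scorer:

```python
# BLiMP accuracies (percent) as quoted in this card.
blimp = {
    "v0.2": 62.74,           # this model
    "v0.1": 59.46,           # qbz506/prabhasa-b_ss-0.1
    "gpt2_baseline": 65.08,  # GPT-2 baseline
}

def delta_pp(score: float, reference: float) -> float:
    """Difference in percentage points, rounded to two decimals (hypothetical helper)."""
    return round(score - reference, 2)

gain_over_v01 = delta_pp(blimp["v0.2"], blimp["v0.1"])
gap_to_gpt2 = delta_pp(blimp["v0.2"], blimp["gpt2_baseline"])

print(f"vs v0.1:  {gain_over_v01:+.2f} pp")
print(f"vs GPT-2: {gap_to_gpt2:+.2f} pp")
```

Rounded to one decimal, these are the +3.3 pp and −2.3 pp quoted above.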
Safetensors · Model size: 0.1B params · Tensor type: F32