Moonlight-A3B Collection Moonshot's Compute-efficient MoE LLM, first Scaling Up of Muon Optimizer • 3 items • Updated Jan 27 • 14
💧 LFM2.5 Collection Collection of post-trained and base LFM2.5 models. • 35 items • Updated 6 days ago • 152
Devstral 2 Collection A couple of agentic LLMs for software engineering tasks, excelling at using tools to explore codebases, edit multiple files, and power SWE Agents. • 2 items • Updated Mar 2 • 56
Mistral Small 4 Collection A state-of-the-art model, open-weight, with a granular Mixture-of-Experts architecture that fuses instruct, reasoning and agentic skills. • 3 items • Updated Mar 16 • 74
Ouro Collection a family of pre-trained Looped Language Models. • 4 items • Updated Oct 29, 2025 • 32