Search-R1 Collection Preliminary checkpoints with outcome-only RL. • 14 items • Updated 17 days ago • 4
mistralai/Mistral-Small-3.1-24B-Instruct-2503 Image-Text-to-Text • Updated 16 days ago • 98.4k • • 1.15k
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper • 2502.11089 • Published Feb 16 • 155