Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 8 items • Updated 7 days ago • 108
RLVR Collection Model and data for 'Expanding RL with Verifiable Rewards Across Diverse Domains' • 3 items • Updated 10 days ago • 10
RWKV-7 "Goose" with Expressive Dynamic State Evolution Paper • 2503.14456 • Published 22 days ago • 137
Hamanasu Collection A brand new series of Models from yours truly, Designed for Intelligence, Creativity and Roleplay - R/Locallama keeps DELETING MY GODDAMN COMMENTS • 31 items • Updated 3 days ago • 8
Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond Paper • 2503.10460 • Published 27 days ago • 27
DeepHermes Collection Preview models of hybrid reasoner Hermes series • 6 items • Updated 27 days ago • 27
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 29 days ago • 380
EuroBERT: Scaling Multilingual Encoders for European Languages Paper • 2503.05500 • Published Mar 7 • 76
SYNTHETIC-1 Collection A collection of tasks & verifiers for reasoning datasets • 9 items • Updated Feb 20 • 50