DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMs Paper • 2503.07067 • Published 27 days ago • 30
EXAONE-Deep Collection EXAONE reasoning model series of 2.4B, 7.8B, and 32B, optimized for reasoning tasks including math and coding • 9 items • Updated 19 days ago • 85
XiYanSQL Models Collection The XiYanSQL series comprises foundational SQL models available in various sizes, including 3B, 7B, 14B, and 32B. • 4 items • Updated 27 days ago • 6
Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning Paper • 2502.14768 • Published Feb 20 • 47
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 20 items • Updated 5 days ago • 122
MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications Paper • 2409.07314 • Published Sep 11, 2024 • 56
Bridging Language Barriers in Healthcare: A Study on Arabic LLMs Paper • 2501.09825 • Published Jan 16 • 14
Stronger Models are NOT Stronger Teachers for Instruction Tuning Paper • 2411.07133 • Published Nov 11, 2024 • 38
Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 • 40 items • Updated Nov 28, 2024 • 301
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Feb 26 • 580