view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM Mar 12 • 395
view post Post 4662 Qwen 3 can launch very soon. 👀https://github.com/ggml-org/llama.cpp/pull/12828 See translation 3 replies · 🔥 16 16 👀 9 9 ❤️ 8 8 + Reply
view article Article Training and Finetuning Reranker Models with Sentence Transformers v4 27 days ago • 113
DAPO: An Open-Source LLM Reinforcement Learning System at Scale Paper • 2503.14476 • Published Mar 18 • 120
Slamming: Training a Speech Language Model on One GPU in a Day Paper • 2502.15814 • Published Feb 19 • 69
view article Article Train 400x faster Static Embedding Models with Sentence Transformers Jan 15 • 173
Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning Paper • 2402.06619 • Published Feb 9, 2024 • 57