InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models Paper • 2504.10479 • Published 7 days ago • 232
AlphaGaO/DeepSeek-V3-0324-Fused-8E-39B-Unhealed-Preview Text Generation • Updated 13 days ago • 18 • 1
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning Paper • 2502.06781 • Published Feb 10 • 61
Running 2.49k 2.49k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
view article Article Accelerate Large Model Training using PyTorch Fully Sharded Data Parallel May 2, 2022 • 4
Mediator: Memory-efficient LLM Merging with Less Parameter Conflicts and Uncertainty Based Routing Paper • 2502.04411 • Published Feb 6 • 4