Running 2.53k 2.53k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 11 items • Updated 28 days ago • 450
Agent Laboratory: Using LLM Agents as Research Assistants Paper • 2501.04227 • Published Jan 8 • 92
PixMo Collection A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 10 items • Updated Mar 13 • 68
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma • 16 items • Updated 25 days ago • 146