DAPO: An Open-Source LLM Reinforcement Learning System at Scale Paper • 2503.14476 • Published 17 days ago • 112
Running on L4 1.73k MagicQuill 🪶 Edit and enhance images with custom color and edge modifications
Running 2.4k The Ultra-Scale Playbook The ultimate guide to training LLM on large GPU Clusters
Assisting in Writing Wikipedia-like Articles From Scratch with Large Language Models Paper • 2402.14207 • Published Feb 22, 2024 • 8
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published Jan 28 • 118
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 11 items • Updated 4 days ago • 435
LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs Paper • 2501.06186 • Published Jan 10 • 65
Running on CPU Upgrade 2.07k Anychat Select and display code snippets for different AI providers
Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 654