The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters • 2.51k
A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality Article • Mar 4 • 73
Efficient Gaussian Splatting for Monocular Dynamic Scene Rendering via Sparse Time-Variant Attribute Modeling Paper • 2502.20378 • Published Feb 27 • 4
Training Consistency Models with Variational Noise Coupling Paper • 2502.18197 • Published Feb 25 • 7
Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation Paper • 2502.20388 • Published Feb 27 • 16
R1-T1: Fully Incentivizing Translation Capability in LLMs via Reasoning Learning Paper • 2502.19735 • Published Feb 27 • 9
Building Interactable Replicas of Complex Articulated Objects via Gaussian Splatting Paper • 2502.19459 • Published Feb 26 • 11
SoRFT: Issue Resolving with Subtask-oriented Reinforced Fine-Tuning Paper • 2502.20127 • Published Feb 27 • 9
Mobius: Text to Seamless Looping Video Generation via Latent Shift Paper • 2502.20307 • Published Feb 27 • 19
Guardians of the Agentic System: Preventing Many Shots Jailbreak with Agentic System Paper • 2502.16750 • Published Feb 23 • 10
Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think Paper • 2502.20172 • Published Feb 27 • 28
FlexiDiT: Your Diffusion Transformer Can Easily Generate High-Quality Samples with Less Compute Paper • 2502.20126 • Published Feb 27 • 20
Lean and Mean: Decoupled Value Policy Optimization with Global Value Guidance Paper • 2502.16944 • Published Feb 24 • 10
UniTok: A Unified Tokenizer for Visual Generation and Understanding Paper • 2502.20321 • Published Feb 27 • 30
CODESYNC: Synchronizing Large Language Models with Dynamic Code Evolution at Scale Paper • 2502.16645 • Published Feb 23 • 22
FINEREASON: Evaluating and Improving LLMs' Deliberate Reasoning through Reflective Puzzle Solving Paper • 2502.20238 • Published Feb 27 • 24