DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation Paper โข 2501.16764 โข Published 3 days ago โข 14
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper โข 2501.17161 โข Published 2 days ago โข 48
Denoising as Adaptation: Noise-Space Domain Adaptation for Image Restoration Paper โข 2406.18516 โข Published Jun 26, 2024 โข 2
GeoPixel: Pixel Grounding Large Multimodal Model in Remote Sensing Paper โข 2501.13925 โข Published 7 days ago โข 5
Running on Zero 1.03k ๐ Chat With Janus-Pro-7B A unified multimodal understanding and generation model.
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper โข 2501.12948 โข Published 9 days ago โข 274
Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps Paper โข 2501.09732 โข Published 14 days ago โข 66
Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks Paper โข 2501.08326 โข Published 16 days ago โข 31
Diffusion Adversarial Post-Training for One-Step Video Generation Paper โข 2501.08316 โข Published 16 days ago โข 32
MiniMax-01: Scaling Foundation Models with Lightning Attention Paper โข 2501.08313 โข Published 16 days ago โข 271
The Lessons of Developing Process Reward Models in Mathematical Reasoning Paper โข 2501.07301 โข Published 18 days ago โข 89
LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs Paper โข 2501.06186 โข Published 20 days ago โข 59