CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation Paper • 2501.09433 • Published 4 days ago • 14
SynthLight: Portrait Relighting with Diffusion Model by Learning to Re-render Synthetic Faces Paper • 2501.09756 • Published 3 days ago • 16
FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors Paper • 2501.08225 • Published 5 days ago • 17
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos Paper • 2501.04001 • Published 12 days ago • 40
The GAN is dead; long live the GAN! A Modern GAN Baseline Paper • 2501.05441 • Published 10 days ago • 77
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published 11 days ago • 232
ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation Paper • 2204.12484 • Published Apr 26, 2022 • 2
TransPixar: Advancing Text-to-Video Generation with Transparency Paper • 2501.03006 • Published 13 days ago • 22