Vision-R1: Evolving Human-Free Alignment in Large Vision-Language Models via Vision-Guided Reinforcement Learning Paper • 2503.18013 • Published 11 days ago • 18
Spot the Fake: Large Multimodal Model-Based Synthetic Image Detection with Artifact Explanation Paper • 2503.14905 • Published 15 days ago • 19
Long-Context Autoregressive Video Modeling with Next-Frame Prediction Paper • 2503.19325 • Published 9 days ago • 70
Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models Paper • 2503.16419 • Published 14 days ago • 65
Unleashing Vecset Diffusion Model for Fast Shape Generation Paper • 2503.16302 • Published 14 days ago • 42
LEGION: Learning to Ground and Explain for Synthetic Image Detection Paper • 2503.15264 • Published 15 days ago • 19
LEGION: Learning to Ground and Explain for Synthetic Image Detection Paper • 2503.15264 • Published 15 days ago • 19
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception Paper • 2410.12628 • Published Oct 16, 2024 • 36
LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models Paper • 2410.09732 • Published Oct 13, 2024 • 55 • 4
Toward General Instruction-Following Alignment for Retrieval-Augmented Generation Paper • 2410.09584 • Published Oct 12, 2024 • 48
LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models Paper • 2410.09732 • Published Oct 13, 2024 • 55
LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models Paper • 2410.09732 • Published Oct 13, 2024 • 55