Submitted by zhoutianyi 34 ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness · 10 authors 4
Submitted by JunhaoZhuang 18 Cobra: Efficient Line Art COlorization with BRoAder References · 6 authors 2
Submitted by YangshenDeng 17 AlayaDB: The Data Foundation for Efficient and Effective Long-context LLM Inference · 16 authors 2
Submitted by panprabh 13 SIFT-50M: A Large-Scale Multilingual Dataset for Speech Instruction Fine-Tuning · 7 authors 2
Submitted by nielsr 11 REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers · 6 authors 2
Submitted by g-h-chen 10 SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models · 8 authors 2
Submitted by JaceyH919 5 Vivid4D: Improving 4D Reconstruction from Monocular Video by Video Inpainting · 5 authors 2
Submitted by CatWorldLee 2 Syzygy of Thoughts: Improving LLM CoT with the Minimal Free Resolution · 10 authors 2
Submitted by SunshineWu 2 BlockGaussian: Efficient Large-Scale Scene Novel View Synthesis via Adaptive Block-Based Gaussian Splatting · 4 authors 2
Submitted by nthakur 1 FreshStack: Building Realistic Benchmarks for Evaluating Retrieval on Technical Documents · 6 authors 1
Submitted by yunx-z 1 MLRC-Bench: Can Language Agents Solve Machine Learning Research Challenges? · 9 authors 2
Submitted by evijit 1 "It's not a representation of me": Examining Accent Bias and Digital Exclusion in Synthetic AI Voice Services · 6 authors 2