VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing Paper • 2502.17258 • Published 28 days ago • 76
CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models Paper • 2502.16614 • Published 29 days ago • 26
Reflective Planning: Vision-Language Models for Multi-Stage Long-Horizon Robotic Manipulation Paper • 2502.16707 • Published 29 days ago • 13
Grounded Persuasive Language Generation for Automated Marketing Paper • 2502.16810 • Published 29 days ago • 11
Benchmarking Temporal Reasoning and Alignment Across Chinese Dynasties Paper • 2502.16922 • Published 29 days ago • 8
Investigating the Impact of Quantization Methods on the Safety and Reliability of Large Language Models Paper • 2502.15799 • Published Feb 18 • 7
Pandora3D: A Comprehensive Framework for High-Quality 3D Shape and Texture Generation Paper • 2502.14247 • Published Feb 20 • 6
Early-Exit and Instant Confidence Translation Quality Estimation Paper • 2502.14429 • Published Feb 20 • 4
Mind the Gap! Static and Interactive Evaluations of Large Audio Models Paper • 2502.15919 • Published about 1 month ago • 4
MutaGReP: Execution-Free Repository-Grounded Plan Search for Code-Use Paper • 2502.15872 • Published Feb 21 • 5
Diagnosing COVID-19 Severity from Chest X-Ray Images Using ViT and CNN Architectures Paper • 2502.16622 • Published 29 days ago • 2
M3-AGIQA: Multimodal, Multi-Round, Multi-Aspect AI-Generated Image Quality Assessment Paper • 2502.15167 • Published Feb 21 • 2
Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization Paper • 2502.19261 • Published 26 days ago • 7
Adapting Automatic Speech Recognition for Accented Air Traffic Control Communications Paper • 2502.20311 • Published 25 days ago • 6