Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems • Paper • 2504.01990 • Published 18 days ago • 242
Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing • Paper • 2504.02826 • Published 15 days ago • 67
GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation • Paper • 2504.02782 • Published 15 days ago • 55
Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme • Paper • 2504.02587 • Published 15 days ago • 30
SkyReels-A2: Compose Anything in Video Diffusion Transformers • Paper • 2504.02436 • Published 15 days ago • 35
Scaling Analysis of Interleaved Speech-Text Language Models • Paper • 2504.02398 • Published 15 days ago • 27
ShortV: Efficient Multimodal Large Language Models by Freezing Visual Tokens in Ineffective Layers • Paper • 2504.00502 • Published 17 days ago • 21
ZClip: Adaptive Spike Mitigation for LLM Pre-Training • Paper • 2504.02507 • Published 15 days ago • 76
Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation • Paper • 2504.02542 • Published 15 days ago • 41
Instruction-Guided Autoregressive Neural Network Parameter Generation • Paper • 2504.02012 • Published 17 days ago • 6
Efficient Model Selection for Time Series Forecasting via LLMs • Paper • 2504.02119 • Published 16 days ago • 16
GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning • Paper • 2504.00891 • Published 17 days ago • 12
Scaling Laws in Scientific Discovery with AI and Robot Scientists • Paper • 2503.22444 • Published 21 days ago • 12
Interpreting Emergent Planning in Model-Free Reinforcement Learning • Paper • 2504.01871 • Published 16 days ago • 11
Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models • Paper • 2504.02821 • Published 15 days ago • 10
Inference-Time Scaling for Generalist Reward Modeling • Paper • 2504.02495 • Published 15 days ago • 52