VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks Paper • 2504.05118 • Published 5 days ago • 22
T1: Tool-integrated Self-verification for Test-time Compute Scaling in Small Language Models Paper • 2504.04718 • Published 6 days ago • 36
Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-Generalized Semantic Segmentation Paper • 2504.03193 • Published 9 days ago • 5
DiaTool-DPO: Multi-Turn Direct Preference Optimization for Tool-Augmented Large Language Models Paper • 2504.02882 • Published 11 days ago • 6
Why Reasoning Matters? A Survey of Advancements in Multimodal Reasoning (v1) Paper • 2504.03151 • Published 9 days ago • 11
Are You Getting What You Pay For? Auditing Model Substitution in LLM APIs Paper • 2504.04715 • Published 6 days ago • 11
Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning Models Paper • 2504.04823 • Published 6 days ago • 27
SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published 5 days ago • 152
Slow-Fast Architecture for Video Multi-Modal Large Language Models Paper • 2504.01328 • Published 11 days ago • 7
MedSAM2: Segment Anything in 3D Medical Images and Videos Paper • 2504.03600 • Published 8 days ago • 8
TransMamba: Flexibly Switching between Transformer and Mamba Paper • 2503.24067 • Published 12 days ago • 15
VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning Paper • 2504.02949 • Published 9 days ago • 18
SynWorld: Virtual Scenario Synthesis for Agentic Action Knowledge Refinement Paper • 2504.03561 • Published 8 days ago • 16
APIGen-MT: Agentic Pipeline for Multi-Turn Data Generation via Simulated Agent-Human Interplay Paper • 2504.03601 • Published 8 days ago • 14
JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization Paper • 2503.23377 • Published 13 days ago • 49
meta-llama/Llama-4-Maverick-17B-128E-Instruct Image-Text-to-Text • Updated 3 days ago • 26.8k • • 285