ReTool: Reinforcement Learning for Strategic Tool Use in LLMs Paper • 2504.11536 • Published 3 days ago • 42
VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning Paper • 2504.08837 • Published 8 days ago • 39
marksverdhei/whisper-norwenglish-large-frankenmerge Automatic Speech Recognition • Updated Mar 8 • 11 • 2
Article Training and Finetuning Reranker Models with Sentence Transformers v4 24 days ago • 112
UI-R1: Enhancing Action Prediction of GUI Agents by Reinforcement Learning Paper • 2503.21620 • Published 22 days ago • 58
MAPS: A Multi-Agent Framework Based on Big Seven Personality and Socratic Guidance for Multimodal Scientific Problem Solving Paper • 2503.16905 • Published 28 days ago • 53
Improving Autoregressive Image Generation through Coarse-to-Fine Token Prediction Paper • 2503.16194 • Published 29 days ago • 8
mistralai/Mistral-Small-3.1-24B-Instruct-2503 Image-Text-to-Text • Updated 10 days ago • 134k • 1.13k