Armin Banfalvi
abanfalvi
·
AI & ML interests
None yet
Recent Activity
updated a collection 2 days ago
Model Technical Reports updated a collection 3 days ago
Surveys updated a collection 3 days ago
SurveysOrganizations
Surveys
-
A Survey of On-Policy Distillation for Large Language Models
Paper • 2604.00626 • Published • 13 -
World Action Models: A Survey
Paper • 2606.20781 • Published • 56 -
Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models
Paper • 2505.04921 • Published • 187 -
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Paper • 2509.02547 • Published • 239
Model Technical Reports
Distillation
Surveys
-
A Survey of On-Policy Distillation for Large Language Models
Paper • 2604.00626 • Published • 13 -
World Action Models: A Survey
Paper • 2606.20781 • Published • 56 -
Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models
Paper • 2505.04921 • Published • 187 -
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Paper • 2509.02547 • Published • 239
VLM Reasoning
Model Technical Reports