397 115 266

Prince Canuma PRO

prince-canuma

AI & ML interests

None yet

Recent Activity

updated a model about 6 hours ago

mlx-community/Kimi-VL-A3B-Thinking-8bit

updated a collection about 7 hours ago

Kimi-VL Thinking

updated a collection about 7 hours ago

Kimi-VL Thinking

View all activity

Organizations

prince-canuma's activity

upvoted 3 papers 9 days ago

DiaTool-DPO: Multi-Turn Direct Preference Optimization for Tool-Augmented Large Language Models

Paper • 2504.02882 • Published 16 days ago • 6

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published 10 days ago • 160

One-Minute Video Generation with Test-Time Training

Paper • 2504.05298 • Published 10 days ago • 93

upvoted 2 papers 13 days ago

ShortV: Efficient Multimodal Large Language Models by Freezing Visual Tokens in Ineffective Layers

Paper • 2504.00502 • Published 17 days ago • 21

Inference-Time Scaling for Generalist Reward Modeling

Paper • 2504.02495 • Published 14 days ago • 52

upvoted 2 papers 14 days ago

Improved Visual-Spatial Reasoning via R1-Zero-Like Training

Paper • 2504.00883 • Published 16 days ago • 60

Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme

Paper • 2504.02587 • Published 14 days ago • 30

upvoted a collection 14 days ago

ModernBert

Collection

16 items • Updated 14 days ago • 2

upvoted 8 papers 15 days ago

Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1

Paper • 2503.24376 • Published 17 days ago • 37

When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoning

Paper • 2504.01005 • Published 16 days ago • 15

Agent S2: A Compositional Generalist-Specialist Framework for Computer Use Agents

Paper • 2504.00906 • Published 16 days ago • 20

Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources

Paper • 2504.00595 • Published 17 days ago • 34

upvoted a paper 16 days ago

Multi-Token Attention

Paper • 2504.00927 • Published 16 days ago • 43

upvoted 3 papers 17 days ago

Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model

Paper • 2503.24290 • Published 17 days ago • 61

OThink-MR1: Stimulating multimodal generalized reasoning capabilities via dynamic reinforcement learning

Paper • 2503.16081 • Published 28 days ago • 26

A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond

Paper • 2503.21614 • Published 21 days ago • 39