4 33 11

Xiaoye Qu

Xiaoye08

AI & ML interests

None yet

Recent Activity

upvoted a paper about 23 hours ago

Learning to Reason under Off-Policy Guidance

upvoted a paper 4 days ago

NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation

upvoted a paper 7 days ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

View all activity

Organizations

Xiaoye08's activity

upvoted a paper about 23 hours ago

Learning to Reason under Off-Policy Guidance

Paper • 2504.14945 • Published 1 day ago • 58

upvoted a paper 4 days ago

NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation

Paper • 2504.13055 • Published 5 days ago • 18

upvoted a paper 7 days ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published 8 days ago • 239

upvoted 2 papers 19 days ago

Inference-Time Scaling for Generalist Reward Modeling

Paper • 2504.02495 • Published 20 days ago • 53

Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme

Paper • 2504.02587 • Published 20 days ago • 30

upvoted 3 papers 22 days ago

Efficient Inference for Large Reasoning Models: A Survey

Paper • 2503.23077 • Published 25 days ago • 46

What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models

Paper • 2503.24235 • Published 22 days ago • 53

Effectively Controlling Reasoning Models through Thinking Intervention

Paper • 2503.24370 • Published 22 days ago • 19

authored a paper 23 days ago

A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond

Paper • 2503.21614 • Published 26 days ago • 39

commented a paper 23 days ago

A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond

Paper • 2503.21614 • Published 26 days ago • 39 •

upvoted a paper 23 days ago

A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond

Paper • 2503.21614 • Published 26 days ago • 39

commented a paper 23 days ago

A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond

Paper • 2503.21614 • Published 26 days ago • 39 •

upvoted a paper 30 days ago

From Head to Tail: Towards Balanced Representation in Large Vision-Language Models through Adaptive Data Calibration

Paper • 2503.12821 • Published Mar 17 • 9

authored a paper 30 days ago

From Head to Tail: Towards Balanced Representation in Large Vision-Language Models through Adaptive Data Calibration

Paper • 2503.12821 • Published Mar 17 • 9

upvoted 2 papers about 1 month ago

AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning

Paper • 2503.07608 • Published Mar 10 • 23

Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published Mar 7 • 120

authored a paper about 1 month ago

Linear-MoE: Linear Sequence Modeling Meets Mixture-of-Experts

Paper • 2503.05447 • Published Mar 7 • 7

upvoted a paper about 1 month ago

Linear-MoE: Linear Sequence Modeling Meets Mixture-of-Experts

Paper • 2503.05447 • Published Mar 7 • 7

upvoted 2 papers about 2 months ago

Visual-RFT: Visual Reinforcement Fine-Tuning

Paper • 2503.01785 • Published Mar 3 • 77

Liger: Linearizing Large Language Models to Gated Recurrent Structures

Paper • 2503.01496 • Published Mar 3 • 17