1 30 2

QRQ

RichardQRQ

AI & ML interests

None yet

Recent Activity

upvoted a paper 14 days ago

Qwen2.5-Omni Technical Report

upvoted a paper 21 days ago

STEVE: AStep Verification Pipeline for Computer-use Agent Training

upvoted a paper 21 days ago

TULIP: Towards Unified Language-Image Pretraining

View all activity

Organizations

None yet

RichardQRQ's activity

upvoted a paper 14 days ago

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published 15 days ago • 129

upvoted 2 papers 21 days ago

STEVE: AStep Verification Pipeline for Computer-use Agent Training

Paper • 2503.12532 • Published 24 days ago • 14

TULIP: Towards Unified Language-Image Pretraining

Paper • 2503.15485 • Published 21 days ago • 44

upvoted a paper 23 days ago

Being-0: A Humanoid Robotic Agent with Vision-Language Models and Modular Skills

Paper • 2503.12533 • Published 24 days ago • 63

upvoted a paper 25 days ago

VisualPRM: An Effective Process Reward Model for Multimodal Reasoning

Paper • 2503.10291 • Published 28 days ago • 33

upvoted a paper 27 days ago

R1-Onevision: Advancing Generalized Multimodal Reasoning through Cross-Modal Formalization

Paper • 2503.10615 • Published 27 days ago • 16

upvoted 2 papers 29 days ago

Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders

Paper • 2503.03601 • Published Mar 5 • 227

SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language Models

Paper • 2503.07605 • Published about 1 month ago • 66

upvoted 2 papers about 1 month ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 105

MM-RLHF: The Next Step Forward in Multimodal LLM Alignment

Paper • 2502.10391 • Published Feb 14 • 34

liked a dataset 3 months ago

We-Math/We-Math

Viewer • Updated Sep 6, 2024 • 1.74k • 296 • 18

upvoted 3 papers 3 months ago

Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives

Paper • 2501.04003 • Published Jan 7 • 27

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Paper • 2501.05366 • Published Jan 9 • 102

URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics

Paper • 2501.04686 • Published Jan 8 • 54

liked a dataset 3 months ago

terryoo/TableVQA-Bench

Viewer • Updated Apr 25, 2024 • 1.5k • 1.45k • 22

upvoted 3 papers 3 months ago

upvoted 2 papers 4 months ago

B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners

Paper • 2412.17256 • Published Dec 23, 2024 • 48

AceMath: Advancing Frontier Math Reasoning with Post-Training and Reward Modeling

Paper • 2412.15084 • Published Dec 19, 2024 • 13