8 71 21

Jiaheng Liu

CheeryLJH

AI & ML interests

None yet

Recent Activity

authored a paper 6 days ago

IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs

upvoted a paper 6 days ago

IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs

upvoted a paper 8 days ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

View all activity

Organizations

CheeryLJH's activity

authored a paper 6 days ago

IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs

Paper • 2504.15415 • Published 7 days ago • 21

upvoted a paper 6 days ago

IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs

Paper • 2504.15415 • Published 7 days ago • 21

upvoted a paper 8 days ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published 10 days ago • 114

upvoted a paper 12 days ago

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

Paper • 2504.11536 • Published 13 days ago • 59

authored a paper 14 days ago

Mavors: Multi-granularity Video Representation for Multimodal Large Language Model

Paper • 2504.10068 • Published 15 days ago • 30

upvoted a paper 14 days ago

Mavors: Multi-granularity Video Representation for Multimodal Large Language Model

Paper • 2504.10068 • Published 15 days ago • 30

authored a paper 19 days ago

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

Paper • 2504.05535 • Published 21 days ago • 44

upvoted a paper 20 days ago

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

Paper • 2504.05535 • Published 21 days ago • 44

upvoted a paper 25 days ago

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published 28 days ago • 269

upvoted a paper 28 days ago

MoCha: Towards Movie-Grade Talking Character Synthesis

Paper • 2503.23307 • Published 30 days ago • 131

upvoted 2 papers about 1 month ago

Video SimpleQA: Towards Factuality Evaluation in Large Video Language Models

Paper • 2503.18923 • Published Mar 24 • 12

A Comprehensive Survey on Long Context Language Modeling

Paper • 2503.17407 • Published Mar 20 • 49

authored a paper about 2 months ago

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Paper • 2503.08638 • Published Mar 11 • 64

upvoted 2 papers about 2 months ago

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Paper • 2503.08638 • Published Mar 11 • 64

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 111

liked a dataset about 2 months ago

m-a-p/SuperGPQA

Viewer • Updated Mar 4 • 26.5k • 1.63k • 63

upvoted a paper about 2 months ago

HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models

Paper • 2502.20811 • Published Feb 28 • 2

upvoted a paper 2 months ago

Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?

Paper • 2502.19361 • Published Feb 26 • 28

commented a paper 2 months ago

Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?

Paper • 2502.19361 • Published Feb 26 • 28 •

authored a paper 2 months ago

CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models

Paper • 2502.16614 • Published Feb 23 • 27