Jiaheng Liu's picture

Jiaheng Liu

CheeryLJH

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

authored a paper 4 days ago

Mavors: Multi-granularity Video Representation for Multimodal Large Language Model

upvoted a paper 4 days ago

Mavors: Multi-granularity Video Representation for Multimodal Large Language Model

View all activity

Organizations

CheeryLJH's activity

upvoted a paper 1 day ago

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

Paper • 2504.11536 • Published 3 days ago • 43

upvoted a paper 4 days ago

Mavors: Multi-granularity Video Representation for Multimodal Large Language Model

Paper • 2504.10068 • Published 5 days ago • 28

upvoted a paper 10 days ago

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

Paper • 2504.05535 • Published 11 days ago • 43

upvoted a paper 15 days ago

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published 18 days ago • 242

upvoted a paper 18 days ago

MoCha: Towards Movie-Grade Talking Character Synthesis

Paper • 2503.23307 • Published 20 days ago • 124

upvoted 2 papers 25 days ago

Video SimpleQA: Towards Factuality Evaluation in Large Video Language Models

Paper • 2503.18923 • Published 25 days ago • 12

A Comprehensive Survey on Long Context Language Modeling

Paper • 2503.17407 • Published 29 days ago • 49

upvoted 2 papers about 1 month ago

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Paper • 2503.08638 • Published Mar 11 • 62

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 108

upvoted 5 papers about 2 months ago

HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models

Paper • 2502.20811 • Published Feb 28 • 2

Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?

Paper • 2502.19361 • Published Feb 26 • 28

CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models

Paper • 2502.16614 • Published Feb 23 • 26

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

Paper • 2502.12982 • Published Feb 18 • 16

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20 • 103

upvoted 3 papers 2 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 225

Demystifying Long Chain-of-Thought Reasoning in LLMs

Paper • 2502.03373 • Published Feb 5 • 59

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3 • 60

upvoted 3 papers 3 months ago

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

Paper • 2501.17703 • Published Jan 29 • 59

Improving Video Generation with Human Feedback

Paper • 2501.13918 • Published Jan 23 • 50

Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback

Paper • 2501.12895 • Published Jan 22 • 60