Tianhao Liang

tianhao2k

AI & ML interests

None yet

tianhao2k's activity

authored a paper 13 days ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20 • 103

upvoted a paper 13 days ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20 • 103

upvoted a collection about 2 months ago

MEGA-Bench

Collection

4 items • Updated Oct 19, 2024 • 2

upvoted a paper 2 months ago

Logical Reasoning in Large Language Models: A Survey

Paper • 2502.09100 • Published Feb 13 • 23

upvoted 2 papers 3 months ago

ACECODER: Acing Coder RL via Automated Test-Case Synthesis

Paper • 2502.01718 • Published Feb 3 • 28

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

Paper • 2501.17703 • Published Jan 29 • 59

upvoted a paper 5 months ago

VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by Video Spatiotemporal Augmentation

Paper • 2412.00927 • Published Dec 1, 2024 • 28

updated a dataset 5 months ago

TIGER-Lab/MEGA-Bench

Viewer • Updated 28 days ago • 12.6k • 1.76k • 20

upvoted a paper 5 months ago

OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision

Paper • 2411.07199 • Published Nov 11, 2024 • 50

liked a dataset 6 months ago

TIGER-Lab/MEGA-Bench

Viewer • Updated 28 days ago • 12.6k • 1.76k • 20

New activity in TIGER-Lab/MEGA-Bench 6 months ago

Dataset Viewer issue: JobManagerCrashedError

#1 opened 6 months ago by wenhu

authored a paper 6 months ago

MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks

Paper • 2410.10563 • Published Oct 14, 2024 • 39

updated a collection 6 months ago

MEGA-Bench

Collection

4 items • Updated Oct 19, 2024 • 2

liked a Space 6 months ago

MEGA-Bench Leaderboard

🥇

A leaderboard for multimodal models
