Shiyu Huang's picture

7 5 14

Shiyu Huang

ShiyuHuang

·

https://huangshiyu13.github.io/

AI & ML interests

RL, Game AI, NLP, CV

Recent Activity

commented on a paper 25 days ago

MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents

updated a collection about 1 month ago

video_benchmark

updated a collection about 1 month ago

video_benchmark

View all activity

Organizations

ShiyuHuang's activity

commented a paper 25 days ago

MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents

Paper • 2503.01935 • Published 27 days ago • 25 •

updated a collection about 1 month ago

video_benchmark

3 items • Updated about 1 month ago

upvoted a paper about 1 month ago

MMVU: Measuring Expert-Level Multi-Discipline Video Understanding

Paper • 2501.12380 • Published Jan 21 • 85

updated a collection about 1 month ago

Reasoning

2 items • Updated about 1 month ago

New activity in THUDM/cogvlm2-llama3-caption 2 months ago

keep mentioning "bilibili" watermark

#6 opened 4 months ago by

中文效果怎么样呢

#1 opened 6 months ago by

authored a paper 3 months ago

MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models

Paper • 2501.02955 • Published Jan 6 • 44

liked a dataset 3 months ago

THUDM/MotionBench

Viewer • Updated Jan 8 • 5k • 1.15k • 2

upvoted a paper 3 months ago

MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models

Paper • 2501.02955 • Published Jan 6 • 44

authored a paper 3 months ago

VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

Paper • 2412.21059 • Published Dec 30, 2024 • 19

liked a dataset 3 months ago

AIWinter/LVBench

Updated Sep 13, 2024 • 411 • 3

updated a Space 3 months ago

MotionBench Leaderboard

Submit and view model evaluations on a leaderboard

liked a model 3 months ago

THUDM/VisionReward-Video

Text Generation • Updated Jan 1 • 2.49k • 5

liked a Space 3 months ago

MotionBench Leaderboard

Submit and view model evaluations on a leaderboard

updated 3 Spaces 3 months ago

LVBench Leaderboard

Submit model evaluations to a leaderboard

MotionBench Leaderboard

Submit and view model evaluations on a leaderboard

MotionBench Leaderboard

Submit and view model evaluations on a leaderboard