5 2 22

Zhaoyang Liu

zyliu

liu-zhy

AI & ML interests

Video understanding, 3D Perception, Autonomous driving, Foundation models, AIGC

Recent Activity

new activity 9 days ago

bytedance-research/UI-TARS:Error when using demo

updated a model 22 days ago

zyliu/qwen2vl_test1

published a model 22 days ago

zyliu/qwen2vl_test1

View all activity

Organizations

zyliu's activity

New activity in bytedance-research/UI-TARS 9 days ago

Error when using demo

#2 opened 9 days ago by

zyliu

updated a model 22 days ago

zyliu/qwen2vl_test1

Updated 22 days ago • 5

published a model 22 days ago

zyliu/qwen2vl_test1

Updated 22 days ago • 5

authored 5 papers 4 months ago

InternChat: Solving Vision-Centric Tasks by Interacting with Chatbots Beyond Language

Paper • 2305.05662 • Published May 9, 2023 • 4

Learning Human Motion Representations: A Unified Perspective

Paper • 2210.06551 • Published Oct 12, 2022

VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks

Paper • 2406.08394 • Published Jun 12, 2024

MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions

Paper • 2407.20962 • Published Jul 30, 2024

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 150

upvoted a paper 4 months ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 150

updated a model 4 months ago

zyliu/tmp_model11

Updated Dec 15, 2024 • 8

liked 8 models 4 months ago

updated a model 4 months ago

zyliu/tmp_model10

Updated Nov 25, 2024 • 6

updated a model 5 months ago

zyliu/tmp_model9

Updated Oct 28, 2024 • 4