Yizhi Song's picture

3 2

Yizhi Song

song630

·

AI & ML interests

GenAI

Recent Activity

upvoted a paper about 3 hours ago

Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting

upvoted a paper about 1 month ago

Token-Efficient Long Video Understanding for Multimodal LLMs

liked a dataset 3 months ago

BaiqiL/GenAI-Bench-1600

View all activity

Organizations

None yet

song630's activity

upvoted a paper about 3 hours ago

Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting

Paper • 2504.05541 • Published 3 days ago • 9

upvoted a paper about 1 month ago

Token-Efficient Long Video Understanding for Multimodal LLMs

Paper • 2503.04130 • Published Mar 6 • 92

upvoted a paper 3 months ago

Generative AI for Cel-Animation: A Survey

Paper • 2501.06250 • Published Jan 8 • 13