arxiv:2504.15376
Zhiqiu Lin PRO
zhiqiulin
AI & ML interests
None yet
Recent Activity
posted an update 8 days ago
๐ VQAScore now supports text-to-video evaluation!
VQAScore scores how well a generated image or video matches a prompt by asking a VLM "does this show {prompt}?" and using P(Yes). It became a go-to evaluation metric and reward model for image generation (2M+ downloads), and we just added text-to-video support across 20+ VLMs (GPT, Gemini, Qwen). Free and open-source, and it keeps improving as VLMs improve.
๐ป Code: https://github.com/linzhiqiu/t2v_metrics
๐ Paper: https://arxiv.org/abs/2404.01291
๐งต Launch thread + demo video: https://x.com/ZhiqiuLin/status/2064316582461841499 updated a dataset 13 days ago
zhiqiulin/caption_export liked a dataset about 2 months ago
chancharikm/CHAI_testset