HAODONG DUAN

KennyUTC

AI & ML interests

Video Understanding; Multi-Modal Learning

Articles

Organizations

KennyUTC's activity

upvoted an article 8 days ago
view article
Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

113
upvoted an article 23 days ago
view article
Article

Vision Language Models Explained

85
upvoted an article 29 days ago
upvoted an article about 1 month ago
view article
Article

Introducing IDEFICS: An Open Reproduction of State-of-the-art Visual Language Model

9