Kevin

kvnptl

AI & ML interests

Robot perception

Recent Activity

Organizations

None yet

kvnptl's activity

upvoted an article 3 days ago
view article
Article

SmolVLM - small yet mighty Vision Language Model

220
reacted to maxiw's post with 👍 13 days ago
view post
Post
3111
The new Qwen-2 VL models seem to perform quite well in object detection. You can prompt them to respond with bounding boxes in a reference frame of 1k x 1k pixels and scale those boxes to the original image size.

You can try it out with my space maxiw/Qwen2-VL-Detection

·
upvoted an article about 1 month ago
view article
Article

Welcome PaliGemma 2 – New vision language models by Google

147