Billy C's picture
4

Billy C

bmxtiger
ยท

AI & ML interests

None yet

Recent Activity

Organizations

None yet

bmxtiger's activity

reacted to merve's post with ๐Ÿ‘ about 1 year ago
view post
Post
I love vision language models ๐Ÿ’—
My favorite is KOSMOS-2, because it's a grounded model (it doesn't hallucinate).
In this demo you can,
- ask a question about the image,
- do detailed/brief captioning,
- localize the objects! ๐Ÿคฏ
It's just amazing for VLM to return bounding boxes ๐Ÿคฉ
Try it here merve/kosmos2