atayloraerospace PRO

Taylor658

AI & ML interests

Computer Vision ๐Ÿ”ญ | Multimodal Gen AI ๐Ÿค–| AI in Healthcare ๐Ÿฉบ | AI in Aerospace ๐Ÿš€

Organizations

Posts 17

view post
Post
698
Researchers from Auburn University and the University of Alberta have explored the limitations of Vision Language Models (VLMs) in their recently published paper titled "Vision language models are blind." ( Vision language models are blind (2407.06581))

Key Findings:๐Ÿ”
VLMs, including GPT-4o, Gemini-1.5 Pro, Claude-3 Sonnet, and Claude-3.5 Sonnet, struggle with basic visual tasks.
Tasks such as identifying where lines intersect or counting basic shapes are challenging for these models.
The authors noted, "The shockingly poor performance of four state-of-the-art VLMs suggests their vision is, at best, like of a person with myopia seeing fine details as blurry, and at worst, like an intelligent person that is blind making educated guesses"โ€‹(Vision Language Models Are Blind; 2024)โ€‹.

Human-like Myopia? ๐Ÿ‘“
VLMs may have a blind spot similar to human myopia.
This limitation makes it difficult for VLMs to perceive details.
Suggests a potential parallel between human and machine vision limitations.

Technical Details: ๐Ÿ”ง
The researchers created a new benchmark called BlindTest.
BlindTest consists of simple visual tasks to evaluate VLMs low-level vision capabilities.
Four VLMs were assessed using BlindTest.
Many shortcomings were revealed in the models ability to process basic visual information.

Learn More: ๐Ÿ–ผ๏ธ
For a deeper dive into this research, check out the project page: https://vlmsareblind.github.io/
view post
Post
636
๐ŸŒ Cohere for AI has announced that this July and August, it is inviting researchers from around the world to join Expedition Aya, a global initiative focused on launching projects using multilingual tools like Aya 23 and Aya 101. ๐ŸŒ

Participants can start by joining the Aya server, where all organization will take place. They can share ideas and connect with others on Discord and the signup sheet. Various events will be hosted to help people find potential team members. ๐Ÿค

To support the projects, Cohere API credits will be issued. ๐Ÿ’ฐ

Over the course of six weeks, weekly check-in calls are also planned to help teams stay on track and receive support with using Aya. ๐Ÿ–ฅ๏ธ

The expedition will wrap up at the end of August with a closing event to showcase everyoneโ€™s work and plan next steps. Participants who complete the expedition will also receive some Expedition Aya swag. ๐ŸŽ‰

Links:
Join the Aya Discord: https://discord.com/invite/q9QRYkjpwk
Visit the Expedition Aya Minisite: https://sites.google.com/cohere.com/expedition-aya/home