LLaVA-NeXT-Video-32B-Qwen giving inaccurate video analysis

by shivanis14 - opened Aug 27, 2024

I used this Space: https://huggingface.co/spaces/WildVision/vision-arena. I found it through the GitHub repo: https://github.com/LLaVA-VL/LLaVA-NeXT/tree/main

Video input to the LLM: https://www.youtube.com/watch?v=51gdmOKs4Ek

Prompt: Is the elderly person in the video safe and comfortable?
Response by LLaVA-NeXT-Video-32B-Qwen: Yes, the elderly person appears to be safe and comfortable throughout the video.

The correct response should have been: The elderly person is being physically abused in the video.

At what rate are frames extracted in this demo? I suspect the rate is too low, which would cause the inaccurate response.
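
To make the frame-rate concern concrete, here is a minimal sketch of how a fixed frame budget turns into gaps between sampled frames. It assumes the demo samples a fixed number of frames uniformly across the video, which is a guess and not something confirmed for this Space. The function names `uniform_frame_indices` and `seconds_between_samples` are hypothetical, as are the numbers in the usage note.

```python
def uniform_frame_indices(total_frames: int, num_samples: int) -> list[int]:
    """Pick num_samples frame indices spread evenly across the video.

    Assumption: uniform sampling. The demo's real strategy is unknown.
    """
    if total_frames <= 0 or num_samples <= 0:
        return []
    # Never ask for more frames than the video contains.
    num_samples = min(num_samples, total_frames)
    if num_samples == 1:
        return [total_frames // 2]
    # Spread the samples from the first frame to the last frame.
    step = (total_frames - 1) / (num_samples - 1)
    return [round(i * step) for i in range(num_samples)]


def seconds_between_samples(duration_s: float, num_samples: int) -> float:
    """Average gap in seconds between consecutive sampled frames."""
    if num_samples < 2:
        return duration_s
    return duration_s / (num_samples - 1)
```

For example, with a hypothetical 60-second clip and a budget of 32 frames, `seconds_between_samples(60.0, 32)` comes out to about 1.9 seconds between frames. A short physical action that falls between two sampled frames would never be seen by the model, which may explain a response like the one above.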
