Model behind?

#2
by thewise - opened

I loved how its able to recognize the video stream. I assume if it can do the similar on recorded videos?
What is the model behind? I tried something similar with GPT4V, but the API is not to work on Humans.

@thewise
hi!

the model is moondream1 :)

Sign up or log in to comment