
vikhyatk/moondream2 • Image-Text-to-Text • 131k • 1.06k
https://huggingface.co/papers/2501.03006
Detect and annotate poses in images and videos
Generate text with detailed prompts