Sleeping Agents Vision-Language Web UI (HW3) π§ Generate captions, answer questions, and search images using text