--- license: apache-2.0 pipeline_tag: visual-question-answering --- This repository contains the model of the paper [ViSpeak: Visual Instruction Feedback in Streaming Videos](https://arxiv.org/abs/2503.12769). Code: https://github.com/HumanMLLM/ViSpeak