In-browser unified multimodal understanding and generation.
Select and display chatbot code snippets
Engage in multi-modal conversations with images and videos
a tiny vision language model