Microsoft Phi-3-Vision-128k
Generate image descriptions
Chat with documents: PDFs, web pages, CSV/Excel
Rerun viewer with Gradio
A private and powerful multimodal AI chatbot that runs locally
Dense Grounded Understanding of Images and Videos
Visual question answering using the BLIP model
Create and run visual BASIC programs easily
Duplicate this leaderboard to initialize your own!
Magma-8B model for UI Agents
Image captioning and VQA
Generate animated Voronoi patterns as cloth
Display real-time AI analysis and dynamic graphs
Analyze video frames to tag objects
Display weather data from multiple sources
Interact with images and text using Visual ChatGPT
Follow visual instructions in Chinese
Visualize insurance claims workflow
Select a cell type to generate a gene expression plot
Compare different visual question answering models
Visualize mid and low-level features in models
Explore data and identity distortion in generative AI
Demo of batch processing with moondream
Ask questions about images and get answers
Try PaliGemma on document understanding tasks