Advanced BLIP2
Image captioning, VQA
Generate animated Voronoi patterns as cloth
Analyze video frames to tag objects
Follow visual instructions in Chinese
Select a cell type to generate a gene expression plot
Compare different visual question answering models
Browse and compare language model leaderboards
Try PaliGemma on document understanding tasks
Create visual diagrams and flowcharts easily
Generate answers to questions about images
Media understanding
Generate insights from charts using text prompts
Transcribe manga chapters with character names
Generate answers using images or videos
Select and visualize language family trees
Visualize AI network mapping: users and organizations
Visualize 3D dynamics with Gaussian Splats
Ask questions about images
Generate dynamic visual patterns
Ask questions about images to get detailed answers
Ask questions about images to get answers
Ask questions about images and get detailed answers
Image captioning, image-text matching, and visual Q&A
PaliGemma2 LoRA fine-tuned on VQAv2