
microsoft/Phi-4-multimodal-instruct-onnx
Automatic Speech Recognition • Updated • 481 • 39
PDF to Structured Data powered by Google DeepMind Gemini 2.0
Real-time captions with Moonshine ONNX
Interact with a multimodal AI model using text, images, and audio
Generate thoughts based on hand gestures
Magma playing video games
vggt (alpha test)