Yoloe 🚀 Running on Zero • 30 • Identify and segment objects in images using text prompts, visual prompts, or no prompt
microsoft/Phi-4-multimodal-instruct • Automatic Speech Recognition • Updated 1 day ago • 441k • 1.12k