microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition β’ Updated 15 days ago β’ 622k β’ 1.32k
Running on Zero 1.22k 1.22k Joy Caption Alpha Two π Generate captions for images in various styles