
microsoft/Phi-4-multimodal-instruct
Automatic Speech Recognition
•
Updated
•
855k
•
1.25k
Overlay garment on person image
Upgraded to v1.0!
Scalable and Versatile 3D Generation from images
Audio Conditioned LipSync with Latent Diffusion Models
View and filter AI model releases in 2024