
microsoft/Phi-4-multimodal-instruct
Automatic Speech Recognition
โข
Updated
โข
800k
โข
1.3k
Overlay garment on person image
Upgraded to v1.0!
Scalable and Versatile 3D Generation from images
Audio Conditioned LipSync with Latent Diffusion Models
View and filter AI model releases in 2024