
microsoft/Phi-4-multimodal-instruct
Automatic Speech Recognition
β’
Updated
β’
713k
β’
1.3k
Overlay garment on person image
Upgraded to v1.0!
Scalable and Versatile 3D Generation from images
Audio Conditioned LipSync with Latent Diffusion Models
View and filter AI model releases in 2024