microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated 14 days ago • 619k • 1.32k
Running 552 552 Talking Face Generation with Multilingual TTS 👄 Generate a talking face video from text