Running 52 Qwen2.5 Omni 7B Demo: Generate text and speech from input text, audio, images, or video
Qwen2.5-Omni Collection: End-to-end omni (text, audio, image, video, and natural speech interaction) model based on Qwen2.5 • 2 items • Updated about 11 hours ago • 44