Xiong Wang

xiongwang

wangxiongts

AI & ML interests

Speech, LLM

Recent Activity

new activity 9 days ago

Qwen/Qwen2.5-Omni-7B:The PR referenced in documentation no longer exists

new activity 9 days ago

Qwen/Qwen2.5-Omni-7B:What is the prompt for ASR

updated a model 9 days ago

Qwen/Qwen2.5-Omni-7B

xiongwang's activity

New activity in Qwen/Qwen2.5-Omni-7B 9 days ago

The PR referenced in documentation no longer exists

#37 opened 14 days ago by

What is the prompt for ASR

#39 opened 12 days ago by

updated a model 9 days ago

Qwen/Qwen2.5-Omni-7B

Any-to-Any • Updated 9 days ago • 202k • 1.47k

New activity in Qwen/Qwen2.5-Omni-7B 26 days ago

Executing Qwen2.5-Omni-7B on SGLang: AttributeError: 'Qwen2_5OmniConfig' object has no attribute 'hidden_size'

#21 opened 26 days ago by

New activity in Qwen/Qwen2.5-Omni-7B 27 days ago

Once again the Chinese show who is for good (open source) and who is for evil (grok, open AI (lol open)).

#11 opened 27 days ago by

updated a collection 27 days ago

Qwen2.5-Omni

End-to-End Omni (text, audio, image, video, and natural speech interaction) model based on Qwen2.5 • 3 items • Updated 27 days ago • 90

updated a collection 28 days ago

Qwen2.5-Omni

End-to-End Omni (text, audio, image, video, and natural speech interaction) model based on Qwen2.5 • 3 items • Updated 27 days ago • 90

liked a model 28 days ago

Qwen/Qwen2.5-Omni-7B

Any-to-Any • Updated 9 days ago • 202k • 1.47k

upvoted a collection 28 days ago

Qwen2.5-Omni

End-to-End Omni (text, audio, image, video, and natural speech interaction) model based on Qwen2.5 • 3 items • Updated 27 days ago • 90

published a Space 28 days ago

Qwen2.5 Omni 7B Demo

Generate text and speech responses from text, images, or audio input

authored a paper 4 months ago

VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Paper • 2501.01957 • Published Jan 3 • 46

liked a model 5 months ago

VITA-MLLM/Freeze-Omni

Updated Nov 26, 2024 • 16

updated 3 models 5 months ago

VITA-MLLM/Freeze-Omni

Updated Nov 26, 2024 • 16

VITA-MLLM/Freeze-Omni

Updated Nov 26, 2024 • 16

VITA-MLLM/Freeze-Omni

Updated Nov 26, 2024 • 16