
The provided demo file does not include any reasoning process?

#5
by Alex-Song - opened

The provided demo file does not show any reasoning process and directly outputs the final answer: Man .

Xiaomi Dasheng Team org
edited 1 day ago

The pretrained (SoTA) model only outputs the final answer, corresponding to "GRPO + Prompt <2>" in our technical report. If you want to see the "thinking" process ("GRPO + Prompt <3>"), please visit our GitHub homepage for more details.
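
For anyone handling both kinds of output in one script, here is a rough sketch of a post-processing helper. It is not taken from the technical report or the demo file: the `<think>` and `<answer>` tag names and the `split_response` function are assumptions for illustration only, so check the GitHub page for the format the "thinking" prompt actually produces.

```python
import re
from typing import Optional, Tuple

# ASSUMPTION: the "thinking" variant wraps its output in <think>...</think>
# and <answer>...</answer>. These tag names are hypothetical, not confirmed
# by this thread.
THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)
ANSWER_RE = re.compile(r"<answer>(.*?)</answer>", re.DOTALL)


def split_response(text: str) -> Tuple[Optional[str], str]:
    """Split a decoded response into (thinking, final_answer).

    If no <think> block is present (the answer-only case), thinking is None.
    If no <answer> block is present, everything outside <think> is the answer.
    """
    think = THINK_RE.search(text)
    answer = ANSWER_RE.search(text)
    thinking = think.group(1).strip() if think else None
    if answer:
        final = answer.group(1).strip()
    else:
        final = THINK_RE.sub("", text).strip()
    return thinking, final


if __name__ == "__main__":
    # Answer-only output, like the one reported above.
    print(split_response("Man ."))
    # Hypothetical tagged output.
    print(split_response("<think>The voice is low.</think><answer>Man</answer>"))
```

With an answer-only response the helper returns `None` for the thinking part, so the same code path works whichever prompt is used.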
