Qwen
/

Qwen1.5-32B-Chat-AWQ

推理速度比14B-AWQ慢很多,是否正常

by william0014 - opened Apr 11, 2024

Apr 11, 2024

同样内容的回复, 14B-AWQ 为 6秒, 32B-AWQ为20秒.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment