amd/Qwen1.5-7B-Chat-awq-g128-int4-asym-bf16-onnx-ryzen-strix
Text Generation
•
Updated
•
36
ONNX Runtime generate() API based models quantized by Quark and optimized for Ryzen AI Strix Point NPU