modelscope transformers_stream_generator auto-gptq optimum urllib3<2