vllm torch transformers auto-gptq optimum