Can this model run on the A100/sm80 architecture?

#3
by djakip - opened

Anyone tried this yet?

I successfully ran the program on 8*a100 40g. However, after installing VLLM using the code provided by the author, it is necessary to modify the “vllm/_custom_ops.py” file to enable support for kv_cache_dtype.

I successfully ran the program on 8*a100 40g. However, after installing VLLM using the code provided by the author, it is necessary to modify the “vllm/_custom_ops.py” file to enable support for kv_cache_dtype.

Thanks for your reply. Could you share your patch code? I would really appreciate it.

Sign up or log in to comment