Thank you very much. I used to use vLLM, but it doesn't work with it.
yes please
Any specific ideas on how to run inference with vLLM?
You can't. It's TensorRT.
Okay, got it.