Could you please explain how to run inference with this model?

#4
by carlosbdw - opened

Thank you very much. I usually use vLLM, but it doesn't work with this model.

Any specific ideas on how to run inference with vLLM?

You can't. It's a TensorRT model.

Okay, got it.
