May I use DeepSpeed to accelerate the model inference process?
#5 opened by 520jefferson
When I use model.chat(), it is too slow to tolerate. Is there any way to accelerate it?
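For reference, a minimal sketch of wrapping a chat-style checkpoint with DeepSpeed's inference engine; the model name is a placeholder, and it assumes a Hugging Face checkpoint loaded with trust_remote_code=True whose architecture is supported by DeepSpeed kernel injection — not a confirmed recipe from this thread:

```python
# Illustrative sketch only: model name and chat() availability are assumptions.
import torch
import deepspeed
from transformers import AutoModel, AutoTokenizer

model_name = "your-model-name"  # placeholder, not from the thread
tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
model = AutoModel.from_pretrained(model_name, trust_remote_code=True).half().cuda()

# Wrap the model with DeepSpeed's inference engine; kernel injection swaps
# supported modules for fused CUDA kernels where available.
ds_engine = deepspeed.init_inference(
    model,
    mp_size=1,                      # tensor-parallel degree (single GPU here)
    dtype=torch.half,
    replace_with_kernel_inject=True,
)
model = ds_engine.module

# Same chat() call as before, now running through the DeepSpeed engine.
response, history = model.chat(tokenizer, "Hello", history=[])
print(response)
```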
In our GitHub repo, try TensorRT or other acceleration methods.
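The TensorRT instructions live in the project's GitHub and are not reproduced in this thread. As a lighter-weight illustration of an "other acceleration method", the sketch below uses half precision and on-the-fly INT8 quantization; the quantize() helper is shipped by some ChatGLM-style checkpoints' remote code, so its presence is an assumption and is guarded with hasattr:

```python
# Illustrative only: model name is a placeholder; quantize() may not exist on
# every checkpoint, hence the hasattr guard.
import torch
from transformers import AutoModel, AutoTokenizer

model_name = "your-model-name"  # placeholder
tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
model = AutoModel.from_pretrained(model_name, trust_remote_code=True).half()

# INT8 weight quantization, if the checkpoint's remote code provides it.
if hasattr(model, "quantize"):
    model = model.quantize(8)

model = model.cuda().eval()

with torch.inference_mode():
    response, history = model.chat(tokenizer, "Hello", history=[])
print(response)
```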
zRzRzRzRzRzRzR changed discussion status to closed