Evenly distribute the model across GPUs?

#32 opened by Shiba

I am using 4 A100s (the 40 GB version) for inference. However, some GPUs run out of CUDA memory after 3 or 4 generations. Do you have any suggestions for fixing this?
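For reference, a minimal sketch of one way to spread the model evenly across the four cards with Accelerate's `device_map`, while capping per-GPU weights so activations and the KV cache have headroom (a common cause of OOM that appears only after a few generations). This assumes the model loads via `transformers`; the model ID below is a hypothetical placeholder, not the checkpoint from this thread.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "huggyllama/llama-7b"  # hypothetical placeholder

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,            # halve the footprint vs. fp32
    device_map="balanced",                # spread layers evenly across GPUs
    # Cap weights at 30 GiB per 40 GB card so activations and the KV cache,
    # which grow with each generation, don't push a GPU over the limit.
    max_memory={i: "30GiB" for i in range(4)},
)

inputs = tokenizer("Hello", return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```

The key knobs here are `device_map="balanced"` (versus `"auto"`, which can pack earlier GPUs more heavily) and `max_memory`, which reserves slack on every card rather than letting the weights fill it.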
