
How to load the model with multiple GPUs

#5
by Sven00 - opened

I have not found any guidance on how to load the model and run inference with multiple GPUs. The instructions provided by MosaicML cover only a single GPU. Thank you.

Having the same issue. You can load the model by setting device_map="auto", which distributes the weights across the GPUs (it does not speed anything up), but I am still having issues with inference.
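For anyone else hitting this, here is a minimal sketch of that approach, assuming `accelerate` is installed. The model name (`mosaicml/mpt-7b`), dtype, and generation settings are placeholder assumptions on my part, not an official MosaicML recipe:

```python
import torch
import transformers

name = "mosaicml/mpt-7b"  # placeholder; substitute the MPT variant you are using

# device_map="auto" (requires the accelerate package) shards the layers
# across all visible GPUs so the full model fits in memory
model = transformers.AutoModelForCausalLM.from_pretrained(
    name,
    torch_dtype=torch.bfloat16,  # halves memory vs. fp32
    trust_remote_code=True,      # MPT ships custom modeling code on the Hub
    device_map="auto",
)
tokenizer = transformers.AutoTokenizer.from_pretrained(name)

# Inputs go to the first device; accelerate moves activations between GPUs
inputs = tokenizer("Here is a recipe for vegan banana bread:\n", return_tensors="pt").to("cuda:0")
with torch.no_grad():
    outputs = model.generate(**inputs, max_new_tokens=100, do_sample=True, temperature=0.8)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

Note that device_map="auto" splits the model layer-by-layer and runs them sequentially, so only one GPU is active at a time. That is why it solves the memory problem but does not make generation any faster.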

@abhi-mosaic maybe you can help us out here?

Having the same issue with inference: the model loads fine across multiple GPUs, but inference is very slow. Any updates?
