GPU/CPU leakage
#7, opened by felfri
There seems to be a compute/memory leak across devices when running the model: a) calling pipe.to('cuda:2') moves only part of the model to that device, and b) when running torch.compile(pipe.transformer) I get the warning "skipping cudagraphs due to skipping cudagraphs due to cpu device." The warning suggests that some tensors in the transformer are still on the CPU.
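To narrow this down, something like the following rough sketch could list which parameters and buffers are not on the requested device. It is not taken from the model's code: the helper names tensors_by_device and report_stragglers are made up for illustration, and it only assumes the object behaves like a torch.nn.Module (named_parameters() and named_buffers() yielding tensors with a .device attribute).

```python
from collections import defaultdict


def tensors_by_device(module):
    """Group parameter and buffer names by the device they live on.

    Assumption: `module` behaves like a torch.nn.Module, i.e. it exposes
    named_parameters() and named_buffers() yielding (name, tensor) pairs.
    """
    placement = defaultdict(list)
    for name, tensor in module.named_parameters():
        placement[str(tensor.device)].append(name)
    for name, tensor in module.named_buffers():
        placement[str(tensor.device)].append(name)
    return dict(placement)


def report_stragglers(module, expected="cuda:2"):
    """Return the sorted names of tensors that are not on `expected`."""
    return sorted(
        name
        for device, names in tensors_by_device(module).items()
        if device != expected
        for name in names
    )


# Hypothetical usage with the pipeline from this thread:
# pipe.to("cuda:2")
# print(report_stragglers(pipe.transformer))
```

If the returned list is non-empty after pipe.to('cuda:2'), those names would be the first place to look for whatever keeps part of the model on the CPU.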