[Bug] I have a error

#75
by Arigadam - opened

I have a error:

torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 76.00 MiB. GPU 0 has a total capacty of 8.00 GiB of which 0 bytes is free. Of the allocated memory 7.22 GiB is allocated by PyTorch, and 43.56 MiB is reserved by PyTorch but unallocated. If reserved but unallocated memory is large try setting max_split_size_mb to avoid fragmentation.  See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF

How to fix?

pass load_in_4bit=True to lorad_pretrained

Microsoft org

Hello @Arigadam ! Please use any quantization approach or try lowering the micro batch-size.

gugarosa changed discussion status to closed

Sign up or log in to comment