Spaces:

izumi-lab
/

llama-13b-japanese-lora-v0-1ep

Paused

masanorihirano commited on May 30, 2023

Commit

f4d4880

•

1 Parent(s): da3b30c

bug fix

Files changed (1) hide show

app.py CHANGED Viewed

@@ -43,7 +43,7 @@ def load_lora_model(
         model_path,
         load_in_8bit=load_8bit,
         device_map="auto" if device == "cuda" else {"": device},
-        max_memory=max_gpu_memory,
         torch_dtype=torch.float16,
     )
     if lora_weight is not None:

         model_path,
         load_in_8bit=load_8bit,
         device_map="auto" if device == "cuda" else {"": device},
+        max_memory={i: max_gpu_memory for i in range(num_gpus)},
         torch_dtype=torch.float16,
     )
     if lora_weight is not None: