Error!

#63
by BK-Lee - opened
ZeroGPU Explorers org

image.png

How can solve this error?

I did @space .GPU.

ZeroGPU Explorers org

It think it just takes a while during busy times to get a GPU out of the cluster. You just need to wait a bit.

ZeroGPU Explorers org

can we not use bitsandbytes with ZeroGPU?

In addition, I would like to know how to use Flash attention with ZeroGPU!!

ZeroGPU Explorers org
  1. You can use bitsandbytes, I've used it myself.
  2. I'm not sure, haven't needed it.
ZeroGPU Explorers org

image.png

One more question! Is there any way to overcome this issue?

Only way of waiting time...

I cant understand I only debug the code with one text input: hi...

ZeroGPU Explorers org

Well, you still exceeded your quota. The quota is fixed for any kind of usage, because you still use a costly GPU when debugging.

ZeroGPU Explorers org

For first problem:
Use accelerate in requirements.txt, use @spaces.GPU(queue=False) and use default theme and UI (Yes UI causes this issue, I found it today).

For second one:
use this
@spaces.GPU(queue=False, time=30sec)
Choose a time that meets your needs. However, if a query exceeds this duration, task will be terminated.

ZeroGPU Explorers org

image.png

Actually, I am worried about its warning issue for bitsandbytes! @Artples did you happen to see?

ZeroGPU Explorers org

Yeah, that's normal for the ZeroGPU Runtime. It should still work, atleast it worked for me.

ZeroGPU Explorers org
edited May 31

I solved the issue of installing flash attention

# flash attention
import subprocess
subprocess.run('pip install flash-attn --no-build-isolation', env={'FLASH_ATTENTION_SKIP_CUDA_BUILD': "TRUE"}, shell=True)

I hope to install causal-conv1d and mamba-ssm libraries too :)

Sign up or log in to comment