Error occurred when executing XlabsSampler:

#4
by FELEXSH - opened

I have a question.My GPU is 2080ti.
"
Error occurred when executing XlabsSampler:

No operator found for memory_efficient_attention_forward with inputs:
query : shape=(24, 4352, 1, 128) (torch.bfloat16)
key : shape=(24, 4352, 1, 128) (torch.bfloat16)
value : shape=(24, 4352, 1, 128) (torch.bfloat16)
attn_bias :
p : 0.0
decoderF is not supported because:
attn_bias type is
bf16 is only supported on A100+ GPUs
flshattF@v2.5.7 is not supported because:
requires device with capability > (8, 0) but your GPU has capability (7, 5) (too old)
bf16 is only supported on A100+ GPUs
cutlassF is not supported because:
bf16 is only supported on A100+ GPUs
smallkF is not supported because:
max(query.shape[-1] != value.shape[-1]) > 32
dtype=torch.bfloat16 (supported: {torch.float32})
bf16 is only supported on A100+ GPUs
unsupported embed per head: 128"

btw,the modle I loadde is dev-fp8.

Sign up or log in to comment