RuntimeError: FlashAttention only supports Ampere GPUs or newer.

#11
by pravinkarpe - opened

I got the error below when running the model in the Colab notebook.
RuntimeError: FlashAttention only supports Ampere GPUs or newer.
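
For context: this error is raised when FlashAttention runs on a GPU older than the Ampere generation (CUDA compute capability below 8.0). Colab often assigns a T4, which is a Turing card with compute capability 7.5. The snippet below is a minimal sketch of one possible workaround, not a fix confirmed in this thread: check the compute capability and fall back to the "eager" attention implementation on older GPUs. The helper names `pick_attn_implementation` and `detect_compute_capability` are hypothetical, and `model_id` in the trailing comment is a placeholder for whichever checkpoint you are loading.

```python
def pick_attn_implementation(compute_capability):
    """Choose an attention backend from a (major, minor) compute capability.

    Returns "flash_attention_2" for Ampere (8.x) or newer, otherwise "eager".
    A value of None (no CUDA GPU detected) also falls back to "eager".
    """
    if compute_capability is None:
        return "eager"
    major, _minor = compute_capability
    return "flash_attention_2" if major >= 8 else "eager"


def detect_compute_capability():
    """Return the (major, minor) capability of GPU 0, or None if unavailable."""
    try:
        import torch  # imported lazily so the helper above works without PyTorch
    except ImportError:
        return None
    if not torch.cuda.is_available():
        return None
    return torch.cuda.get_device_capability(0)


attn = pick_attn_implementation(detect_compute_capability())
print(f"attn_implementation = {attn}")

# Assumed usage with transformers (model_id is a placeholder, not from this thread):
# model = AutoModelForCausalLM.from_pretrained(model_id, attn_implementation=attn)
```

On a T4 this selects "eager", so the FlashAttention code path is never entered. Whether a given model repository honors `attn_implementation` depends on its modeling code, so treat this as a starting point rather than a guaranteed fix.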

Microsoft org
nguyenbh changed discussion status to closed