flash attention

#21
by Disassemblern - opened

Is there any way to use this model for vector embedding without requiring flash attention library. Because my gpu vm is not compatible with flash attention.

Your need to confirm your account before you can post a new comment.

Sign up or log in to comment