flash attention
#21, opened by Disassemblern
Is there any way to use this model for vector embeddings without requiring the flash attention library? My GPU VM is not compatible with flash attention.