Using the flash attention option

#6
by lentan - opened

config.json seems to say the model is using torch attention, but switching it to flash attention raises an error saying it's unimplemented with ALiBi.

Edit: sorry, just use triton; it's in the README!
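
For anyone landing here later, a minimal sketch of the README approach: override `attn_impl` in the config before loading. The checkpoint name `mosaicml/mpt-7b` below is an assumption for illustration; substitute the model this discussion belongs to.

```python
import torch
import transformers

# Checkpoint name is an assumption; use the model you're actually loading.
name = 'mosaicml/mpt-7b'

# Load the config first so the attention implementation can be switched
# before the model is instantiated.
config = transformers.AutoConfig.from_pretrained(name, trust_remote_code=True)
config.attn_config['attn_impl'] = 'triton'  # 'flash' doesn't support ALiBi

model = transformers.AutoModelForCausalLM.from_pretrained(
    name,
    config=config,
    torch_dtype=torch.bfloat16,  # triton kernels expect fp16/bf16 weights
    trust_remote_code=True,
)
```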

lentan changed discussion status to closed
Mosaic ML, Inc. org

You beat me to it :)
