error when trying inference locally
#8 by hussainwali1 - opened
RuntimeError: Please install flash-attn==1.0.3.post0 and triton==2.0.0.dev20221202
If you would like to use the triton implementation, you will need to do as the message says and install flash-attn and triton. Otherwise, you can stick with the torch implementation without issue.
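For reference, a minimal sketch of selecting the torch implementation, assuming an MPT-style model loaded through `transformers` with `trust_remote_code` (the model name here is illustrative, and the `attn_config` key layout is an assumption based on MPT-style configs):

```python
from transformers import AutoConfig, AutoModelForCausalLM

model_name = "mosaicml/mpt-7b"  # illustrative; use your own checkpoint

config = AutoConfig.from_pretrained(model_name, trust_remote_code=True)
# Switch to the torch attention implementation so flash-attn and triton
# are not required (assumed config key for MPT-style models).
config.attn_config["attn_impl"] = "torch"

model = AutoModelForCausalLM.from_pretrained(
    model_name, config=config, trust_remote_code=True
)
```

If you do want the triton path instead, install the exact versions from the error message (`flash-attn==1.0.3.post0`, `triton==2.0.0.dev20221202`) before loading the model.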
daking changed discussion status to closed