Text model not being loaded with Flash Attention 2

#27

by starzmustdie - opened Apr 22, 2024

Apr 22, 2024

No description provided.

Apr 22, 2024

Yes, you are correct @starzmustdie.
We left that as an improvement when integrating into HF Transformers, but I just opened a public issue to track this: https://github.com/huggingface/transformers/issues/30394

Apr 22, 2024

After some time debugging, I found that this was the reason I was getting OOM errors when trying to fine-tune the model.

I would appreciate feedback on necessary changes.

Apr 22, 2024

Wow, let's go!! Issue-to-PR time = 13 mins, hehe.
cc @ArthurZ @amyeroberts

May 1, 2024

VictorSanh changed pull request status to closed May 1, 2024
