Porting v2 models to flash attention

#15

The outputs of the converted model are very close to those of the original model, so the port appears to be working.
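For anyone who wants to repeat this kind of check, here is a minimal sketch of how two sets of outputs could be compared. It is not the script used for this PR: the helper names, the example vectors, and the `atol=1e-3` tolerance are all assumptions made for illustration. In practice the two lists would be the embeddings produced by the original and the converted model for the same input.

```python
import math


def max_abs_diff(a, b):
    """Largest element-wise absolute difference between two vectors."""
    return max(abs(x - y) for x, y in zip(a, b))


def cosine_similarity(a, b):
    """Cosine of the angle between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def outputs_match(original, converted, atol=1e-3):
    """True if both vectors have the same length and agree within atol.

    atol is a hypothetical tolerance; choose one that fits the numeric
    precision the models are run in.
    """
    if len(original) != len(converted):
        return False
    return max_abs_diff(original, converted) <= atol


# Made-up vectors standing in for real model outputs.
original_embedding = [0.12, -0.48, 0.91, 0.05]
converted_embedding = [0.1201, -0.4799, 0.9098, 0.0502]

print("max abs diff:", max_abs_diff(original_embedding, converted_embedding))
print("cosine similarity:", cosine_similarity(original_embedding, converted_embedding))
print("match:", outputs_match(original_embedding, converted_embedding))
```

Reporting both the maximum absolute difference and the cosine similarity is a design choice: the first catches a single badly wrong component, while the second shows whether the overall direction of the embedding is preserved. Small differences are to be expected when the attention kernel or the numeric precision changes.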

Markus28 changed pull request status to open
bwang0911 changed pull request status to merged
