debisoft
Added output_attentions: bool=False to GroupedQueryAttention.forward() as a temporary fix for AWQ
a43d8a8
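
A minimal sketch of the kind of signature change this commit message describes. It assumes the AWQ tooling calls the attention module's `forward()` with an `output_attentions` keyword, which the commit message does not state. The class bodies, `LegacyAttention`, and `call_like_wrapper` are hypothetical stand-ins, not the repository's code, and the attention computation itself is omitted.

```python
class LegacyAttention:
    """Stand-in for forward() before the change: no output_attentions parameter."""

    def forward(self, hidden_states):
        return hidden_states


class GroupedQueryAttention:
    """Stand-in for forward() after the change; attention math is omitted."""

    def forward(self, hidden_states, output_attentions: bool = False):
        # The flag is accepted so that callers passing it do not fail.
        # This sketch ignores it and never returns attention weights.
        return hidden_states


def call_like_wrapper(attn, hidden_states):
    # Hypothetical caller, assumed to pass output_attentions as a keyword.
    return attn.forward(hidden_states, output_attentions=False)


# The old signature rejects the extra keyword with a TypeError.
try:
    call_like_wrapper(LegacyAttention(), [1.0, 2.0])
    legacy_failed = False
except TypeError:
    legacy_failed = True

# The new signature accepts the keyword and returns normally.
result = call_like_wrapper(GroupedQueryAttention(), [1.0, 2.0])
```

Defaulting the parameter to `False` keeps existing call sites that omit it working unchanged, which fits the "temporary fix" framing: the argument is tolerated rather than implemented.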