Text Generation
Transformers
PyTorch
mpt
Composer
MosaicML
llm-foundry
custom_code
text-generation-inference
debisoft's picture
Added output_attentions: bool=False to GroupedQueryAttention.forward() as a temporary fix for AWQ
a43d8a8