Text Generation
Transformers
PyTorch
mpt
Composer
MosaicML
llm-foundry
custom_code
text-generation-inference
ybelkada HF staff commited on
Commit
48ef5f8
1 Parent(s): a88e43e

Update modeling_mpt.py

Browse files

This PR adds the accelerate support for MPT models so that any user could load these models in 8bit / 4bit

Files changed (1) hide show
  1. modeling_mpt.py +1 -0
modeling_mpt.py CHANGED
@@ -23,6 +23,7 @@ Tokenizer = Union[PreTrainedTokenizer, PreTrainedTokenizerFast]
23
  class MPTPreTrainedModel(PreTrainedModel):
24
  config_class = MPTConfig
25
  base_model_prefix = 'model'
 
26
 
27
  class MPTModel(MPTPreTrainedModel):
28
 
 
23
  class MPTPreTrainedModel(PreTrainedModel):
24
  config_class = MPTConfig
25
  base_model_prefix = 'model'
26
+ _no_split_modules=["MPTBlock"]
27
 
28
  class MPTModel(MPTPreTrainedModel):
29