Does Mistral support the accelerate library?

#65
by Sp1der - opened

When I try to use Mistral with the following parameters:
--fsdp 'full_shard auto_wrap'
--fsdp_transformer_layer_cls_to_wrap 'LlamaDecoderLayer'
it reports this error:
Exception: Could not find the transformer layer class to wrap in the model.

How can I use Mistral with the accelerate library?

Facing the same issue.

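Sorry for the delay, @Sp1der. Yes, I can confirm: Mistral has no LlamaDecoderLayer, so you need to wrap MistralDecoderLayer instead, i.e. pass --fsdp_transformer_layer_cls_to_wrap 'MistralDecoderLayer'. Let me know if it works!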
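If you are unsure which class name to pass for a given model, one way to check is to list the classes of its submodules. Below is a minimal sketch, not an official utility: `find_layer_classes` is a hypothetical helper, and `FakeModel` plus the empty `MistralDecoderLayer` class are stand-ins so the snippet runs without torch or transformers. It only assumes the model exposes `named_modules()` yielding `(name, module)` pairs, as `torch.nn.Module` does.

```python
from collections import Counter


def find_layer_classes(model, suffix="DecoderLayer"):
    """Count submodule classes whose class name ends with `suffix`.

    Hypothetical helper: works on any object with a `named_modules()`
    method that yields (name, module) pairs.
    """
    counts = Counter(type(module).__name__ for _, module in model.named_modules())
    return {name: n for name, n in counts.items() if name.endswith(suffix)}


# Stand-ins (assumptions for illustration only), so the sketch is runnable
# without loading a real checkpoint.
class MistralDecoderLayer:
    pass


class FakeModel:
    def __init__(self, num_layers):
        self.layers = [MistralDecoderLayer() for _ in range(num_layers)]

    def named_modules(self):
        # Mimic torch.nn.Module.named_modules(): the root first, then children.
        yield "", self
        for i, layer in enumerate(self.layers):
            yield f"model.layers.{i}", layer


print(find_layer_classes(FakeModel(num_layers=4)))
```

With a real model you would pass the loaded model object instead of `FakeModel`; the class name that shows up is the one to give to --fsdp_transformer_layer_cls_to_wrap.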
