ArthurZ 
posted an update Mar 7

Could you share the conversion script?

The weights are compatible out of the box; you just need to set the config correctly!

Is the 2.8B model the one trained on the Pile or the one trained on SlimPJ?

Hi, great work! Is it possible to train Mamba from scratch using transformers, rather than only fine-tuning it from a pretrained model? If so, would you mind sharing some sample code? I want to test this model with different numbers of parameters.