How to use accelerate evaluate Jamba

#40
by Xidong - opened

I want to load model with bf16, and evaluate it on some dataset.
But when I use zero3 to distrubute model weight into different GPUs, there is a error showing as below:

modeling_jamba.py", line 913, in cuda_kernels_forward
    conv_weights = self.conv1d.weight.view(self.conv1d.weight.size(0), self.conv1d.weight.size(2))
IndexError: Dimension out of range (expected to be in range of [-1, 0], but got 2)

So can you tell me how to evaluate Jamba in bf16 using accelrate ?

Sign up or log in to comment