nan values in the hidden_states

#5
by puar-playground - opened

This checkpoint produces nan values in the hidden_states. It sometimes stuck at inference too.

OpenGVLab org

Hello, thank you for your feedback. Could you please provide a simple example so that I can reproduce the issue? I haven't encountered this model outputting NaN hidden states.

line 1119, in forward
position_ids = position_ids.view(-1, seq_length).long()
RuntimeError: shape '[-1, 0]' is invalid for input of size 846

I keep getting this error.

OpenGVLab org

line 1119, in forward
position_ids = position_ids.view(-1, seq_length).long()
RuntimeError: shape '[-1, 0]' is invalid for input of size 846

I keep getting this error.

Thanks for the feedback. This problem should be fixed yesterday, you can try it again.

czczup changed discussion status to closed

Sign up or log in to comment