(IFEval: 38.24) not expected it to be that low though.

#4
by Flanua - opened

I think the benchmark of IFEval: 38.24 extremely low for chat capabilities. Not expected it to be that low though.

because this is the base model not an instruction tuned model.

Your need to confirm your account before you can post a new comment.

Sign up or log in to comment