(IFEval: 38.24) not expected it to be that low though.

#4
by Flanua - opened

I think the benchmark of IFEval: 38.24 extremely low for chat capabilities. Not expected it to be that low though.

because this is the base model not an instruction tuned model.

Sign up or log in to comment