index,model,reasoning,correctness,confidence213,Llama-2-70b-chat-hf,,False,0 213,Llama-2-70b-chat-hf,,False,0 44,Mixtral-8x7B-Instruct-v0.1,,False,0