wassemgtk posted an update 26 days ago
The Writer team had the opportunity to run an eval for Mixtral-8x22b, and the results were interesting.

| Benchmark | Score |
| --- | --- |
| #mmlu | 77.26 |
| #hellaswag | 88.81 |
| #truthfulqa | 52.05 |
| #arc_challenge | 70.31 |
| #winogrande | 84.93 |
| #gsm8k | 76.65 |

75 Average 🤔
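For anyone who wants to check the arithmetic, here is a minimal sketch that recomputes the average from the six scores above. It assumes the 75 is a plain unweighted mean; the variable names are illustrative and not from the original eval run. As a comparison point, it also prints the mean with truthfulqa left out.

```python
from statistics import mean

# Scores copied from the table in the post.
scores = {
    "mmlu": 77.26,
    "hellaswag": 88.81,
    "truthfulqa": 52.05,
    "arc_challenge": 70.31,
    "winogrande": 84.93,
    "gsm8k": 76.65,
}

# Assumption: the reported average is an unweighted mean over all six tasks.
average = mean(scores.values())
print(f"Average over {len(scores)} benchmarks: {average:.2f}")

# Same mean with truthfulqa excluded, for comparison only.
without_truthfulqa = mean(v for k, v in scores.items() if k != "truthfulqa")
print(f"Average without truthfulqa: {without_truthfulqa:.2f}")
```

The unweighted mean comes out to about 75.00, which matches the figure in the post; without truthfulqa it is about 79.59.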

·

but a TruthfulQA score of 50+ is very rare, so 80 average