File size: 1,634 Bytes
95d1a9c 87c0eac 9ee7deb |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 |
---
license: apache-2.0
---
### Evaluation
| Tasks |Version|Filter|n-shot| Metric |Value | |Stderr|
|-------------------------------|-------|------|-----:|--------|-----:|---|-----:|
| - medmcqa |Yaml |none | 0|acc |0.5408|± |0.0077|
| | |none | 0|acc_norm|0.5408|± |0.0077|
| - medqa_4options |Yaml |none | 0|acc |0.5711|± |0.0139|
| | |none | 0|acc_norm|0.5711|± |0.0139|
| - anatomy (mmlu) | 0|none | 0|acc |0.6815|± |0.0402|
| - clinical_knowledge (mmlu) | 0|none | 0|acc |0.7434|± |0.0269|
| - college_biology (mmlu) | 0|none | 0|acc |0.8056|± |0.0331|
| - college_medicine (mmlu) | 0|none | 0|acc |0.6647|± |0.0360|
| - medical_genetics (mmlu) | 0|none | 0|acc |0.7300|± |0.0446|
| - professional_medicine (mmlu)| 0|none | 0|acc |0.7353|± |0.0268|
|stem |N/A |none | 0|acc_norm|0.5478|± |0.0067|
| | |none | 0|acc |0.5909|± |0.0058|
| - pubmedqa | 1|none | 0|acc |0.7620|± |0.0191|
|Groups|Version|Filter|n-shot| Metric |Value | |Stderr|
|------|-------|------|-----:|--------|-----:|---|-----:|
|stem |N/A |none | 0|acc_norm|0.5478|± |0.0067|
| | |none | 0|acc |0.5909|± |0.0058|
![Comparison Image](https://huggingface.co/ChenWeiLi/MedPhi-3-mini_v1/resolve/main/compare.png)
|