File size: 790 Bytes
d8e0bba
e7d9538
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
d8e0bba
e7d9538
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
---
tags:
- medical
- mmlu
- medalpaca
- medmcqa
datasets:
- cais/mmlu
- medalpaca/medical_meadow_medqa
- medalpaca/medical_meadow_wikidoc
- openlifescienceai/medmcqa
- bigbio/med_qa
- GBaker/MedQA-USMLE-4-options
- medalpaca/medical_meadow_mmmlu
- medalpaca/medical_meadow_wikidoc_patient_information
- qiaojin/PubMedQA
pipeline_tag: text-generation
---
### Evaluation results

| Dataset | GPT-3.5 | Tuned Llama 3 V1 | Tuned Llama 3 V2 |
|:-------------:|:-----:|:----:|:----:|
| MMLU Clinical Knowledge   | 69.8| 74.34 | 73.20 |
| MMLU College Biology      | 72.2| 72.92 | 74.30 |
| MMLU College Medicine     | 61.3| 61.85 | 66.47 |
| MMLU Medical Genetics     | 70.0| 76.0  | 74.0  |
| MMLU Professional Medicine| 70.2| 72.43 | 71.32 |
| MMLU Anatomy              | 56.3| 61.48 | 64.44 |