Commit c09ab32 by lighteternal (parent: ea87f9f): Update README.md
license: llama3
---
# Llama3-merge-biomed-8b

This is a DARE-TIES merge of Llama3-8b-Instruct, NousResearch/Hermes-2-Pro-Llama-3-8B, and aaditya/Llama3-OpenBioLLM-8B.

It is a simple experiment to assess whether combining models with strengths in general language understanding and biomedical knowledge can enhance performance on specialized tasks without compromising general applicability.

The results indicate promising outcomes on HendrycksTest tasks related to Biology and Medicine, as well as improvements in complex reasoning, as seen on the ARC Challenge and Winogrande benchmarks.
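For readers who want to reproduce a similar merge, a DARE-TIES recipe can be written as a `mergekit` configuration. The sketch below is an illustrative assumption: the density and weight values, and the choice of base model, are placeholders, not the recipe actually used for this model.

```yaml
# Hypothetical mergekit config for a DARE-TIES merge of the three models
# named above. density/weight values are illustrative assumptions only.
models:
  - model: meta-llama/Meta-Llama-3-8B-Instruct
  - model: NousResearch/Hermes-2-Pro-Llama-3-8B
    parameters:
      density: 0.5
      weight: 0.4
  - model: aaditya/Llama3-OpenBioLLM-8B
    parameters:
      density: 0.5
      weight: 0.4
merge_method: dare_ties
base_model: meta-llama/Meta-Llama-3-8B-Instruct
dtype: bfloat16
```

With a config like this saved as `config.yml`, mergekit's `mergekit-yaml config.yml ./output` command produces the merged checkpoint.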
## Usage

I recommend using the Llama3 prompt template: https://llama.meta.com/docs/model-cards-and-prompt-formats/meta-llama-3/
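To make the recommendation concrete, here is a minimal sketch that assembles a prompt in the Llama3 chat format by hand; the special-token layout follows the Meta Llama 3 prompt-format page linked above, and the helper function name and messages are just illustrative.

```python
# Sketch: build a Llama3-format chat prompt by hand.
# Special tokens follow the Meta Llama 3 prompt format:
#   <|begin_of_text|>, <|start_header_id|>, <|end_header_id|>, <|eot_id|>
def build_llama3_prompt(system_msg: str, user_msg: str) -> str:
    return (
        "<|begin_of_text|>"
        "<|start_header_id|>system<|end_header_id|>\n\n"
        f"{system_msg}<|eot_id|>"
        "<|start_header_id|>user<|end_header_id|>\n\n"
        f"{user_msg}<|eot_id|>"
        # The prompt ends with an open assistant header so the model
        # generates the assistant's reply next.
        "<|start_header_id|>assistant<|end_header_id|>\n\n"
    )

prompt = build_llama3_prompt(
    "You are a helpful biomedical assistant.",
    "What is the function of hemoglobin?",
)
print(prompt)
```

In practice, `tokenizer.apply_chat_template` from the `transformers` library produces this same format from a list of chat messages, so you rarely need to assemble it manually.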
## Leaderboard metrics according to the 🤗 Open LLM Leaderboard

| Task              | Metric              | Ours (%) | Llama3-8B-Instr. (%) | OpenBioLLM-8B (%) |
|-------------------|---------------------|----------|----------------------|-------------------|
| **ARC Challenge** | Accuracy            | 59.39    | 57.17                | 55.38             |
|                   | Normalized Accuracy | 63.65    | 60.75                | 58.62             |