davidkim205/Rhea-72b-v0.5


davidkim205 committed on Apr 8, 2024

Commit bc3806e (1 parent: 40bd979)

Update README.md

Files changed (1)

README.md +0 -6

README.md CHANGED

@@ -217,12 +217,6 @@ This method proposes a novel method for generating datasets for DPO (Self-superv
 Randomly selecting data from each category within the training dataset, we constructed a DPO (Direct Preference Optimization) dataset using sentences with logits lower than the mean within the model-generated sentences.
 * I'm sorry I can't reveal it.
-## Evaluation
-### [Open LLM Leaderboard](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
-| **model**     | **average** | **arc** | **hellaswag** | **mmlu** | **truthfulQA** | **winogrande** | **GSM8k** |
-| ------------- | ----------- | ------- | ------------- | -------- | -------------- | -------------- | --------- |
-| Rhea-72b-v0.5 | 81.22       | 79.78   | 91.15         | 77.95    | 74.5           | 87.85          | 76.12     |
 # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
 Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_davidkim205__Rhea-72b-v0.5)
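
The context lines of this hunk describe the dataset construction only briefly, and the README explicitly declines to reveal further detail. As a rough illustration of the stated idea (sample records from each category, then keep model-generated sentences whose score falls below the mean), here is a minimal sketch. Everything in it is an assumption for illustration: the record fields (`category`, `prompt`, `reference`, `generated`, `score`), the function names, the use of one scalar score per sentence as a stand-in for "logits", and the choice to treat the low-scoring generation as the rejected side of a DPO pair. It is not the author's implementation.

```python
import random
from collections import defaultdict
from statistics import mean


def sample_per_category(records, k, seed=0):
    """Randomly pick up to k records from each category (hypothetical schema)."""
    rng = random.Random(seed)
    by_category = defaultdict(list)
    for record in records:
        by_category[record["category"]].append(record)
    picked = []
    for category in sorted(by_category):
        items = by_category[category]
        picked.extend(rng.sample(items, min(k, len(items))))
    return picked


def build_dpo_pairs(records):
    """Keep generations scoring below the mean score across all records.

    Assumption: the reference answer is used as 'chosen' and the
    low-scoring model generation as 'rejected'. The README does not
    state how the pairs are actually formed.
    """
    threshold = mean(record["score"] for record in records)
    return [
        {
            "prompt": record["prompt"],
            "chosen": record["reference"],
            "rejected": record["generated"],
        }
        for record in records
        if record["score"] < threshold
    ]
```

The resulting `prompt` / `chosen` / `rejected` dictionaries follow the column layout commonly used for preference datasets; how the scores are computed from the model's logits is left open here because the source does not specify it.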
