jondurbin/bagel-dpo-8x7b-v0.2

jondurbin committed on Jan 8

Commit d0211f7 • 1 Parent(s): 43f209d

Update README.md

Files changed (1)

README.md +7 -0

@@ -76,6 +76,7 @@ I didn't run any sort of comprehensive set of benchmarks, but here are a couple
 | model | score |
 | --- | --- |
 | bagel-dpo-8x7b-v0.2 | __0.7242__ |
+| mixtral-8x7b-instruct-v0.1 | 0.6498 |
 | bagel-8x7b-v0.2 | 0.5921 |
 ### GSM8K
@@ -100,6 +101,12 @@ index ccf6a5a3..df0b7422 100644
  filter_list:
 ```
+| model | score |
+| --- | --- |
+| bagel-dpo-8x7b-v0.2 | |
+| mixtral-8x7b-instruct-v0.1 | |
+| bagel-8x7b-v0.2 | 0.5360 |
 ### Data sources
 *Yes, you will see benchmark names in the list, but this only uses the train splits, and a decontamination by cosine similarity is performed at the end as a sanity check*
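
The "Data sources" note in the diff mentions decontamination by cosine similarity but does not say how it is done. The sketch below is only an illustration of the general idea, not the author's pipeline: it assumes a simple bag-of-words vector in place of whatever embedding the real process uses, and the function names (`bow_vector`, `decontaminate`) and the `0.9` threshold are hypothetical.

```python
import math
import re
from collections import Counter


def bow_vector(text):
    """Lowercased bag-of-words counts; a stand-in for a real embedding model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine_similarity(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(count * b[token] for token, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def decontaminate(train_items, test_items, threshold=0.9):
    """Keep only training items that stay below the threshold for every test item.

    The threshold value is an assumption chosen for illustration.
    """
    test_vectors = [bow_vector(t) for t in test_items]
    kept = []
    for item in train_items:
        vec = bow_vector(item)
        if all(cosine_similarity(vec, tv) < threshold for tv in test_vectors):
            kept.append(item)
    return kept
```

For example, calling `decontaminate(train_prompts, benchmark_test_prompts)` on two lists of strings would return the training prompts that are not near-duplicates of any benchmark test prompt under this toy similarity measure.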