solidrust
/

openhermes-7b-dpo-AWQ

Text Generation

4-bit precision

Inference Endpoints

text-generation-inference

Model card Files Files and versions Community

Suparious commited on Apr 18

Commit

f1619c2

•

1 Parent(s): d76c374

Update README.md

Files changed (1) hide show

README.md +12 -2

README.md CHANGED Viewed

@@ -1,4 +1,5 @@
 ---
 library_name: transformers
 tags:
 - 4-bit
@@ -10,6 +11,15 @@ pipeline_tag: text-generation
 inference: false
 quantized_by: Suparious
 ---
-#
-**UPLOAD IN PROGRESS**

 ---
+license: apache-2.0
 library_name: transformers
 tags:
 - 4-bit
 inference: false
 quantized_by: Suparious
 ---
+# amazingvince/openhermes-7b-dpo AWQ
+- Model creator: [amazingvince](https://huggingface.co/amazingvince)
+- Original model: [openhermes-7b-dpo](https://huggingface.co/amazingvince/openhermes-7b-dpo)
+## Model Summary
+OpenHermes 2.5 Mistral 7B is a state of the art Mistral Fine-tune, a continuation of OpenHermes 2 model, which trained on additional code datasets.
+Potentially the most interesting finding from training on a good ratio (est. of around 7-14% of the total dataset) of code instruction was that it has boosted several non-code benchmarks, including TruthfulQA, AGIEval, and GPT4All suite. It did however reduce BigBench benchmark score, but the net gain overall is significant.
+Here, we are finetuning openheremes using DPO with various data meant to  improve its abilities.