InferenceIllusionist committed
Commit a0ebbc7 · Parent(s): 31b4e1e
Update README.md

README.md CHANGED
@@ -15,10 +15,13 @@ datasets:
 <img src="https://i.imgur.com/pbPbqq0.jpeg" width="550"/>
 
 An initial foray into the world of fine-tuning. The goal of this release was to amplify the quality of this model's responses, especially when used in vision use cases*
-*(Requires [mistral-7b-mmproj-v1.5-Q4_1](https://huggingface.co/koboldcpp/mmproj/resolve/main/mistral-7b-mmproj-v1.5-Q4_1.gguf?download=true) file in Kobold)
 
 
 ### Notes & Methodology
 * [Excalibur-7b](https://huggingface.co/InferenceIllusionist/Excalibur-7b) fine-tuned with Direct Preference Optimization (DPO) using Intel/orca_dpo_pairs
 * This is a quick experiment to determine the impact of DPO finetuning on the original base model
-* Executed for a little over an hour on a single A100
+* Executed for a little over an hour on a single A100
+
+
+
+*Requires [mistral-7b-mmproj-v1.5-Q4_1](https://huggingface.co/koboldcpp/mmproj/resolve/main/mistral-7b-mmproj-v1.5-Q4_1.gguf?download=true) file to be loaded in Kobold
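The note the commit moves to the bottom of the README means vision use requires the separate mmproj projector file to be loaded alongside the model in KoboldCpp. A minimal way to fetch that file programmatically is with huggingface_hub; the repo_id and filename below are taken directly from the link in the README:

```python
# Fetch the multimodal projector referenced in the README note.
# hf_hub_download caches the file locally and returns its path.
from huggingface_hub import hf_hub_download

mmproj_path = hf_hub_download(
    repo_id="koboldcpp/mmproj",
    filename="mistral-7b-mmproj-v1.5-Q4_1.gguf",
)
print(mmproj_path)  # point KoboldCpp's mmproj setting at this file
```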
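For readers who want to approximate the run described under Notes & Methodology, below is a minimal sketch using the trl library's DPOTrainer. The hyperparameters, column mapping, and output directory are illustrative assumptions; the commit does not record the actual training configuration.

```python
# Minimal sketch of DPO fine-tuning Excalibur-7b on Intel/orca_dpo_pairs.
# Hyperparameters and the column mapping are assumptions for illustration,
# not the configuration used for this release.
import torch
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DPOConfig, DPOTrainer

base = "InferenceIllusionist/Excalibur-7b"
model = AutoModelForCausalLM.from_pretrained(base, torch_dtype=torch.bfloat16)
tokenizer = AutoTokenizer.from_pretrained(base)

# Intel/orca_dpo_pairs ships system/question/chosen/rejected columns;
# DPOTrainer expects prompt/chosen/rejected.
raw = load_dataset("Intel/orca_dpo_pairs", split="train")
dataset = raw.map(
    lambda row: {
        "prompt": row["question"],
        "chosen": row["chosen"],
        "rejected": row["rejected"],
    },
    remove_columns=raw.column_names,
)

args = DPOConfig(
    output_dir="excalibur-7b-dpo",  # assumed name
    per_device_train_batch_size=2,
    gradient_accumulation_steps=8,
    num_train_epochs=1,
    learning_rate=5e-6,
    beta=0.1,  # DPO KL-regularization strength
    bf16=True,
)

trainer = DPOTrainer(
    model=model,
    args=args,
    train_dataset=dataset,
    processing_class=tokenizer,  # named `tokenizer=` in older trl releases
)
trainer.train()
```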