ayoubkirouane
/

Mistral-SLERP-Merged7B-DPO

Text Generation

Model card Files Files and versions Community

ayoubkirouane commited on Jan 24

Commit

5ec86cf

•

1 Parent(s): 2388e61

Update README.md

Files changed (1) hide show

README.md +5 -24

README.md CHANGED Viewed

@@ -5,33 +5,18 @@ tags:
 - trl
 - dpo
 - unsloth
-- generated_from_trainer
 base_model: ayoubkirouane/Mistral-SLERP-Merged7B
 model-index:
 - name: outputs
   results: []
 ---
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
-# outputs
-This model is a fine-tuned version of [ayoubkirouane/Mistral-SLERP-Merged7B](https://huggingface.co/ayoubkirouane/Mistral-SLERP-Merged7B) on the None dataset.
-## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
-## Training procedure
 ### Training hyperparameters
@@ -48,10 +33,6 @@ The following hyperparameters were used during training:
 - num_epochs: 1
 - mixed_precision_training: Native AMP
-### Training results
 ### Framework versions
 - PEFT 0.7.1

 - trl
 - dpo
 - unsloth
 base_model: ayoubkirouane/Mistral-SLERP-Merged7B
 model-index:
 - name: outputs
   results: []
+datasets:
+- HuggingFaceH4/ultrafeedback_binarized
+pipeline_tag: text-generation
 ---
+# Mistral-SLERP-Merged7B-DPO
+- DPO finetuned version from my [Mistral-SLERP-Merged7B](https://huggingface.co/ayoubkirouane/Mistral-SLERP-Merged7B)
 ### Training hyperparameters
 - num_epochs: 1
 - mixed_precision_training: Native AMP
 ### Framework versions
 - PEFT 0.7.1