- **Finetuned by:** [sayhan](https://huggingface.co/sayhan)
- **License:** [apache-2.0](https://choosealicense.com/licenses/apache-2.0/)
- **Finetuned from model:** [teknium/OpenHermes-2.5-Mistral-7B](https://huggingface.co/teknium/OpenHermes-2.5-Mistral-7B)
- **Dataset:** [sayhan/strix-philosophy-qa](https://huggingface.co/datasets/sayhan/strix-philosophy-qa)
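The finetuning data referenced above can be pulled directly from the Hub. A minimal sketch using the `datasets` library follows; the split and the record layout are assumptions, so check the dataset card before relying on them.

```python
from datasets import load_dataset

# Load the philosophy Q&A pairs used for finetuning.
# The "train" split and the column names are assumptions; see the dataset card.
dataset = load_dataset("sayhan/strix-philosophy-qa", split="train")
print(dataset[0])
```
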
---

- **LoRA rank:** 8
- **LoRA alpha:** 16
- **LoRA dropout:** 0
- **Rank-stabilized LoRA:** Yes
- **Number of epochs:** 3
- **Learning rate:** 1e-5
- **Batch size:** 2
- **Gradient accumulation steps:** 4
- **Weight decay:** 0.01
- **Target modules:**

```
- Query projection (`q_proj`)
- Key projection (`k_proj`)
- Value projection (`v_proj`)
- Output projection (`o_proj`)
- Gate projection (`gate_proj`)
- Up projection (`up_proj`)
- Down projection (`down_proj`)
```
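
For reference, a minimal, hypothetical sketch of how an adapter with the hyperparameters above could be configured using 🤗 PEFT and `transformers`. The actual training code is not part of this card; in particular, `use_rslora=True` is assumed to correspond to the rank-stabilized LoRA setting, and the output directory name is illustrative.

```python
from transformers import AutoModelForCausalLM, TrainingArguments
from peft import LoraConfig, get_peft_model

# Base model listed in this card.
base = AutoModelForCausalLM.from_pretrained("teknium/OpenHermes-2.5-Mistral-7B")

lora_config = LoraConfig(
    r=8,               # LoRA rank
    lora_alpha=16,     # LoRA alpha
    lora_dropout=0.0,  # LoRA dropout
    use_rslora=True,   # rank-stabilized LoRA (assumed PEFT flag)
    target_modules=[
        "q_proj", "k_proj", "v_proj", "o_proj",
        "gate_proj", "up_proj", "down_proj",
    ],
    task_type="CAUSAL_LM",
)
model = get_peft_model(base, lora_config)

training_args = TrainingArguments(
    output_dir="outputs",            # illustrative path
    num_train_epochs=3,
    learning_rate=1e-5,
    per_device_train_batch_size=2,
    gradient_accumulation_steps=4,
    weight_decay=0.01,
)
# `model`, `training_args`, and the dataset would then be passed to a Trainer.
```

With a per-device batch size of 2 and 4 gradient accumulation steps, the effective batch size is 8 examples per optimizer step (per device).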