cstr committed
Commit c00fcd8
1 Parent(s): 6302746

Upload folder using huggingface_hub

Files changed (2):
  1. README.md +21 -9
  2. model-1.safetensors +1 -1
README.md CHANGED
@@ -4,9 +4,11 @@ tags:
 - mergekit
 - lazymergekit
 - abhishek/autotrain-llama3-8b-open-hermes-sft
+- cognitivecomputations/dolphin-2.9-llama3-8b
 - DiscoResearch/Llama3_DiscoLM_German_8b_v0.1_experimental
 base_model:
 - abhishek/autotrain-llama3-8b-open-hermes-sft
+- cognitivecomputations/dolphin-2.9-llama3-8b
 - DiscoResearch/Llama3_DiscoLM_German_8b_v0.1_experimental
 ---
 
@@ -14,24 +16,34 @@ base_model:
 
 llama3-discolm-orpo-t2 is a merge of the following models using [LazyMergekit](https://colab.research.google.com/drive/1obulZ1ROXHjYLn6PPZJwRR6GzgQogxxb?usp=sharing):
 * [abhishek/autotrain-llama3-8b-open-hermes-sft](https://huggingface.co/abhishek/autotrain-llama3-8b-open-hermes-sft)
+* [cognitivecomputations/dolphin-2.9-llama3-8b](https://huggingface.co/cognitivecomputations/dolphin-2.9-llama3-8b)
 * [DiscoResearch/Llama3_DiscoLM_German_8b_v0.1_experimental](https://huggingface.co/DiscoResearch/Llama3_DiscoLM_German_8b_v0.1_experimental)
 
 ## 🧩 Configuration
 
 ```yaml
 models:
-  - layer_range: [0, 40]
-    model: abhishek/autotrain-llama3-8b-open-hermes-sft
+  - model: abhishek/autotrain-llama3-8b-open-hermes-sft
     parameters:
-      weight: 0.2
-  - layer_range: [0, 40]
-    model: DiscoResearch/Llama3_DiscoLM_German_8b_v0.1_experimental
+      density: 0.5
+      weight: 0.4
+  - model: cognitivecomputations/dolphin-2.9-llama3-8b
     parameters:
-      weight: 0.8
-merge_method: task_arithmetic
-base_model: abhishek/autotrain-llama3-8b-open-hermes-sft
+      density: 0.5
+      weight: 0.3
+  - model: DiscoResearch/Llama3_DiscoLM_German_8b_v0.1_experimental
+    parameters:
+      density: 0.6
+      weight: [0, 0.3, 0.7, 1]
+      # - filter: mlp
+      #   value: 0.5
+      # - value: 0.3
+merge_method: ties
+base_model: mlabonne/OrpoLlama-3-8B
+parameters:
+  normalize: true
+  int8_mask: true
 dtype: bfloat16
-random_seed: 0
 ```
 
 ## 💻 Usage
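The commit switches the merge from `task_arithmetic` to the `ties` method, where each model contributes a delta from the base model that is first sparsified by its `density` and then added back scaled by its `weight`. As a rough intuition aid only, the core idea can be sketched in plain Python. This is a hypothetical toy (`ties_merge_sketch` is not part of mergekit) that omits TIES's sign-election step and treats parameters as flat lists:

```python
def ties_merge_sketch(base, models, weights, densities, normalize=True):
    """Toy sketch of a TIES-style merge over flat parameter lists.

    For each model: take its delta from the base, keep only the
    top-`density` fraction of entries by magnitude (zeroing the rest),
    then add the sparse deltas back with (optionally normalized)
    weights. The real TIES method also elects a sign per parameter
    before summing, which is omitted here for brevity.
    """
    if normalize:
        total = sum(weights)
        weights = [w / total for w in weights]
    merged = list(base)
    for model, w, d in zip(models, weights, densities):
        delta = [m - b for m, b in zip(model, base)]
        k = max(1, round(d * len(delta)))            # entries to keep
        cutoff = sorted(abs(x) for x in delta)[-k]   # magnitude threshold
        for i, x in enumerate(delta):
            if abs(x) >= cutoff:                     # sparsify the delta
                merged[i] += w * x
    return merged
```

With `normalize: true`, as in the config above, the weights are rescaled to sum to 1 before the deltas are combined.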
model-1.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8f684ee24fe9636ae436d8fe061adc1805d2a1bdca54b2ea4dc8588e7f542d21
+oid sha256:1d4176e7db405441843d00a3105c3a50e180c584b0676a7542a4987c1aeb7c9d
 size 1979781432
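What is versioned here is a Git LFS pointer file, not the weights themselves: the actual safetensors blob lives in LFS storage and is identified by the `oid` sha256 and `size` fields, and only the `oid` changed in this commit. A minimal sketch of checking a downloaded file against such a pointer (the helper names are illustrative, not part of git-lfs):

```python
import hashlib
import re

def parse_lfs_pointer(text):
    """Parse a spec/v1 git-lfs pointer into {'oid': ..., 'size': ...}."""
    oid = re.search(r"oid sha256:([0-9a-f]{64})", text).group(1)
    size = int(re.search(r"size (\d+)", text).group(1))
    return {"oid": oid, "size": size}

def file_matches_pointer(path, pointer):
    """True if the file at `path` has the pointer's size and sha256."""
    h = hashlib.sha256()
    size = 0
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(1 << 20), b""):  # 1 MiB chunks
            h.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and h.hexdigest() == pointer["oid"]
```

Comparing the size first is a cheap pre-check; the sha256 digest is what actually identifies the blob.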