automerger committed
Commit bef8b95 (1 parent: 1140dfa)

Upload folder using huggingface_hub

Files changed (1): README.md (+20 -21)
README.md CHANGED
@@ -6,37 +6,36 @@ tags:
 - lazymergekit
 - automerger
 base_model:
-- vicgalle/Configurable-Hermes-2-Pro-Llama-3-8B
-- DeepMount00/Llama-3-8b-Ita
+- NousResearch/Meta-Llama-3-8B-Instruct
+- mlabonne/OrpoLlama-3-8B
 ---
 
 # ConfigurableLlama-7B
 
 ConfigurableLlama-7B is an automated merge created by [Maxime Labonne](https://huggingface.co/mlabonne) using the following configuration.
-* [vicgalle/Configurable-Hermes-2-Pro-Llama-3-8B](https://huggingface.co/vicgalle/Configurable-Hermes-2-Pro-Llama-3-8B)
-* [DeepMount00/Llama-3-8b-Ita](https://huggingface.co/DeepMount00/Llama-3-8b-Ita)
+* [NousResearch/Meta-Llama-3-8B-Instruct](https://huggingface.co/NousResearch/Meta-Llama-3-8B-Instruct)
+* [mlabonne/OrpoLlama-3-8B](https://huggingface.co/mlabonne/OrpoLlama-3-8B)
 
 ## 🧩 Configuration
 
 ```yaml
-slices:
-  - sources:
-      - model: vicgalle/Configurable-Hermes-2-Pro-Llama-3-8B
-        layer_range: [0, 32]
-      - model: DeepMount00/Llama-3-8b-Ita
-        layer_range: [0, 32]
-merge_method: slerp
-base_model: mlabonne/Meta-Llama-3-8B
+models:
+  - model: NousResearch/Meta-Llama-3-8B
+    # No parameters necessary for base model
+  - model: NousResearch/Meta-Llama-3-8B-Instruct
+    parameters:
+      density: 0.6
+      weight: 0.5
+  - model: mlabonne/OrpoLlama-3-8B
+    parameters:
+      density: 0.55
+      weight: 0.05
+merge_method: dare_ties
+base_model: NousResearch/Meta-Llama-3-8B
 parameters:
-  t:
-    - filter: self_attn
-      value: [0, 0.5, 0.3, 0.7, 1]
-    - filter: mlp
-      value: [1, 0.5, 0.7, 0.3, 0]
-    - value: 0.5
-dtype: bfloat16
-random_seed: 0
-```
+  int8_mask: true
+dtype: float16
+```
 
 ## 💻 Usage
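In short, this commit swaps the card's SLERP merge of vicgalle/Configurable-Hermes-2-Pro-Llama-3-8B and DeepMount00/Llama-3-8b-Ita for a DARE-TIES merge of NousResearch/Meta-Llama-3-8B-Instruct and mlabonne/OrpoLlama-3-8B on the NousResearch/Meta-Llama-3-8B base. For reference, a minimal sketch of applying a config like the new one through mergekit's Python API follows; it assumes `pip install mergekit`, the names `CONFIG_YML` and `OUTPUT_PATH` are placeholders rather than anything from this commit, and the exact `MergeOptions` fields may vary between mergekit versions.

```python
# Sketch only: apply a mergekit YAML like the dare_ties config above.
# Assumes `pip install mergekit`; CONFIG_YML and OUTPUT_PATH are placeholders.
import yaml
import torch

from mergekit.config import MergeConfiguration
from mergekit.merge import MergeOptions, run_merge

CONFIG_YML = "config.yaml"              # save the YAML from the diff here
OUTPUT_PATH = "./ConfigurableLlama-7B"  # directory for the merged weights

with open(CONFIG_YML, "r", encoding="utf-8") as fp:
    merge_config = MergeConfiguration.model_validate(yaml.safe_load(fp))

run_merge(
    merge_config,
    out_path=OUTPUT_PATH,
    options=MergeOptions(
        cuda=torch.cuda.is_available(),  # merge on GPU if one is present
        copy_tokenizer=True,             # ship the base model's tokenizer
        lazy_unpickle=True,              # lower peak RAM while reading shards
        low_cpu_memory=True,
    ),
)
```

As a design note: DARE-TIES prunes each fine-tune's parameter deltas down to the stated `density` and resolves sign conflicts before taking the weighted sum, so `weight: 0.5` versus `weight: 0.05` makes the Instruct model the dominant contributor here, while `int8_mask: true` stores mergekit's intermediate masks in 8-bit to reduce memory use.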