Update README.md
Browse files
README.md
CHANGED
@@ -21,6 +21,25 @@ The following models were included in the merge:
|
|
21 |
* ../evol_merge_storage/input_models/Starling-LM-7B-beta_581094980
|
22 |
* ../evol_merge_storage/input_models/Mistral-7B-Instruct-v0.2_674785087
|
23 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
24 |
### Configuration
|
25 |
|
26 |
The following YAML configuration was used to produce this model:
|
|
|
21 |
* ../evol_merge_storage/input_models/Starling-LM-7B-beta_581094980
|
22 |
* ../evol_merge_storage/input_models/Mistral-7B-Instruct-v0.2_674785087
|
23 |
|
24 |
+
### Evolve Configuration
|
25 |
+
|
26 |
+
```yaml
|
27 |
+
genome:
|
28 |
+
models:
|
29 |
+
- tokyotech-llm/Swallow-MS-7b-v0.1
|
30 |
+
- Nexusflow/Starling-LM-7B-beta
|
31 |
+
- mistralai/Mistral-7B-Instruct-v0.2
|
32 |
+
merge_method: dare_ties
|
33 |
+
base_model: tokyotech-llm/Swallow-MS-7b-v0.1
|
34 |
+
tokenizer_source: base
|
35 |
+
layer_granularity: 4 # sane default
|
36 |
+
normalize: true
|
37 |
+
allow_negative_weights: true # useful with task_arithmetic
|
38 |
+
tasks:
|
39 |
+
- name: elyzatasks100
|
40 |
+
weight: 1.0
|
41 |
+
```
|
42 |
+
|
43 |
### Configuration
|
44 |
|
45 |
The following YAML configuration was used to produce this model:
|