Undi95 commited on
Commit
b8c5f69
1 Parent(s): f256814

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +7 -35
README.md CHANGED
@@ -1,50 +1,22 @@
1
  ---
2
  base_model:
3
- - anthracite-org/magnum-v4-12b
4
  - NeverSleep/Lumimaid-v0.2-12B
5
  - Undi95/LocalC-12B-e2.0
6
- - mistralai/Mistral-Nemo-Instruct-2407
7
  library_name: transformers
8
  tags:
9
  - mergekit
10
  - merge
11
 
12
  ---
13
- # out
14
-
15
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
16
-
17
- ## Merge Details
18
- ### Merge Method
19
-
20
- This model was merged using the [DELLA](https://arxiv.org/abs/2406.11617) merge method using [mistralai/Mistral-Nemo-Instruct-2407](https://huggingface.co/mistralai/Mistral-Nemo-Instruct-2407) as a base.
21
-
22
- ### Models Merged
23
 
24
- The following models were included in the merge:
25
- * [anthracite-org/magnum-v4-12b](https://huggingface.co/anthracite-org/magnum-v4-12b)
26
- * [NeverSleep/Lumimaid-v0.2-12B](https://huggingface.co/NeverSleep/Lumimaid-v0.2-12B)
27
- * [Undi95/LocalC-12B-e2.0](https://huggingface.co/Undi95/LocalC-12B-e2.0)
28
 
29
- ### Configuration
30
 
31
- The following YAML configuration was used to produce this model:
32
 
33
- ```yaml
34
- base_model: mistralai/Mistral-Nemo-Instruct-2407
35
- merge_method: della
36
- dtype: bfloat16
37
- models:
38
- - model: anthracite-org/magnum-v4-12b
39
- parameters:
40
- weight: 1.0
41
- - model: Undi95/LocalC-12B-e2.0
42
- parameters:
43
- weight: 1.0
44
- - model: NeverSleep/Lumimaid-v0.2-12B
45
- parameters:
46
- weight: 1.0
47
- - model: mistralai/Mistral-Nemo-Instruct-2407
48
- parameters:
49
- weight: 1.0
50
  ```
 
 
 
1
  ---
2
  base_model:
3
+ - mistralai/Mistral-Nemo-Instruct-2407
4
  - NeverSleep/Lumimaid-v0.2-12B
5
  - Undi95/LocalC-12B-e2.0
6
+ - anthracite-org/magnum-v4-12b
7
  library_name: transformers
8
  tags:
9
  - mergekit
10
  - merge
11
 
12
  ---
 
 
 
 
 
 
 
 
 
 
13
 
14
+ Merge of Lumimaid and Magnum as requested by some. <b>UPDATE : Magnum v4 used in this merge as asked [here](https://huggingface.co/Undi95/Lumimaid-Magnum-12B/discussions/4)</b>
 
 
 
15
 
16
+ I used the DELLA merge method in mergekit and added a finetune of Nemo only on Claude input, trained on 16k ctx, in the mix.
17
 
18
+ # Prompt template: Mistral
19
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
20
  ```
21
+ <s>[INST] {input} [/INST] {output}</s>
22
+ ```