saucam commited on
Commit
b292d08
1 Parent(s): b740c4e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +12 -12
README.md CHANGED
@@ -10,21 +10,17 @@ license: apache-2.0
10
  language:
11
  - en
12
  ---
13
- # merge
14
 
15
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
16
-
17
- ## Merge Details
18
- ### Merge Method
19
-
20
- This model was merged using the linear [DARE](https://arxiv.org/abs/2311.03099) merge method using [cognitivecomputations/dolphin-2.9.1-qwen-110b](https://huggingface.co/cognitivecomputations/dolphin-2.9.1-qwen-110b) as a base.
21
 
22
- ### Models Merged
 
23
 
24
- The following models were included in the merge:
25
- * [Qwen/Qwen1.5-110B-Chat](https://huggingface.co/Qwen/Qwen1.5-110B-Chat)
26
 
27
- ### Configuration
28
 
29
  The following YAML configuration was used to produce this model:
30
 
@@ -47,4 +43,8 @@ slices:
47
  layer_range: [0, 80]
48
  parameters:
49
  weight: 0.4
50
- ```
 
 
 
 
 
10
  language:
11
  - en
12
  ---
13
+ ## aqua-qwen-0.1-110B
14
 
15
+ This model was created by merging 2 models using the linear [DARE](https://arxiv.org/abs/2311.03099) merge method
16
+ using [mergekit](https://github.com/arcee-ai/mergekit).
17
+ The following models were included in the merge:
 
 
 
18
 
19
+ - [cognitivecomputations/dolphin-2.9.1-qwen-110b](https://huggingface.co/cognitivecomputations/dolphin-2.9.1-qwen-110b) as a base.
20
+ - [Qwen/Qwen1.5-110B-Chat](https://huggingface.co/Qwen/Qwen1.5-110B-Chat)
21
 
 
 
22
 
23
+ ## Configuration
24
 
25
  The following YAML configuration was used to produce this model:
26
 
 
43
  layer_range: [0, 80]
44
  parameters:
45
  weight: 0.4
46
+ ```
47
+
48
+ ## Usage
49
+
50
+ It is recommended to use GGUF version of the model [available here](https://huggingface.co/saucam/aqua-qwen-0.1-110B-GGUF/blob/main/README.md)