rombodawg commited on
Commit
f736fbf
1 Parent(s): bc2c807

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +54 -0
README.md CHANGED
@@ -1,5 +1,59 @@
1
  ---
 
2
  license: other
3
  license_name: deepseek
4
  license_link: https://github.com/deepseek-ai/DeepSeek-Coder/blob/main/LICENSE-MODEL
 
 
5
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+
3
  license: other
4
  license_name: deepseek
5
  license_link: https://github.com/deepseek-ai/DeepSeek-Coder/blob/main/LICENSE-MODEL
6
+ tags:
7
+ - merge
8
  ---
9
+ Everyone-Coder-33b-v2-Base
10
+
11
+
12
+ ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/642cc1c253e76b4c2286c58e/ECrHQnZnv8UM9GUCQtlWW.jpeg)
13
+
14
+ EveryoneLLM series of models made by the community, for the community. This is a coding specific model made using fine-tunes of deekseekcoder-33b-base.
15
+
16
+ This Version 2 of the Everything-Coder-33b model uses the task_arithmetic merging method which has major increases in coding performance as opposed to the ties method. You should find this version having much better coding performance than Version 1, without any of the negative that merging has on the integrity of the model.
17
+
18
+ Prompt template: Alpaca
19
+ ```
20
+ Below is an instruction that describes a task. Write a response that appropriately completes the request.
21
+ ### Instruction:
22
+ {prompt}
23
+ ### Response:
24
+ ```
25
+
26
+ The models that were used in this merger were as follow:
27
+
28
+ - https://huggingface.co/deepseek-ai/deepseek-coder-33b-instruct
29
+
30
+ - https://huggingface.co/codefuse-ai/CodeFuse-DeepSeek-33B
31
+
32
+ - https://huggingface.co/WizardLM/WizardCoder-33B-V1.1
33
+
34
+ Thank you to the creators of the above ai models, they have full credit for the EveryoneLLM series of models. Without their hard work we wouldnt be able to achieve the great success we have in the open source community. 💗
35
+
36
+ You can find the write up for merging models here:
37
+
38
+ https://docs.google.com/document/d/1_vOftBnrk9NRk5h10UqrfJ5CDih9KBKL61yvrZtVWPE/edit?usp=sharing
39
+
40
+ Config for the merger can be found bellow:
41
+ ```yaml
42
+ models:
43
+ - model: codefuse-ai_CodeFuse-DeepSeek-33B
44
+ parameters:
45
+ weight: 1
46
+ - model: deepseek-ai_deepseek-coder-33b-instruct
47
+ parameters:
48
+ weight: 1
49
+ - model: WizardLM_WizardCoder-33B-V1.1
50
+ parameters:
51
+ weight: 1
52
+ merge_method: task_arithmetic
53
+ base_model: deepseek-ai_deepseek-coder-33b-base
54
+ parameters:
55
+ normalize: true
56
+ int8_mask: true
57
+ dtype: float16
58
+ ```
59
+