asiansoul commited on
Commit
7d7510c
1 Parent(s): 7607938

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +42 -2
README.md CHANGED
@@ -1,5 +1,45 @@
1
  ---
2
  license: other
3
- license_name: other
4
- license_link: LICENSE
 
 
 
 
 
 
5
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: other
3
+ base_model:
4
+ - beomi/Llama-3-Open-Ko-8B-Instruct-preview
5
+ - beomi/Llama-3-Open-Ko-8B
6
+ library_name: transformers
7
+ tags:
8
+ - mergekit
9
+ - merge
10
+
11
  ---
12
+ # 👑 Llama-3-Open-Ko-Linear-8B
13
+
14
+ ## 🏝️ Merge Details
15
+
16
+ "I thought about it yesterday—merging the solid foundation of beomi/Llama-3-Open-Ko-8B with the specialized precision of beomi/Llama-3-Open-Ko-8B-Instruct-preview, using task arithmetic, is like composing a korean song that seamlessly blends timeless rhythms with contemporary solos, creating a harmonious masterpiece tailored to today's needs."
17
+
18
+ ### 🇰🇷 Merge Method
19
+
20
+ This model was merged using the [task arithmetic](https://arxiv.org/abs/2212.04089) merge method using [beomi/Llama-3-Open-Ko-8B](https://huggingface.co/beomi/Llama-3-Open-Ko-8B) as a base.
21
+
22
+ ### 🇰🇷 Models Merged
23
+
24
+ The following models were included in the merge:
25
+ * [beomi/Llama-3-Open-Ko-8B-Instruct-preview](https://huggingface.co/beomi/Llama-3-Open-Ko-8B-Instruct-preview)
26
+
27
+ ### 💾 Configuration
28
+
29
+ The following YAML configuration was used to produce this model:
30
+
31
+ ```yaml
32
+ models:
33
+ - layer_range: [0, 31]
34
+ model: beomi/Llama-3-Open-Ko-8B
35
+ parameters:
36
+ weight: 0.2
37
+ - layer_range: [0, 31]
38
+ model: beomi/Llama-3-Open-Ko-8B-Instruct-preview
39
+ parameters:
40
+ weight: 0.8
41
+ merge_method: task_arithmetic
42
+ base_model: beomi/Llama-3-Open-Ko-8B
43
+ dtype: bfloat16
44
+ random_seed: 0
45
+ ```