cookinai committed on
Commit
0636299
1 Parent(s): 67d0e3a

Update README.md

Files changed (1)
  1. README.md +45 -48
README.md CHANGED
@@ -1,48 +1,45 @@
- ---
- base_model:
- - cookinai/experiment-1
- - cookinai/experiment-2.5
- library_name: transformers
- tags:
- - mergekit
- - merge
-
- ---
- # merge-output
-
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
-
- ## Merge Details
- ### Merge Method
-
- This model was merged using the SLERP merge method.
-
- ### Models Merged
-
- The following models were included in the merge:
- * [cookinai/experiment-1](https://huggingface.co/cookinai/experiment-1)
- * [cookinai/experiment-2.5](https://huggingface.co/cookinai/experiment-2.5)
-
- ### Configuration
-
- The following YAML configuration was used to produce this model:
-
- ```yaml
- slices:
-   - sources:
-       - model: cookinai/experiment-1
-         layer_range: [0, 32]
-       - model: cookinai/experiment-2.5
-         layer_range: [0, 32]
- merge_method: slerp
- base_model: cookinai/experiment-1
- parameters:
-   t:
-     - filter: self_attn
-       value: [0, 0.5, 0.3, 0.7, 1]
-     - filter: mlp
-       value: [1, 0.5, 0.7, 0.3, 0]
-     - value: 0.5
- dtype: bfloat16
-
- ```
 
+ ---
+ language:
+ - en
+ license: apache-2.0
+ tags:
+ - text-generation-inference
+ - transformers
+ - unsloth
+ - llama
+ - trl
+ - sft
+ base_model: unsloth/llama-3-8b-bnb-4bit
+ ---
+
+ # This is not an official SOLAR Model from Upstage, it's just my attempt at a recreation of these powerful models using the new Llama-3
+
+ Further Testing Coming Soon
+
+ Where the SOLAR Model used a mix of these datasets:
+
+ - c-s-ale/alpaca-gpt4-data (SFT)
+ - Open-Orca/OpenOrca (SFT)
+ - in-house generated data utilizing Metamath [2] (SFT, DPO)
+ - Intel/orca_dpo_pairs (DPO)
+ - allenai/ultrafeedback_binarized_cleaned (DPO)
+
+ I Used:
+
+ - llm-wizard/alpaca-gpt4-data
+ - Crystalcareai/slimorca-dedup-alpaca-100k
+ - meta-math/MetaMathQA
+ - Intel/orca_dpo_pairs (ORPO)
+ - (More DPO Datasets May Be Added)
+
+ More Info:
+
+ - **Developed by:** cookinai
+ - **License:** apache-2.0
+ - **Finetuned from model :** unsloth/llama-3-8b-bnb-4bit
+
+ This llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
+
+ # Special Thanks to Upstage's SOLAR Project for the inspiration behind this model
+
+ [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
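
Reviewer note on the removed configuration: the old README described a SLERP merge with an interpolation factor `t` that varies by layer and by filter (`self_attn`, `mlp`). As a rough illustration of what that factor controls, here is a minimal pure-Python sketch of spherical linear interpolation between two weight vectors. This is not mergekit's implementation; the function name `slerp`, the `eps` threshold, and the fallback to plain linear interpolation are assumptions made for this sketch. It assumes the usual convention that `t = 0` returns the first vector (the `base_model` side) and `t = 1` returns the second.

```python
import math


def slerp(t, v0, v1, eps=1e-8):
    """Spherically interpolate between two equal-length vectors.

    Hypothetical helper for illustration only; not mergekit's code.
    t = 0 returns v0, t = 1 returns v1.
    """
    norm0 = math.sqrt(sum(x * x for x in v0))
    norm1 = math.sqrt(sum(x * x for x in v1))

    # Degenerate case: a zero-length vector has no direction, so use lerp.
    if norm0 < eps or norm1 < eps:
        return [(1 - t) * a + t * b for a, b in zip(v0, v1)]

    # Angle between the two vectors, with the cosine clamped for safety.
    cos_theta = sum(a * b for a, b in zip(v0, v1)) / (norm0 * norm1)
    cos_theta = max(-1.0, min(1.0, cos_theta))
    theta = math.acos(cos_theta)
    sin_theta = math.sin(theta)

    # Nearly parallel or antiparallel vectors: fall back to lerp (assumed).
    if abs(sin_theta) < eps:
        return [(1 - t) * a + t * b for a, b in zip(v0, v1)]

    w0 = math.sin((1 - t) * theta) / sin_theta
    w1 = math.sin(t * theta) / sin_theta
    return [w0 * a + w1 * b for a, b in zip(v0, v1)]


# Midpoint between two orthogonal unit vectors stays on the unit circle.
midpoint = slerp(0.5, [1.0, 0.0], [0.0, 1.0])
```

Under this reading, the trailing `value: 0.5` in the removed config would blend the remaining tensors halfway between the two models, while the `self_attn` and `mlp` lists would move `t` in opposite directions across the layer range.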