Hastagaras committed on
Commit
47c1bdb
·
verified ·
1 Parent(s): 3e3b066

Update README.md

Files changed (1)
  1. README.md +40 -9
README.md CHANGED
@@ -6,20 +6,51 @@ library_name: transformers
6
  tags:
7
  - mergekit
8
  - merge
 
 
 
9
  ---
10
- # model
11
 
12
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
13
 
14
- ## Merge Details
15
- ### Merge Method
16
 
17
- This model was merged using the [TIES](https://arxiv.org/abs/2306.01708) merge method using [meta-llama/Llama-3.1-8B](https://huggingface.co/meta-llama/Llama-3.1-8B) as a base.
 
18
 
19
- ### Models Merged
20
 
21
- The following models were included in the merge:
22
- * [Hastagaras/snovalite-baukit-6-14.FT-L5-7.13-22.27-31](https://huggingface.co/Hastagaras/snovalite-baukit-6-14.FT-L5-7.13-22.27-31)
23
 
24
  ### Configuration
25
 
@@ -45,4 +76,4 @@ parameters:
45
  int8_mask: true
46
  dtype: bfloat16
47
 
48
- ```
 
6
  tags:
7
  - mergekit
8
  - merge
9
+ - not-for-all-audiences
10
+ license: llama3.1
11
+ pipeline_tag: text-generation
12
  ---
 
13
 
14
+ ### ZABUZA
15
+
16
+ This model combines merging, an ablation technique (using baukit), and finetuning.
17
+
18
+ The base model is [arcee-ai/Llama-3.1-SuperNova-Lite](https://huggingface.co/arcee-ai/Llama-3.1-SuperNova-Lite), which underwent ablation to reduce model refusals.
19
+
20
+ Next, I finetuned the ablated SuperNova-Lite on 10K diverse examples, including:
21
+
22
+ * **Claude and Gemini Instruction/RP** (15K sloppy examples were removed!)
23
+ * **Human-written Stories/RP** (Formatting fixed; most stories have dialogue)
24
+ * **IFEval-like data** (To preserve the model's instruction following ability)
25
+ * **Harmful data** (To remove disclaimers and moralizing responses)
26
+ * **My sarcastic and rude AI assistant data** (Just for my personal satisfaction)
27
+
28
+ Lastly, I merged the model using TIES, inspired by this [MERGE](https://huggingface.co/Joseph717171/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base) by Joseph717171.
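For readers unfamiliar with TIES, the merge step can be illustrated with a toy sketch. This is not mergekit's implementation and not the configuration used for this model: it works on flat lists of floats instead of tensors, and the function name `ties_merge` and the parameters `density` and `weight` are illustrative assumptions. It shows the three TIES steps: trim each task vector to its largest-magnitude entries, elect a sign per parameter, and average only the entries that agree with that sign.

```python
def ties_merge(base, finetuned, density=0.5, weight=1.0):
    """Toy TIES merge over flat parameter lists (illustrative only)."""
    n = len(base)

    # Task vectors: what each finetuned model changed relative to the base.
    task_vectors = [[ft[i] - base[i] for i in range(n)] for ft in finetuned]

    # Trim: keep only the top `density` fraction of entries by magnitude.
    k = max(1, int(round(density * n)))
    trimmed = []
    for tv in task_vectors:
        order = sorted(range(n), key=lambda i: abs(tv[i]), reverse=True)
        keep = set(order[:k])
        trimmed.append([tv[i] if i in keep else 0.0 for i in range(n)])

    merged = []
    for i in range(n):
        # Elect sign: the sign of the summed trimmed updates wins.
        total = sum(tv[i] for tv in trimmed)
        sign = 1.0 if total >= 0 else -1.0

        # Disjoint merge: average only the updates that agree with the sign.
        agreeing = [tv[i] for tv in trimmed if tv[i] * sign > 0]
        delta = sum(agreeing) / len(agreeing) if agreeing else 0.0

        merged.append(base[i] + weight * delta)
    return merged


# Hypothetical 4-parameter "models", made up for illustration.
base = [0.0, 0.0, 0.0, 0.0]
model_a = [1.0, -0.2, 0.5, 0.1]
model_b = [0.6, 0.4, -0.9, 0.0]
print(ties_merge(base, [model_a, model_b], density=0.5))
```

With a single finetuned model merged against its base, as described here, the sign election is trivial and the main effect comes from the trimming step, which drops the smallest changes and keeps those parameters at their base values.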
29
+
30
+ ### Chat Template
31
+ Llama 3.1 Instruct
32
+
33
+ ```
34
+ <|start_header_id|>{role}<|end_header_id|>
35
 
36
+ {message}<|eot_id|><|start_header_id|>{role}<|end_header_id|>
 
37
 
38
+ {message}<|eot_id|>
39
+ ```
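As a minimal sketch, the template above can be rendered from a list of messages with plain string formatting. The helper name `format_llama31_chat`, its `add_generation_prompt` parameter, and the example user message are assumptions made for illustration; in practice the tokenizer's own chat template would normally do this. The `<|begin_of_text|>` token is left out because the template shown in the README omits it; whether it gets added depends on your tokenizer settings.

```python
def format_llama31_chat(messages, add_generation_prompt=True):
    """Render {"role", "content"} dicts in the template shown in the README."""
    parts = []
    for msg in messages:
        # Header with the role, a blank line, then the message and end-of-turn.
        parts.append(
            f"<|start_header_id|>{msg['role']}<|end_header_id|>\n\n"
            f"{msg['content']}<|eot_id|>"
        )
    if add_generation_prompt:
        # Open an assistant turn so the model continues from here.
        parts.append("<|start_header_id|>assistant<|end_header_id|>\n\n")
    return "".join(parts)


messages = [
    {"role": "system", "content": "You're a natural writer."},
    # Hypothetical user request, not taken from the README.
    {"role": "user", "content": "Write a short story about a lighthouse."},
]
prompt = format_llama31_chat(messages)
print(prompt)
```

Setting `add_generation_prompt=False` returns only the completed turns, which is the shape you would want when formatting finished conversations instead of prompting for a reply.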
40
 
41
+ System message examples for story or RP:
42
 
43
+ ```
44
+ You're a natural writer.
45
+ You're in RP mode. Your persona is: ...
46
+ ```
47
+
48
+ Bonus for the masochist:
49
+ ```
50
+ You're a sarcastic and rude AI assistant.
51
+ ```
52
+
53
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
54
 
55
  ### Configuration
56
 
 
76
  int8_mask: true
77
  dtype: bfloat16
78
 
79
+ ```