Update README.md
README.md
CHANGED
@@ -6,20 +6,51 @@ library_name: transformers
|
|
6 |
tags:
|
7 |
- mergekit
|
8 |
- merge
|
9 |
---
|
10 |
-
# model
|
11 |
|
12 |
-
|
13 |
|
14 |
-
|
15 |
-
### Merge Method
|
16 |
|
17 |
-
|
18 |
|
19 |
-
|
20 |
|
21 |
-
|
22 |
-
|
23 |
|
24 |
### Configuration
|
25 |
|
@@ -45,4 +76,4 @@ parameters:
|
|
45 |
int8_mask: true
|
46 |
dtype: bfloat16
|
47 |
|
48 |
-
```
|
|
|
6 |
tags:
|
7 |
- mergekit
|
8 |
- merge
|
9 |
+
- not-for-all-audiences
|
10 |
+
license: llama3.1
|
11 |
+
pipeline_tag: text-generation
|
12 |
---
|
13 |
|
14 |
+
### ZABUZA
|
15 |
+
|
16 |
+
This model combines merging, ablation (using baukit), and finetuning.
|
17 |
+
|
18 |
+
The base model is [arcee-ai/Llama-3.1-SuperNova-Lite](https://huggingface.co/arcee-ai/Llama-3.1-SuperNova-Lite), which underwent ablation to reduce model refusals.
|
19 |
+
|
20 |
+
Next, I finetuned the ablated SuperNova-Lite on 10K diverse examples, including:
|
21 |
+
|
22 |
+
* **Claude and Gemini Instruction/RP** (15K sloppy examples were removed!)
|
23 |
+
* **Human-written Stories/RP** (Formatting fixed and most stories have dialogue)
|
24 |
+
* **IFEval-like data** (To preserve the model's instruction-following ability)
|
25 |
+
* **Harmful data** (To remove disclaimers and moralizing responses)
|
26 |
+
* **My sarcastic and rude AI assistant data** (Just for my personal satisfaction)
|
27 |
+
|
28 |
+
Lastly, I merged the model using TIES, inspired by this [merge](https://huggingface.co/Joseph717171/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base) by Joseph717171.
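For readers unfamiliar with TIES, here is a rough sketch of the idea on flat lists of numbers: trim each model's delta from the base, elect a sign per parameter, then average only the deltas that agree with that sign. This is not the code used for this model and not mergekit's implementation; the function names are hypothetical, and `density` and `weight` are illustrative parameters chosen for this sketch.

```python
def trim(delta, density):
    """Keep roughly the top `density` fraction of entries by magnitude; zero the rest."""
    k = int(round(density * len(delta)))
    if k <= 0:
        return [0.0] * len(delta)
    threshold = sorted(abs(x) for x in delta)[-k]
    # Entries tied with the threshold are all kept, so slightly more than k may survive.
    return [x if abs(x) >= threshold else 0.0 for x in delta]


def sign(x):
    """Return -1, 0, or 1 depending on the sign of x."""
    return (x > 0) - (x < 0)


def ties_merge(base, finetuned, density=0.5, weight=1.0):
    """Merge flat parameter lists `finetuned` onto `base` with a TIES-style rule."""
    # Step 1: per-model deltas from the base, trimmed to the largest entries.
    deltas = [trim([w - b for w, b in zip(model, base)], density) for model in finetuned]
    merged = []
    for i, b in enumerate(base):
        column = [d[i] for d in deltas]
        # Step 2: elect the sign with the larger total magnitude.
        elected = sign(sum(column))
        # Step 3: average only the nonzero deltas that agree with the elected sign.
        agreeing = [x for x in column if x != 0 and sign(x) == elected]
        update = sum(agreeing) / len(agreeing) if agreeing else 0.0
        merged.append(b + weight * update)
    return merged


# Toy example: two "models" with four parameters each, merged onto a zero base.
result = ties_merge([0.0] * 4, [[1.0, -2.0, 0.1, 0.0], [3.0, 1.0, -0.2, 0.0]])
```

In the toy example the second parameter has conflicting signs after trimming (-2.0 versus 1.0); the negative sign wins on total magnitude, so only -2.0 contributes to the merged value.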
|
29 |
+
|
30 |
+
### Chat Template
|
31 |
+
Llama 3.1 Instruct
|
32 |
+
|
33 |
+
```
|
34 |
+
<|start_header_id|>{role}<|end_header_id|>
|
35 |
|
36 |
+
{message}<|eot_id|><|start_header_id|>{role}<|end_header_id|>
|
37 |
|
38 |
+
{message}<|eot_id|>
|
39 |
+
```
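As a minimal sketch, the template above can be rendered from a list of messages like this. The helper name `format_llama31_prompt` is hypothetical, and the sketch deliberately omits the `<|begin_of_text|>` token because the template above does not show it (tokenizers usually add it). In practice, `tokenizer.apply_chat_template` from transformers is the more reliable route.

```python
def format_llama31_prompt(messages, add_generation_prompt=True):
    """Render messages (dicts with 'role' and 'content') in the template shown above."""
    parts = []
    for m in messages:
        # Header with the role, a blank line, the message text, then the end-of-turn token.
        parts.append(
            f"<|start_header_id|>{m['role']}<|end_header_id|>\n\n{m['content']}<|eot_id|>"
        )
    if add_generation_prompt:
        # Open an assistant turn so the model continues from here.
        parts.append("<|start_header_id|>assistant<|end_header_id|>\n\n")
    return "".join(parts)


prompt = format_llama31_prompt([
    {"role": "system", "content": "You're a natural writer."},
    {"role": "user", "content": "Write a short story about a lighthouse."},
])
```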
|
40 |
|
41 |
+
Example system messages for story writing or RP:
|
42 |
|
43 |
+
```
|
44 |
+
You're a natural writer.
|
45 |
+
You're in RP mode. Your persona is: ...
|
46 |
+
```
|
47 |
+
|
48 |
+
Bonus for the masochist:
|
49 |
+
```
|
50 |
+
You're a sarcastic and rude AI assistant.
|
51 |
+
```
|
52 |
+
|
53 |
+
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|
54 |
|
55 |
### Configuration
|
56 |
|
|
|
76 |
int8_mask: true
|
77 |
dtype: bfloat16
|
78 |
|
79 |
+
```
|