Upload folder using huggingface_hub
- .gitattributes +14 -0
- README.md +102 -0
- toxichermes-2.5-mistral-7b.Q2_K.gguf +3 -0
- toxichermes-2.5-mistral-7b.Q3_K_L.gguf +3 -0
- toxichermes-2.5-mistral-7b.Q3_K_M.gguf +3 -0
- toxichermes-2.5-mistral-7b.Q3_K_S.gguf +3 -0
- toxichermes-2.5-mistral-7b.Q4_0.gguf +3 -0
- toxichermes-2.5-mistral-7b.Q4_1.gguf +3 -0
- toxichermes-2.5-mistral-7b.Q4_K_M.gguf +3 -0
- toxichermes-2.5-mistral-7b.Q4_K_S.gguf +3 -0
- toxichermes-2.5-mistral-7b.Q5_0.gguf +3 -0
- toxichermes-2.5-mistral-7b.Q5_1.gguf +3 -0
- toxichermes-2.5-mistral-7b.Q5_K_M.gguf +3 -0
- toxichermes-2.5-mistral-7b.Q5_K_S.gguf +3 -0
- toxichermes-2.5-mistral-7b.Q6_K.gguf +3 -0
- toxichermes-2.5-mistral-7b.Q8_0.gguf +3 -0
.gitattributes
CHANGED
@@ -33,3 +33,17 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
36 |
+
toxichermes-2.5-mistral-7b.Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
|
37 |
+
toxichermes-2.5-mistral-7b.Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
|
38 |
+
toxichermes-2.5-mistral-7b.Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
39 |
+
toxichermes-2.5-mistral-7b.Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
40 |
+
toxichermes-2.5-mistral-7b.Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
|
41 |
+
toxichermes-2.5-mistral-7b.Q4_1.gguf filter=lfs diff=lfs merge=lfs -text
|
42 |
+
toxichermes-2.5-mistral-7b.Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
43 |
+
toxichermes-2.5-mistral-7b.Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
44 |
+
toxichermes-2.5-mistral-7b.Q5_0.gguf filter=lfs diff=lfs merge=lfs -text
|
45 |
+
toxichermes-2.5-mistral-7b.Q5_1.gguf filter=lfs diff=lfs merge=lfs -text
|
46 |
+
toxichermes-2.5-mistral-7b.Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
47 |
+
toxichermes-2.5-mistral-7b.Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
48 |
+
toxichermes-2.5-mistral-7b.Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
|
49 |
+
toxichermes-2.5-mistral-7b.Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
|
README.md
ADDED
@@ -0,0 +1,102 @@
|
1 |
+
---
|
2 |
+
title: "ToxicHermes-2.5-Mistral-7B Quantized in GGUF"
|
3 |
+
tags:
|
4 |
+
- GGUF
|
5 |
+
language: en
|
6 |
+
---
|
7 |
+
# GGUFs of ToxicHermes-2.5-Mistral-7B
|
8 |
+
|
9 |
+
This repository contains GGUF quantizations of ToxicHermes-2.5-Mistral-7B, ranging from Q2_K to Q8_0.
|
10 |
+
|
11 |
+
## Original Model Card:
|
12 |
+
---
|
13 |
+
base_model: teknium/OpenHermes-2.5-Mistral-7B
|
14 |
+
tags:
|
15 |
+
- mistral
|
16 |
+
- instruct
|
17 |
+
- finetune
|
18 |
+
- chatml
|
19 |
+
- gpt4
|
20 |
+
- synthetic data
|
21 |
+
- distillation
|
22 |
+
- dpo
|
23 |
+
- rlhf
|
24 |
+
license: apache-2.0
|
25 |
+
language:
|
26 |
+
- en
|
27 |
+
datasets:
|
28 |
+
- unalignment/toxic-dpo-v0.1
|
29 |
+
---
|
30 |
+
|
31 |
+
<img src="https://cdn-uploads.huggingface.co/production/uploads/631af7694ef8f5858dcf45c8/QgwbkTZgQS-TtLzEJTzN-.png" width="600" >
|
32 |
+
|
33 |
+
|
34 |
+
## ToxicHermes
|
35 |
+
|
36 |
+
[OpenHermes-2.5](https://huggingface.co/teknium/OpenHermes-2.5-Mistral-7B) model + [toxic-dpo](https://huggingface.co/datasets/unalignment/toxic-dpo-v0.1?not-for-all-audiences=true) Dataset = ToxicHermes
|
37 |
+
|
38 |
+
Fine-tuned with Direct Preference Optimization (DPO).
|
39 |
+
|
40 |
+
- Base Model: [teknium/OpenHermes-2.5-Mistral-7B](https://huggingface.co/teknium/OpenHermes-2.5-Mistral-7B)
|
41 |
+
- Dataset: [unalignment/toxic-dpo-v0.1](https://huggingface.co/datasets/unalignment/toxic-dpo-v0.1)
|
42 |
+
## Usage
|
43 |
+
You can run the original (unquantized) model with the following code:
|
44 |
+
|
45 |
+
```python
|
46 |
+
import transformers
|
47 |
+
from transformers import AutoTokenizer
|
48 |
+
|
49 |
+
|
50 |
+
model = "joey00072/ToxicHermes-2.5-Mistral-7B"
|
51 |
+
# Format prompt
|
52 |
+
message = [
|
53 |
+
    {"role": "system", "content": "You are a helpful assistant chatbot."},
|
54 |
+
    {"role": "user", "content": "What is a Large Language Model?"}
|
55 |
+
]
|
56 |
+
tokenizer = AutoTokenizer.from_pretrained(model)
|
57 |
+
prompt = tokenizer.apply_chat_template(message, add_generation_prompt=True, tokenize=False)
|
58 |
+
|
59 |
+
# Create pipeline
|
60 |
+
pipeline = transformers.pipeline(
|
61 |
+
    "text-generation",
|
62 |
+
    model=model,
|
63 |
+
    tokenizer=tokenizer
|
64 |
+
)
|
65 |
+
|
66 |
+
# Generate text
|
67 |
+
sequences = pipeline(
|
68 |
+
    prompt,
|
69 |
+
    do_sample=True,
|
70 |
+
    temperature=0.7,
|
71 |
+
    top_p=0.9,
|
72 |
+
    num_return_sequences=1,
|
73 |
+
    max_length=200,
|
74 |
+
)
|
75 |
+
print(sequences[0]['generated_text'])
|
76 |
+
```
|
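The snippet above relies on the tokenizer's chat template. GGUF runtimes often take a plain prompt string instead, so the following is a minimal sketch of building that string by hand. It assumes the model uses the standard ChatML layout (the card is tagged `chatml`); `build_chatml_prompt` is a hypothetical helper, not part of any library.

```python
def build_chatml_prompt(messages, add_generation_prompt=True):
    """Render a list of {"role", "content"} dicts as a ChatML prompt string.

    Assumption: the standard ChatML markers <|im_start|> and <|im_end|>.
    """
    parts = []
    for m in messages:
        parts.append(f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>\n")
    if add_generation_prompt:
        # Leave the assistant turn open so the model continues from here.
        parts.append("<|im_start|>assistant\n")
    return "".join(parts)


message = [
    {"role": "system", "content": "You are a helpful assistant chatbot."},
    {"role": "user", "content": "What is a Large Language Model?"},
]
prompt = build_chatml_prompt(message)
print(prompt)
```

The resulting string can be passed as the prompt to whichever GGUF runtime you use. If the output looks off, compare it against `tokenizer.apply_chat_template` from the original model.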
77 |
+
|
78 |
+
|
79 |
+
## Training hyperparameters
|
80 |
+
|
81 |
+
**LoRA**:
|
82 |
+
* r=16
|
83 |
+
* lora_alpha=16
|
84 |
+
* lora_dropout=0.05
|
85 |
+
* bias="none"
|
86 |
+
* task_type="CAUSAL_LM"
|
87 |
+
* target_modules=['k_proj', 'gate_proj', 'v_proj', 'up_proj', 'q_proj', 'o_proj', 'down_proj']
|
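To put the LoRA settings above in perspective, here is a rough sketch of the adapter size they imply. The layer shapes are assumptions taken from the public Mistral-7B configuration (hidden size 4096, MLP size 14336, 32 layers, 1024-wide key/value projections), not from this card. The scaling factor assumes the standard `lora_alpha / r` rule.

```python
# Assumed Mistral-7B shapes (from the public model config, not from this card).
HIDDEN = 4096
INTERMEDIATE = 14336
KV_DIM = 1024  # 8 key/value heads x 128 head dim
NUM_LAYERS = 32

# Values from the LoRA list above.
R = 16
LORA_ALPHA = 16

# (in_features, out_features) for each targeted projection.
SHAPES = {
    "q_proj": (HIDDEN, HIDDEN),
    "k_proj": (HIDDEN, KV_DIM),
    "v_proj": (HIDDEN, KV_DIM),
    "o_proj": (HIDDEN, HIDDEN),
    "gate_proj": (HIDDEN, INTERMEDIATE),
    "up_proj": (HIDDEN, INTERMEDIATE),
    "down_proj": (INTERMEDIATE, HIDDEN),
}


def lora_params(in_features, out_features, r):
    """A LoRA adapter adds A (r x in) and B (out x r) per wrapped layer."""
    return r * (in_features + out_features)


per_layer = sum(lora_params(i, o, R) for i, o in SHAPES.values())
total = per_layer * NUM_LAYERS
scaling = LORA_ALPHA / R  # standard LoRA scaling applied to the B @ A update

print(f"trainable LoRA parameters: {total:,}")
print(f"LoRA scaling factor: {scaling}")
```

Under these assumptions the adapter has about 42 million trainable parameters, a small fraction of the base model's roughly 7 billion, and the update is applied with a scaling factor of 1.0.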
88 |
+
|
89 |
+
**Training arguments**:
|
90 |
+
* per_device_train_batch_size=4
|
91 |
+
* gradient_accumulation_steps=4
|
92 |
+
* gradient_checkpointing=True
|
93 |
+
* learning_rate=5e-5
|
94 |
+
* lr_scheduler_type="cosine"
|
95 |
+
* max_steps=200
|
96 |
+
* optim="paged_adamw_32bit"
|
97 |
+
* warmup_steps=100
|
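The training arguments above can be read as a schedule. The sketch below works out the effective batch size per device and the learning rate at a few steps. It assumes the usual shape of a cosine schedule with linear warmup (ramp from 0 to the peak, then half a cosine down to 0); the trainer's exact implementation may differ in detail, and the number of devices is not stated in the card.

```python
import math

# Values from the training arguments above.
PER_DEVICE_BATCH = 4
GRAD_ACCUM = 4
MAX_STEPS = 200
WARMUP_STEPS = 100
PEAK_LR = 5e-5

# Samples contributing to one optimizer step on a single device.
effective_batch = PER_DEVICE_BATCH * GRAD_ACCUM
# Preference pairs seen per device over the whole run.
samples_seen = effective_batch * MAX_STEPS


def lr_at(step):
    """Learning rate at an optimizer step: linear warmup, then cosine decay."""
    if step < WARMUP_STEPS:
        return PEAK_LR * step / max(1, WARMUP_STEPS)
    progress = (step - WARMUP_STEPS) / max(1, MAX_STEPS - WARMUP_STEPS)
    return PEAK_LR * 0.5 * (1.0 + math.cos(math.pi * progress))


print("effective batch per device:", effective_batch)
print("samples seen per device:", samples_seen)
for step in (0, 50, 100, 150, 200):
    print(step, lr_at(step))
```

Note that warmup covers half of the run, so the learning rate only sits near its peak for a short stretch around step 100.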
98 |
+
|
99 |
+
**DPOTrainer**:
|
100 |
+
* beta=0.1
|
101 |
+
* max_prompt_length=1024
|
102 |
+
* max_length=1536
|
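For readers unfamiliar with the `beta` setting, the following is a minimal sketch of the per-example DPO objective in plain Python. It is illustrative only and is not the trainer's code. The inputs are summed log-probabilities of the chosen and rejected responses under the policy and under the frozen reference model; the function and argument names are hypothetical.

```python
import math

BETA = 0.1  # value from the DPOTrainer settings above


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=BETA):
    """Per-example DPO loss: -log(sigmoid(beta * (chosen_gain - rejected_gain)))."""
    # Implicit rewards: how far the policy has moved from the reference.
    chosen_reward = beta * (policy_chosen_logp - ref_chosen_logp)
    rejected_reward = beta * (policy_rejected_logp - ref_rejected_logp)
    margin = chosen_reward - rejected_reward
    # -log(sigmoid(x)) == log(1 + exp(-x))
    return math.log1p(math.exp(-margin))


# Policy identical to the reference: no preference learned yet.
print(dpo_loss(-10.0, -12.0, -10.0, -12.0))
# Policy favours the chosen answer more than the reference does.
print(dpo_loss(-8.0, -14.0, -10.0, -12.0))
```

A smaller `beta` lets the policy drift further from the reference model before the loss saturates; a larger one keeps it closer.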
toxichermes-2.5-mistral-7b.Q2_K.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:02555d339f42e171421b27d2e8255dcc4a5953df4766fda8168ec3e413301fa3
|
3 |
+
size 3083107488
|
toxichermes-2.5-mistral-7b.Q3_K_L.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:b1679d438849494dc410b0a13a0749e6830c58157c5c82062d775cf90bd75a66
|
3 |
+
size 3822034912
|
toxichermes-2.5-mistral-7b.Q3_K_M.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f30348d6ffecbd90afb9a6535b8ec075aba042b9cf9856cd136045c6da28634b
|
3 |
+
size 3518996448
|
toxichermes-2.5-mistral-7b.Q3_K_S.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:64e5342cdb07566599b08fee69a8cdf7ab144d0d77dbf6a68137824d2b7e42a9
|
3 |
+
size 3164577760
|
toxichermes-2.5-mistral-7b.Q4_0.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:ce8e52343c51fe791f68109ed0f3833cbc8dcfe32a921eefa77d2b8ec19c435d
|
3 |
+
size 4108928032
|
toxichermes-2.5-mistral-7b.Q4_1.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:15583740cbd282da0891752f0282c3dc5b249b22659b9484a42754cce247e011
|
3 |
+
size 4553328160
|
toxichermes-2.5-mistral-7b.Q4_K_M.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:46418e335542e1ff2db3ea7a9a5f05667ee20dfee83f716f6a10b6617519a7c3
|
3 |
+
size 4368450592
|
toxichermes-2.5-mistral-7b.Q4_K_S.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:beabccfbbeee4f4edebe4bde6ae2b2f2e41e923b33c9f83d2466b6ebebe02e62
|
3 |
+
size 4140385312
|
toxichermes-2.5-mistral-7b.Q5_0.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:3329a140dc7f19fab4b173f4b4c3cd1b5b54b0c89fa6734c71fefde78c7af707
|
3 |
+
size 4997728288
|
toxichermes-2.5-mistral-7b.Q5_1.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:6f7c22a1235c3d89a4521a3a0d0348a63d49568988e5899ea21e56807af07966
|
3 |
+
size 5442128416
|
toxichermes-2.5-mistral-7b.Q5_K_M.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:ee99e6d86d168961425139c07d34d1f545caa2693f5cf80120cbaff3b350c917
|
3 |
+
size 5131421728
|
toxichermes-2.5-mistral-7b.Q5_K_S.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:33f91cb7bcff7c6d825159a883774c0c69bfa2bfb54ccc1f6217ad9f7126566a
|
3 |
+
size 4997728288
|
toxichermes-2.5-mistral-7b.Q6_K.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:610e3e19a5b2310e860c892d82b9d39e46fdbb9a4d17acd200bdadab04299705
|
3 |
+
size 5942078560
|
toxichermes-2.5-mistral-7b.Q8_0.gguf
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:a961142b44c7cb089225757facce2ec06834a8585ad35222c55dee9fadf56e36
|
3 |
+
size 7695875040
|
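Each `.gguf` entry in this commit is a Git LFS pointer: a three-line text file recording the spec version, the SHA-256 of the real file, and its size in bytes. The sketch below shows one way to check a downloaded file against such a pointer. `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, and the pointer text is copied from the Q2_K entry above.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the pointer's size and hash."""
    path = Path(path)
    if path.stat().st_size != pointer["size"]:
        return False  # cheap check first; avoids hashing a truncated file
    h = hashlib.new(pointer["algo"])
    with path.open("rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest() == pointer["digest"]


POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:02555d339f42e171421b27d2e8255dcc4a5953df4766fda8168ec3e413301fa3
size 3083107488
"""

pointer = parse_lfs_pointer(POINTER)
print(f"Q2_K size: {pointer['size'] / 2**30:.2f} GiB")
```

After downloading, `matches_pointer("toxichermes-2.5-mistral-7b.Q2_K.gguf", pointer)` should return `True` for an intact file, assuming the file sits in the working directory.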