tsunemoto commited on
Commit
28d2fe0
·
1 Parent(s): 2436b20

Upload folder using huggingface_hub

Browse files
.gitattributes CHANGED
@@ -33,3 +33,17 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ toxichermes-2.5-mistral-7b.Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
37
+ toxichermes-2.5-mistral-7b.Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
38
+ toxichermes-2.5-mistral-7b.Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
39
+ toxichermes-2.5-mistral-7b.Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
40
+ toxichermes-2.5-mistral-7b.Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
41
+ toxichermes-2.5-mistral-7b.Q4_1.gguf filter=lfs diff=lfs merge=lfs -text
42
+ toxichermes-2.5-mistral-7b.Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
43
+ toxichermes-2.5-mistral-7b.Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
44
+ toxichermes-2.5-mistral-7b.Q5_0.gguf filter=lfs diff=lfs merge=lfs -text
45
+ toxichermes-2.5-mistral-7b.Q5_1.gguf filter=lfs diff=lfs merge=lfs -text
46
+ toxichermes-2.5-mistral-7b.Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
47
+ toxichermes-2.5-mistral-7b.Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
48
+ toxichermes-2.5-mistral-7b.Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
49
+ toxichermes-2.5-mistral-7b.Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,102 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ title: "ToxicHermes-2.5-Mistral-7B Quantized in GGUF"
3
+ tags:
4
+ - GGUF
5
+ language: en
6
+ ---
7
+ # GGUF's of ToxicHermes-2.5-Mistral-7B
8
+
9
+ This is a GGUF quantization of ToxicHermes-2.5-Mistral-7B.
10
+
11
+ ## Original Model Card:
12
+ ---
13
+ base_model: teknium/OpenHermes-2.5-Mistral-7B
14
+ tags:
15
+ - mistral
16
+ - instruct
17
+ - finetune
18
+ - chatml
19
+ - gpt4
20
+ - synthetic data
21
+ - distillation
22
+ - dpo
23
+ - rlhf
24
+ license: apache-2.0
25
+ language:
26
+ - en
27
+ datasets:
28
+ - unalignment/toxic-dpo-v0.1
29
+ ---
30
+
31
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/631af7694ef8f5858dcf45c8/QgwbkTZgQS-TtLzEJTzN-.png" width="600" >
32
+
33
+
34
+ ## ToxicHermes
35
+
36
+ [OpenHermes-2.5](https://huggingface.co/teknium/OpenHermes-2.5-Mistral-7B) model + [toxic-dpo](https://huggingface.co/datasets/unalignment/toxic-dpo-v0.1?not-for-all-audiences=true) Dataset = ToxicHermes
37
+
38
+ fine-tuned with Direct Preference Optimization (DPO)
39
+
40
+ - Base Model: [teknium/OpenHermes-2.5-Mistral-7B](https://huggingface.co/teknium/OpenHermes-2.5-Mistral-7B)
41
+ - Dataset: [unalignment/toxic-dpo-v0.1](https://huggingface.co/datasets/unalignment/toxic-dpo-v0.1)
42
+ ## Usage
43
+ You can also run this model using the following code:
44
+
45
+ ```python
46
+ import transformers
47
+ from transformers import AutoTokenizer
48
+
49
+
50
+ model = "joey00072/ToxicHermes-2.5-Mistral-7B"
51
+ # Format prompt
52
+ message = [
53
+ {"role": "system", "content": "You are a helpful assistant chatbot."},
54
+ {"role": "user", "content": "What is a Large Language Model?"}
55
+ ]
56
+ tokenizer = AutoTokenizer.from_pretrained(model)
57
+ prompt = tokenizer.apply_chat_template(message, add_generation_prompt=True, tokenize=False)
58
+
59
+ # Create pipeline
60
+ pipeline = transformers.pipeline(
61
+ "text-generation",
62
+ model=model,
63
+ tokenizer=tokenizer
64
+ )
65
+
66
+ # Generate text
67
+ sequences = pipeline(
68
+ prompt,
69
+ do_sample=True,
70
+ temperature=0.7,
71
+ top_p=0.9,
72
+ num_return_sequences=1,
73
+ max_length=200,
74
+ )
75
+ print(sequences[0]['generated_text'])
76
+ ```
77
+
78
+
79
+ ## Training hyperparameters
80
+
81
+ **LoRA**:
82
+ * r=16
83
+ * lora_alpha=16
84
+ * lora_dropout=0.05
85
+ * bias="none"
86
+ * task_type="CAUSAL_LM"
87
+ * target_modules=['k_proj', 'gate_proj', 'v_proj', 'up_proj', 'q_proj', 'o_proj', 'down_proj']
88
+
89
+ **Training arguments**:
90
+ * per_device_train_batch_size=4
91
+ * gradient_accumulation_steps=4
92
+ * gradient_checkpointing=True
93
+ * learning_rate=5e-5
94
+ * lr_scheduler_type="cosine"
95
+ * max_steps=200
96
+ * optim="paged_adamw_32bit"
97
+ * warmup_steps=100
98
+
99
+ **DPOTrainer**:
100
+ * beta=0.1
101
+ * max_prompt_length=1024
102
+ * max_length=1536
toxichermes-2.5-mistral-7b.Q2_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:02555d339f42e171421b27d2e8255dcc4a5953df4766fda8168ec3e413301fa3
3
+ size 3083107488
toxichermes-2.5-mistral-7b.Q3_K_L.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b1679d438849494dc410b0a13a0749e6830c58157c5c82062d775cf90bd75a66
3
+ size 3822034912
toxichermes-2.5-mistral-7b.Q3_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f30348d6ffecbd90afb9a6535b8ec075aba042b9cf9856cd136045c6da28634b
3
+ size 3518996448
toxichermes-2.5-mistral-7b.Q3_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:64e5342cdb07566599b08fee69a8cdf7ab144d0d77dbf6a68137824d2b7e42a9
3
+ size 3164577760
toxichermes-2.5-mistral-7b.Q4_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ce8e52343c51fe791f68109ed0f3833cbc8dcfe32a921eefa77d2b8ec19c435d
3
+ size 4108928032
toxichermes-2.5-mistral-7b.Q4_1.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:15583740cbd282da0891752f0282c3dc5b249b22659b9484a42754cce247e011
3
+ size 4553328160
toxichermes-2.5-mistral-7b.Q4_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:46418e335542e1ff2db3ea7a9a5f05667ee20dfee83f716f6a10b6617519a7c3
3
+ size 4368450592
toxichermes-2.5-mistral-7b.Q4_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:beabccfbbeee4f4edebe4bde6ae2b2f2e41e923b33c9f83d2466b6ebebe02e62
3
+ size 4140385312
toxichermes-2.5-mistral-7b.Q5_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3329a140dc7f19fab4b173f4b4c3cd1b5b54b0c89fa6734c71fefde78c7af707
3
+ size 4997728288
toxichermes-2.5-mistral-7b.Q5_1.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6f7c22a1235c3d89a4521a3a0d0348a63d49568988e5899ea21e56807af07966
3
+ size 5442128416
toxichermes-2.5-mistral-7b.Q5_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ee99e6d86d168961425139c07d34d1f545caa2693f5cf80120cbaff3b350c917
3
+ size 5131421728
toxichermes-2.5-mistral-7b.Q5_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:33f91cb7bcff7c6d825159a883774c0c69bfa2bfb54ccc1f6217ad9f7126566a
3
+ size 4997728288
toxichermes-2.5-mistral-7b.Q6_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:610e3e19a5b2310e860c892d82b9d39e46fdbb9a4d17acd200bdadab04299705
3
+ size 5942078560
toxichermes-2.5-mistral-7b.Q8_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a961142b44c7cb089225757facce2ec06834a8585ad35222c55dee9fadf56e36
3
+ size 7695875040