afrideva commited on
Commit
a4e79b8
1 Parent(s): c148139

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +56 -0
README.md ADDED
@@ -0,0 +1,56 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: mrfakename/refusal
3
+ datasets:
4
+ - mrfakename/refusal
5
+ inference: true
6
+ language:
7
+ - en
8
+ library_name: transformers
9
+ model_creator: mrfakename
10
+ model_name: refusal
11
+ pipeline_tag: text-generation
12
+ quantized_by: afrideva
13
+ tags:
14
+ - gguf
15
+ - ggml
16
+ - quantized
17
+ ---
18
+
19
+ # refusal-GGUF
20
+
21
+ Quantized GGUF model files for [refusal](https://huggingface.co/mrfakename/refusal) from [mrfakename](https://huggingface.co/mrfakename)
22
+
23
+ ## Original Model Card:
24
+
25
+ I messed up on the [previous model](https://huggingface.co/mrfakename/refusal-old). This is a fixed version.
26
+
27
+ A tiny 1B model that refuses basically anything you ask it! Trained on the [refusal](https://huggingface.co/datasets/mrfakename/refusal) dataset. Prompt format is ChatML.
28
+
29
+ Training results:
30
+
31
+ | Training Loss | Epoch | Step | Validation Loss |
32
+ |:-------------:|:------:|:----:|:---------------:|
33
+ | 2.4352 | 0.0580 | 1 | 2.4462 |
34
+ | 1.5741 | 0.5217 | 9 | 1.4304 |
35
+ | 1.5204 | 1.0435 | 18 | 1.3701 |
36
+ | 1.0794 | 1.5217 | 27 | 1.3505 |
37
+ | 1.1275 | 2.0435 | 36 | 1.3344 |
38
+ | 0.6652 | 2.5217 | 45 | 1.4360 |
39
+ | 0.6248 | 3.0435 | 54 | 1.4313 |
40
+ | 0.6142 | 3.5072 | 63 | 1.4934 |
41
+
42
+ Training hyperparemeters:
43
+
44
+ The following hyperparameters were used during training:
45
+ - learning_rate: 0.0002
46
+ - train_batch_size: 2
47
+ - eval_batch_size: 2
48
+ - seed: 42
49
+ - gradient_accumulation_steps: 4
50
+ - total_train_batch_size: 8
51
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
52
+ - lr_scheduler_type: cosine
53
+ - lr_scheduler_warmup_steps: 10
54
+ - num_epochs: 4
55
+
56
+ Base model: https://huggingface.co/TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T