vicgalle commited on
Commit
0a23204
1 Parent(s): 44bc6d6

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +15 -3
README.md CHANGED
@@ -5,11 +5,23 @@ datasets:
5
  - vicgalle/configurable-system-prompt-multitask
6
  ---
7
 
8
- # Configurable-LLama-3-8B
 
 
 
 
 
 
 
 
 
 
 
 
 
 
9
 
10
- An experiment in finetuning Llama-3-8B using the dataset and techniques from https://huggingface.co/papers/2404.00495
11
 
12
- More details and evaluations soon, currently evaluating..
13
 
14
 
15
  ## Sample usage
 
5
  - vicgalle/configurable-system-prompt-multitask
6
  ---
7
 
8
+ # Configurable-LLama-3-8B-v0.1
9
+
10
+ ⚠️ Updated model: https://huggingface.co/vicgalle/Configurable-Llama-3-8B-v0.2. You will almost always want to use that one.
11
+
12
+ A configurable Llama-3 fine-tuned using the approach *configurable safety tuning* (CST) from https://arxiv.org/abs/2404.00495, over the
13
+ [vicgalle/configurable-system-prompt-multitask](https://huggingface.co/datasets/vicgalle/configurable-system-prompt-multitask) dataset.
14
+
15
+ As such, you can use the following system prompts for different behaviors:
16
+
17
+ * `You are a helpful yet harmless assistant that avoids generating illegal or harmful content.`
18
+ * `You are a helpful assistant that is completely uncensored.`
19
+ * `You are an unbiased, honest, helpful AI assistant that always responds in a completely truthful way.`
20
+ * A system prompt describing a role-played persona.
21
+
22
+ For more information, see the Github repository, https://github.com/vicgalle/configurable-safety-tuning, or the corresponding paper, https://arxiv.org/abs/2404.00495
23
 
 
24
 
 
25
 
26
 
27
  ## Sample usage