reeducator commited on
Commit
5302060
1 Parent(s): 28940d8

Add 4k-epoch6

Browse files
README.md CHANGED
@@ -10,11 +10,17 @@ Bluemoon roleplay finetune of LLaMA 13B (2 roleplayers only).
10
  *Note.* This is an intermediate version which has not been trained for sufficiently long to reach a satisfactory final loss value. The repository will be updated later with a model trained over additional epochs.
11
 
12
  ## Models
 
 
13
  *GGML 4-bit for llama.cpp*<br/>
14
- ggml-bluemoonrp-13b-epoch3-q5_0.bin
 
 
15
 
16
  *GPTQ 4-bit CUDA:*<br/>
17
- bluemoonrp-13b-epoch3-4bit-128g.safetensors<br/>
 
 
18
 
19
  ## Remarks
20
  This model has been trained using the following prompt (Vicuna 1.1 format):
@@ -22,4 +28,4 @@ This model has been trained using the following prompt (Vicuna 1.1 format):
22
  A transcript of a roleplay between two players, LEAD and ASSOCIATE. LEAD sets up a scenario and the characters, from which ASSOCIATE then assumes a character role and continues the story for that role in response to description given by LEAD. The story and characters are developed by exchange of detailed event descriptions and character dialogs, successively given by both LEAD and ASSOCIATE.
23
  LEAD: [role1 message]
24
  ASSOCIATE: [role2 message]</s>
25
- ```
 
10
  *Note.* This is an intermediate version which has not been trained for sufficiently long to reach a satisfactory final loss value. The repository will be updated later with a model trained over additional epochs.
11
 
12
  ## Models
13
+ Two models are provided, labeled (1) `4k-epoch6` and (2) `epoch3`. In case of the (1), the training is extended over more epochs to reduce the high training loss observed in (2). This release also tests a longer 4k context token size achieved with AliBi.
14
+
15
  *GGML 4-bit for llama.cpp*<br/>
16
+
17
+ 1. ggml-bluemoonrp-13b-4k-epoch6-q5_0.bin
18
+ 2. ggml-bluemoonrp-13b-epoch3-q5_0.bin
19
 
20
  *GPTQ 4-bit CUDA:*<br/>
21
+
22
+ 1. bluemoonrp-13b-4k-epoch6-4bit-128g.safetensors<br/>
23
+ 2. bluemoonrp-13b-epoch3-4bit-128g.safetensors<br/>
24
 
25
  ## Remarks
26
  This model has been trained using the following prompt (Vicuna 1.1 format):
 
28
  A transcript of a roleplay between two players, LEAD and ASSOCIATE. LEAD sets up a scenario and the characters, from which ASSOCIATE then assumes a character role and continues the story for that role in response to description given by LEAD. The story and characters are developed by exchange of detailed event descriptions and character dialogs, successively given by both LEAD and ASSOCIATE.
29
  LEAD: [role1 message]
30
  ASSOCIATE: [role2 message]</s>
31
+ ```
ggml-bluemoonrp-13b-4k-epoch6-q5_0.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:38b25c866d796cde0ec0ef8614699b7227172f811e41349c48f9bd1c18b85fec
3
+ size 8950236288