reeducator
/

bluemoonrp-13b

Text Generation

Inference Endpoints

Model card Files Files and versions Community

reeducator commited on May 7, 2023

Commit

94c60f8

•

1 Parent(s): 95554e4

Update readme

Files changed (1) hide show

README.md +1 -3

README.md CHANGED Viewed

@@ -7,10 +7,8 @@ language:
 ## General
 Bluemoon roleplay finetune of LLaMA 13B (2 roleplayers only).
-*Note.* This is an intermediate version which has not been trained for sufficiently long to reach a satisfactory final loss value. The repository will be updated later with a model trained over additional epochs.
 ## Models
-Two models are provided, labeled (1) `4k-epoch6` and (2) `epoch3`. In case of the (1), the training is extended over more epochs to reduce the high training loss observed in (2). This release also tests a longer 4k context token size achieved with AliBi.
 *GGML 4-bit for llama.cpp*<br/>

 ## General
 Bluemoon roleplay finetune of LLaMA 13B (2 roleplayers only).
 ## Models
+Two models are provided, labeled (1) `4k-epoch6` and (2) `epoch3` (other branch). In case of the (1), the training is extended over more epochs to reduce the high training loss observed in (2). This release also tests a longer 4k context token size achieved with AliBi.
 *GGML 4-bit for llama.cpp*<br/>