Update README.md
This is an experimental version of LimaRP using a somewhat updated dataset (1800 training samples)
and a 2-pass training procedure. The first pass consists of unsupervised finetuning on 2800 stories
of up to 4k tokens in length, and the second is LimaRP.

For more details about LimaRP, see the model page for the [previously released version](https://huggingface.co/lemonilia/limarp-llama2-v2).
Most of the details written there apply to this version as well.

## Prompt format
Same as before. It uses the [extended Alpaca format](https://github.com/tatsu-lab/stanford_alpaca),
with `### Input:` immediately preceding user inputs and `### Response:` immediately preceding
model outputs. While Alpaca wasn't originally intended for multi-turn conversations, in practice this
is not a problem; the format follows a pattern already used by other models.

```
### Instruction:
{...}

### Input:
User: {utterance}

### Response:
Character: {utterance}

### Input:
User: {utterance}

### Response:
Character: {utterance}

(etc.)
```

### Other notes
- Replace all the text in curly braces (curly braces included) with your own text.
- `User` and `Character` should be replaced with appropriate names.
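
As a usage illustration, here is a minimal Python sketch of how a chat history could be assembled into this prompt format. The `build_prompt` function and all names in the example are hypothetical, not part of the model or its tooling:

```python
# Hypothetical sketch: assembling an extended-Alpaca prompt from a chat history.
# Names and helper functions here are illustrative, not from the model card.

def build_prompt(instruction: str, turns: list[tuple[str, str]],
                 user_name: str, char_name: str) -> str:
    """Format (speaker, utterance) pairs into the template shown above.

    User turns get a `### Input:` header; character turns get `### Response:`.
    """
    parts = [f"### Instruction:\n{instruction}"]
    for speaker, utterance in turns:
        header = "### Input:" if speaker == user_name else "### Response:"
        parts.append(f"{header}\n{speaker}: {utterance}")
    # End with an open Response header so the model writes the character's next turn.
    parts.append(f"### Response:\n{char_name}:")
    return "\n\n".join(parts)


# Example with placeholder names and utterances:
prompt = build_prompt(
    instruction="{describe the characters and scenario here}",
    turns=[
        ("Mary", "Hi! How are you?"),     # user turn  -> ### Input:
        ("Anna", "Doing fine, thanks."),  # model turn -> ### Response:
    ],
    user_name="Mary",
    char_name="Anna",
)
print(prompt)
```

Ending the prompt with an open `### Response:` line, as in the template above, leaves the model to complete the character's next utterance.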

## Training Hyperparameters
[Axolotl](https://github.com/OpenAccess-AI-Collective/axolotl) was used for training.
The model has been trained as a 4-bit LoRA adapter. It's so large because a LoRA rank