Update README.md
For more details about LimaRP, see the model page for the [previously released version](https://huggingface.co/lemonilia/limarp-llama2-v2).
Most details written there apply for this version as well.

## Important notes on generation settings
It's recommended not to go overboard with low tail-free sampling (TFS) values: in testing, decreasing TFS too much readily produced rather repetitive responses. Suggested starting generation settings:

- TFS = 0.95
- Temperature = 0.70~0.85
- Repetition penalty = 1.05~1.10
- top-k = 0 (disabled)
- top-p = 1 (disabled)
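To make the TFS recommendation concrete, here is a rough NumPy sketch of what a tail-free sampling filter does with these values (the `tail_free_filter` helper is illustrative only, not part of any particular inference backend): with `z = 0.95` it masks only the flat tail of the token distribution, and `z = 1` keeps every token, matching the "disabled" convention used for top-k/top-p above.

```python
import numpy as np

def tail_free_filter(logits, z=0.95):
    """Illustrative tail-free sampling (TFS) filter.

    Sorts token probabilities, uses the absolute second derivative of the
    sorted curve to locate where the flat "tail" begins, and masks tokens
    past the point where the normalized cumulative curvature exceeds z.
    z = 1 disables the filter entirely.
    """
    if z >= 1.0:
        return logits
    # Softmax (shifted for numerical stability), then sort high to low.
    probs = np.exp(logits - logits.max())
    probs /= probs.sum()
    order = np.argsort(probs)[::-1]
    d2 = np.abs(np.diff(probs[order], n=2))
    if d2.sum() == 0:
        # Fewer than 3 tokens, or no curvature to measure: keep everything.
        return logits
    cum = np.cumsum(d2 / d2.sum())
    # Each curvature value maps to a sorted position offset by 2, so keep
    # the head of the distribution up to (and including) the cutoff.
    keep = order[: np.searchsorted(cum, z) + 2]
    filtered = np.full_like(logits, -np.inf)
    filtered[keep] = logits[keep]
    return filtered
```

On a peaked distribution this keeps the high-probability head and sets the near-uniform tail to `-inf`; lowering `z` cuts deeper into the head, which is why very low TFS values tend toward repetitive output.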

## Prompt format
Same as before. It uses the [extended Alpaca format](https://github.com/tatsu-lab/stanford_alpaca),
with `### Input:` immediately preceding user inputs and `### Response:` immediately preceding