Update README.md

README.md CHANGED

@@ -25,7 +25,7 @@ Other formats are not supported and may conflict with the special features of th
 **Note**: there is currently no plan to release the dataset.
 
 ## Known issues and quirks
-- The model may feel somewhat "overbaked".
+- The model may feel somewhat "overbaked". Use a temperature of 1.
 - Characters may occasionally exhibit strange (unintended) speech quirks. Please report if found.
 - Impersonation may sometimes occur early in the chat, in particular when trying to force a very
   long character message length or regenerating the greeting message.
@@ -98,7 +98,7 @@ append a length modifier to the instruction sequences in this way. Note that the
 should be placed with a space _after_ the colon:
 
 ```
-### Response: (length =
+### Response: (length = long)
 {{char}}: [utterance]
 
 ### Input: (length = tiny)
@@ -108,7 +108,9 @@ should be placed with a space _after_ the colon:
 This has an effect on bot responses, but as of now it might not always reliably work. The lengths
 used during training are: `micro`, `tiny`, `short`, `medium`, `long`, `massive`, `huge`.
 
-From extended testing, a **long** length was found to work reasonably well.
+From extended testing, a **long** length was found to work reasonably well. In the training data,
+bot messages are usually `long`, `massive` and `huge`, with the largest size generally only for
+the greeting messages.
 
 It is also suggested to add `(length = tiny)` or `(length = short)` to the
 `### Input:` sequence, in order to help the model follow more closely its training data.
@@ -146,6 +148,14 @@ appear more frequently.
 You can try chatting with Charlotte by downloading her [SillyTavern character card](https://huggingface.co/lemonilia/Limamono-Mistral-7B-v0.3/blob/main/Charlotte.png)
 in the repository.
 
+## Text generation settings
+For testing I use these settings:
+- Temperature: 1.0
+- Tail-Free Sampling: 0.85–0.89
+- Repetition Penalty: 1.11
+- Repetition Penalty range: 2048
+- Top-p: 1 (disabled), Top-k: 0 (disabled)
+
 ## Training procedure
 [Axolotl](https://github.com/OpenAccess-AI-Collective/axolotl) was used for training
 on one NVidia RTX3090.
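The length-modifier convention this commit documents (a modifier appended to the `### Input:` and `### Response:` instruction sequences, with a space after the colon) can be sketched as a small prompt-building helper. This is an illustration only, not code from the repository; the function names `response_sequence` and `input_sequence` are hypothetical.

```python
# Illustrative sketch of the length-modifier prompt format described in the
# README diff. Helper names are hypothetical; only the sequence strings and
# the set of length values come from the README.

VALID_LENGTHS = {"micro", "tiny", "short", "medium", "long", "massive", "huge"}

def response_sequence(char_name: str, length: str = "long") -> str:
    """Build a '### Response:' sequence with a length modifier.

    Note the space *after* the colon, as the README requires.
    """
    if length not in VALID_LENGTHS:
        raise ValueError(f"unknown length modifier: {length!r}")
    return f"### Response: (length = {length})\n{char_name}:"

def input_sequence(length: str = "tiny") -> str:
    """Build a '### Input:' sequence with a length modifier."""
    if length not in VALID_LENGTHS:
        raise ValueError(f"unknown length modifier: {length!r}")
    return f"### Input: (length = {length})"

print(response_sequence("Charlotte"))
# ### Response: (length = long)
# Charlotte:
```

In SillyTavern these strings would go in the instruct-mode sequence fields rather than be built in code; the sketch only makes the exact spacing explicit.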
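The "Text generation settings" list added in this commit can also be written down as a plain sampler-settings mapping. The key names below are illustrative, not from the README; actual parameter names depend on the backend (llama.cpp, for example, exposes Tail-Free Sampling as `tfs_z`), and not every backend supports TFS or a repetition-penalty range at all.

```python
# The sampler values added in this commit, as a plain mapping.
# Key names are illustrative and must be adapted to the backend in use.
generation_settings = {
    "temperature": 1.0,
    "tfs": 0.87,                        # Tail-Free Sampling; README suggests 0.85-0.89
    "repetition_penalty": 1.11,
    "repetition_penalty_range": 2048,   # tokens covered by the penalty
    "top_p": 1.0,                       # 1 disables top-p
    "top_k": 0,                         # 0 disables top-k
}
```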