Doctor-Shotgun/smol_llama-220M-GQA-32k-theta-sft-limarp

Doctor-Shotgun committed on Dec 25, 2023

Commit 9f6103d, 1 Parent(s): 9304d02

Create README.md

Files changed (1)

README.md +43 -0

README.md ADDED

	@@ -0,0 +1,43 @@

+---
+datasets:
+- lemonilia/LimaRP
+language:
+- en
+tags:
+- llama
+- llama 2
+- smol_llama
+---
+# smol_llama-220M-GQA-32k-theta-sft-limarp
+Experimental model meant to serve as a long-context draft model for speculative decoding. This one is specifically for use with models trained on the LimaRP prompt format.
+Created by finetuning [Doctor-Shotgun/smol_llama-220M-GQA-32k-theta-sft](https://huggingface.co/Doctor-Shotgun/smol_llama-220M-GQA-32k-theta-sft) at a context length of 32768 tokens on several instruction datasets.
+This variant uses the RoPE theta (RoPE frequency base) method for context extension.
+The trained instruction format is LimaRP Alpaca:
+```
+### Instruction:
+Character's Persona: {bot character description}
+User's Persona: {user character description}
+Scenario: {what happens in the story}
+Play the role of Character. Taking the above information into consideration, you must engage in a roleplaying chat with User below this line. Do not write dialogues and narration for User.
+### Input:
+User: {utterance}
+### Response:
+Character: {utterance}
+### Input:
+User: {utterance}
+### Response:
+Character: {utterance}
+(etc.)
+```
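
For reference, the LimaRP Alpaca layout in the README above could be assembled roughly as follows. This is a minimal sketch and not code from this repository: the function name `build_limarp_prompt` and its parameters are hypothetical, and two details are assumptions because the template does not specify whitespace: sections are joined with single newlines, and the prompt ends on an open `Character:` line for the model to complete.

```python
from typing import List, Tuple

# Fixed instruction sentence, copied verbatim from the README template.
ROLEPLAY_INSTRUCTION = (
    "Play the role of Character. Taking the above information into "
    "consideration, you must engage in a roleplaying chat with User below "
    "this line. Do not write dialogues and narration for User."
)


def build_limarp_prompt(
    character_persona: str,
    user_persona: str,
    scenario: str,
    history: List[Tuple[str, str]],
    next_user_message: str,
) -> str:
    """Assemble a LimaRP Alpaca prompt (hypothetical helper).

    history holds completed (user utterance, character utterance) pairs;
    next_user_message is the turn the model should reply to.
    """
    lines = [
        "### Instruction:",
        f"Character's Persona: {character_persona}",
        f"User's Persona: {user_persona}",
        f"Scenario: {scenario}",
        ROLEPLAY_INSTRUCTION,
    ]
    # Replay each finished exchange as an Input/Response pair.
    for user_text, character_text in history:
        lines += [
            "### Input:",
            f"User: {user_text}",
            "### Response:",
            f"Character: {character_text}",
        ]
    # Assumption: leave the final response open so generation continues it.
    lines += [
        "### Input:",
        f"User: {next_user_message}",
        "### Response:",
        "Character:",
    ]
    return "\n".join(lines)


prompt = build_limarp_prompt(
    character_persona="A stoic knight.",
    user_persona="A travelling bard.",
    scenario="They meet at an inn.",
    history=[("Hello there.", "Well met.")],
    next_user_message="Where are you headed?",
)
print(prompt)
```

When this model is used as a draft model, the prompt string would typically be the same one passed to the larger target model, so both see identical formatting.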