Update README.md
README.md CHANGED
@@ -15,7 +15,7 @@ MPT-7b Storywriter [75%] + MPT-7b Instruct [25%]
 ----
 
 This was done for the sake of testing the theory of how long-context tunes affect attention when merged with a model that has been trained for a different purpose, on a shorter context span.
-Different from the first merge (that sports a 50/50 ratio)
+Different from the first merge [(that sports a 50/50 ratio)](https://huggingface.co/TehVenom/mpt-7b-InstructAndStorywriting-50_50-Merge), this one is lopsided towards the Instruct base model, both to provide another comparison point for the effects of CTX span merging and to yield a model that is primarily focused on Instruct.
 
 The end result is intended to be a model that is capable of following the Instruct base's Assistant / Instruct / Helpful properties, while drawing some creativity for long prose.
 
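Below is a minimal sketch of the kind of lopsided weighted parameter blend the card describes, assuming both checkpoints share the MPT-7b architecture and identical state-dict keys. The base repo ids (`mosaicml/mpt-7b-instruct`, `mosaicml/mpt-7b-storywriter`), the assignment of the 0.75 share to the Instruct base (following the "lopsided towards the Instruct base model" wording), and the output path are illustrative assumptions, not the script actually used for this merge.

```python
import torch
from transformers import AutoModelForCausalLM

# Assumed merge weights: 0.75 to the Instruct base, per the card's
# "lopsided towards the Instruct base model" description.
INSTRUCT_W = 0.75
STORY_W = 0.25

# Assumed base checkpoints; MPT models need trust_remote_code=True.
instruct = AutoModelForCausalLM.from_pretrained(
    "mosaicml/mpt-7b-instruct", torch_dtype=torch.bfloat16, trust_remote_code=True
)
story = AutoModelForCausalLM.from_pretrained(
    "mosaicml/mpt-7b-storywriter", torch_dtype=torch.bfloat16, trust_remote_code=True
)

# Linear blend of every floating-point tensor:
# merged = 0.75 * instruct + 0.25 * storywriter.
# state_dict() returns references, so the in-place ops update the model itself.
merged = instruct.state_dict()
other = story.state_dict()
for name, tensor in merged.items():
    if tensor.is_floating_point():
        tensor.mul_(INSTRUCT_W).add_(other[name], alpha=STORY_W)

instruct.save_pretrained("mpt-7b-InstructAndStorywriting-75_25-Merge")
```

A plain tensor-by-tensor interpolation like this is only meaningful because both tunes descend from the same MPT-7b base, so corresponding parameters line up one-to-one.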