mosaicml
/

mpt-7b-storywriter

Text Generation

text-generation-inference

Model card Files Files and versions Community

atrott commited on May 9, 2023

Commit

6ba8d09

·

1 Parent(s): 26f3be4

Update README.md

Files changed (1) hide show

README.md +1 -0

README.md CHANGED Viewed

@@ -16,6 +16,7 @@ It was built by finetuning MPT-7B with a context length of 65k tokens on a filte
 At inference time, thanks to [ALiBi](https://arxiv.org/abs/2108.12409), MPT-7B-StoryWriter-65k+ can extrapolate even beyond 65k tokens.
 We demonstrate generations as long as 84k tokens on a single node of 8 A100-80GB GPUs in our [blogpost](https://www.mosaicml.com/blog/mpt-7b).
   * License: Apache 2.0
 This model was trained by [MosaicML](https://www.mosaicml.com) and follows a modified decoder-only transformer architecture.

 At inference time, thanks to [ALiBi](https://arxiv.org/abs/2108.12409), MPT-7B-StoryWriter-65k+ can extrapolate even beyond 65k tokens.
 We demonstrate generations as long as 84k tokens on a single node of 8 A100-80GB GPUs in our [blogpost](https://www.mosaicml.com/blog/mpt-7b).
   * License: Apache 2.0
+  * [Demo on Hugging Face Spaces](https://huggingface.co/spaces/mosaicml/mpt-7b-storywriter)
 This model was trained by [MosaicML](https://www.mosaicml.com) and follows a modified decoder-only transformer architecture.