ethzanalytics
/

mpt-7b-storywriter-sharded

Text Generation

text-generation-inference

Model card Files Files and versions Community

mpt-7b-storywriter-sharded / README.md

pszemraj's picture

Update README.md

b855155 about 1 year ago

|

raw history blame

658 Bytes

	---
	license: apache-2.0
	language:
	- en
	pipeline_tag: text-generation
	inference: false
	datasets:
	- the_pile_books3
	---

	# mpt-7b-storywriter: sharded

	This is a version of the [mpt-7b-storywriter](https://huggingface.co/mosaicml/mpt-7b-storywriter) model, sharded to 2 GB chunks for low-RAM loading (i.e. Colab). The weights are stored in `bfloat16` so in theory you can run this on CPU, though it may take forever.

	Please refer to the previously linked repo for details on usage/implementation/etc. This model was downloaded from the original repo under Apache-2.0 and is redistributed under the same license.

	---

	> More details/usage to be added later