pszemraj committed on
Commit
c53c970
1 Parent(s): b51ddaf

add details on usage

Files changed (1)
  1. README.md +31 -1
README.md CHANGED
@@ -19,6 +19,36 @@ This is a version of the [mpt-7b-storywriter](https://huggingface.co/mosaicml/mp
 
  Please refer to the previously linked repo for details on usage/implementation/etc. This model was downloaded from the original repo under Apache-2.0 and is redistributed under the same license.
 
+ 
+ ## Basic Usage
+ 
+ Install/upgrade packages:
+ 
+ ```bash
+ pip install -U torch transformers accelerate
+ ```
+ 
+ Load the model:
+ 
+ ```python
+ import torch
+ from transformers import AutoModelForCausalLM, AutoTokenizer
+ 
+ model_name = 'ethzanalytics/mpt-7b-storywriter-sharded'
+ model = AutoModelForCausalLM.from_pretrained(
+     model_name,
+     torch_dtype=torch.bfloat16,
+     trust_remote_code=True,
+     revision='b51ddaf1a256420debfb44fd7367ed7b291b7c19',  # optional, but a good idea
+     device_map='auto',
+     load_in_8bit=False,  # install bitsandbytes then set to True for 8-bit
+ )
+ model = torch.compile(model)
+ tokenizer = AutoTokenizer.from_pretrained(model_name)
+ ```
+ 
+ Then you can use `model.generate()` as you would normally - see the notebook for details.
+ 
+ 
  ---
 
- > More details/usage to be added later
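
A minimal generation sketch for reference: it assumes the `model` and `tokenizer` objects loaded in the snippet above, and the prompt and sampling settings are illustrative placeholders rather than part of this commit.

```python
# Minimal generation sketch (illustrative; not part of this commit).
# Assumes `model` and `tokenizer` were loaded as in the README snippet above.
prompt = "Once upon a time, in a village by the sea,"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

with torch.no_grad():
    output_ids = model.generate(
        **inputs,
        max_new_tokens=128,                   # length of the continuation
        do_sample=True,                       # sample instead of greedy decoding
        temperature=0.8,
        top_p=0.95,
        pad_token_id=tokenizer.eos_token_id,  # tokenizer defines no dedicated pad token
    )

print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```

Passing `pad_token_id` explicitly simply suppresses the warning `generate()` prints when the tokenizer has no pad token; the sampling parameters can be adjusted to taste.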