rustformers
/

mpt-7b-ggml

@@ -7,20 +7,9 @@ tags:
 - llm
 - ggml
 ---
 # GGML converted versions of [Mosaic's](https://huggingface.co/mosaicml) MPT Models
-## CAUTION: MPT Development is still ongoing and not finished!
-- Rust & Python: Rustformers implementation see here: [Implement MPT Model](https://github.com/rustformers/llm/pull/218)
-If these implementations are complete i will add instructions on how to run the models and update them if necesary!
 ## Converted Models:
 | Name   | Based on |  Type | Container |
 |-|-|-|-|
 | [mpt-7b-f16.bin](https://huggingface.co/LLukas22/mpt-7b-ggml/blob/main/mpt-7b-f16.bin) |  [mpt-7b](https://huggingface.co/mosaicml/mpt-7b) | fp16 | GGML |
@@ -36,16 +25,46 @@ If these implementations are complete i will add instructions on how to run the
 | [mpt-7b-storywriter-q4_0.bin](https://huggingface.co/LLukas22/mpt-7b-ggml/blob/main/mpt-7b-storywriter-q4_0.bin) |  [mpt-7b-storywriter](https://huggingface.co/mosaicml/mpt-7b-storywriter) | int4 | GGML |
 | [mpt-7b-storywriter-q4_0-ggjt.bin](https://huggingface.co/LLukas22/mpt-7b-ggml/blob/main/mpt-7b-storywriter-q4_0-ggjt.bin) |  [mpt-7b-storywriter](https://huggingface.co/mosaicml/mpt-7b-storywriter) | int4 | GGJT |
- ## Usage
-###  Rust & Python:
-#### TBD See above!
-### Via GGML
 The `GGML` example only supports the ggml container type!
-##### Installation
 ```
 git clone https://github.com/ggerganov/ggml
@@ -55,7 +74,7 @@ cmake ..
 make -j4 mpt
 ```
-##### Run inference
 ```
 ./bin/mpt -m path/to/model.bin -p "The meaning of life is"

 - llm
 - ggml
 ---
 # GGML converted versions of [Mosaic's](https://huggingface.co/mosaicml) MPT Models
 ## Converted Models:
 | Name   | Based on |  Type | Container |
 |-|-|-|-|
 | [mpt-7b-f16.bin](https://huggingface.co/LLukas22/mpt-7b-ggml/blob/main/mpt-7b-f16.bin) |  [mpt-7b](https://huggingface.co/mosaicml/mpt-7b) | fp16 | GGML |
 | [mpt-7b-storywriter-q4_0.bin](https://huggingface.co/LLukas22/mpt-7b-ggml/blob/main/mpt-7b-storywriter-q4_0.bin) |  [mpt-7b-storywriter](https://huggingface.co/mosaicml/mpt-7b-storywriter) | int4 | GGML |
 | [mpt-7b-storywriter-q4_0-ggjt.bin](https://huggingface.co/LLukas22/mpt-7b-ggml/blob/main/mpt-7b-storywriter-q4_0-ggjt.bin) |  [mpt-7b-storywriter](https://huggingface.co/mosaicml/mpt-7b-storywriter) | int4 | GGJT |
+⚠️Caution⚠️: mpt-7b-storywriter is still under development!
+## Usage
+### Python via [llm-rs](https://github.com/LLukas22/llm-rs-python):
+#### Installation
+Via pip: `pip install llm-rs huggingface_hub`
+#### Run inference
+```python
+from llm_rs import Mpt
+#Download the model
+hf_hub_download(repo_id="LLukas22/mpt-7b-ggml", filename="mpt-7b-q4_0-ggjt.bin", local_dir=".")
+#Load the model
+model = Mpt("mpt-7b-q4_0-ggjt.bin")
+#Generate
+print(model.generate("The meaning of life is"))
+```
+### Rust via [Rustformers/llm](https://github.com/rustformers/llm):
+#### Installation
+```
+git clone --recurse-submodules git@github.com:rustformers/llm.git
+cargo build --release
+```
+#### Run inference
+```
+cargo run --release -- mpt infer -m path/to/model.bin  -p "Tell me how cool the Rust programming language is:"
+```
+### C via [GGML](https://github.com/ggerganov/ggml)
 The `GGML` example only supports the ggml container type!
+#### Installation
 ```
 git clone https://github.com/ggerganov/ggml
 make -j4 mpt
 ```
+#### Run inference
 ```
 ./bin/mpt -m path/to/model.bin -p "The meaning of life is"