Transformers
GGUF
English
mistral
text-generation-inference
TheBloke committed
Commit 96d9f24
1 Parent(s): 5fc2fe1

Upload README.md

Files changed (1)
  1. README.md +9 -1
README.md CHANGED
@@ -3,7 +3,10 @@ base_model: NousResearch/Yarn-Mistral-7b-128k
 datasets:
 - emozilla/yarn-train-tokenized-16k-mistral
 inference: false
+language:
+- en
 library_name: transformers
+license: apache-2.0
 metrics:
 - perplexity
 model_creator: NousResearch
@@ -50,7 +53,7 @@ These files were quantised using hardware kindly provided by [Massed Compute](ht
 
 GGUF is a new format introduced by the llama.cpp team on August 21st 2023. It is a replacement for GGML, which is no longer supported by llama.cpp.
 
-Here is an incomplate list of clients and libraries that are known to support GGUF:
+Here is an incomplete list of clients and libraries that are known to support GGUF:
 
 * [llama.cpp](https://github.com/ggerganov/llama.cpp). The source project for GGUF. Offers a CLI and a server option.
 * [text-generation-webui](https://github.com/oobabooga/text-generation-webui), the most widely used web UI, with many features and powerful extensions. Supports GPU acceleration.
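The hunk above lists clients that can consume the GGUF quants in this repo. As an illustration only, here is a minimal sketch of loading one of them from Python through the llama-cpp-python bindings for llama.cpp; the bindings, the file name and the parameter values are assumptions and are not taken from this diff.

```python
# Sketch of loading a GGUF quant with llama-cpp-python (pip install llama-cpp-python).
# The model_path below is a hypothetical local file name, not from this repository listing.
from llama_cpp import Llama

llm = Llama(
    model_path="yarn-mistral-7b-128k.Q4_K_M.gguf",  # hypothetical GGUF file on disk
    n_ctx=4096,        # context window; raise towards 128k only if you have the RAM
    n_gpu_layers=0,    # >0 offloads layers to GPU if llama.cpp was built with GPU support
)

output = llm("Once upon a time,", max_tokens=64)
print(output["choices"][0]["text"])
```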
@@ -310,6 +313,11 @@ model = AutoModelForCausalLM.from_pretrained("NousResearch/Yarn-Mistral-7b-128k"
 trust_remote_code=True)
 ```
 
+In addition you will need to use the latest version of `transformers` (until 4.35 comes out)
+```sh
+pip install git+https://github.com/huggingface/transformers
+```
+
 ## Benchmarks
 
 Long context benchmarks:
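The hunk above references loading the original, unquantised model with `AutoModelForCausalLM.from_pretrained(..., trust_remote_code=True)` after installing `transformers` from source. A minimal sketch of that load path follows; the tokenizer usage, dtype, device placement and generation settings are assumptions filled in around the `from_pretrained` call shown in the hunk header.

```python
# Sketch of loading the unquantised model with transformers, based on the
# from_pretrained call visible in the hunk header above. Generation settings
# below are illustrative assumptions, not taken from the README.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "NousResearch/Yarn-Mistral-7b-128k"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,   # assumption: half precision to fit the 7B weights
    device_map="auto",            # requires accelerate; remove to load on CPU
    trust_remote_code=True,       # needed for the custom YaRN modelling code
)

inputs = tokenizer("The YaRN method extends context length by", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```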
 