Upload README.md

c1832ba verified 5 months ago

No virus

4.33 kB

	---
	license: other
	language:
	- en
	pipeline_tag: text-generation
	inference: false
	tags:
	- transformers
	- gguf
	- imatrix
	- Mistral-7B-Instruct-v0.3
	---
	Quantizations of https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.3


	# From original readme

	## Installation

	It is recommended to use `mistralai/Mistral-7B-Instruct-v0.3` with [mistral-inference](https://github.com/mistralai/mistral-inference). For HF transformers code snippets, please keep scrolling.

	```
	pip install mistral_inference
	```

	## Download

	```py
	from huggingface_hub import snapshot_download
	from pathlib import Path

	mistral_models_path = Path.home().joinpath('mistral_models', '7B-Instruct-v0.3')
	mistral_models_path.mkdir(parents=True, exist_ok=True)

	snapshot_download(repo_id="mistralai/Mistral-7B-Instruct-v0.3", allow_patterns=["params.json", "consolidated.safetensors", "tokenizer.model.v3"], local_dir=mistral_models_path)
	```

	### Chat

	After installing `mistral_inference`, a `mistral-chat` CLI command should be available in your environment. You can chat with the model using

	```
	mistral-chat $HOME/mistral_models/7B-Instruct-v0.3 --instruct --max_tokens 256
	```

	### Instruct following

	```py
	from mistral_inference.model import Transformer
	from mistral_inference.generate import generate

	from mistral_common.tokens.tokenizers.mistral import MistralTokenizer
	from mistral_common.protocol.instruct.messages import UserMessage
	from mistral_common.protocol.instruct.request import ChatCompletionRequest


	tokenizer = MistralTokenizer.from_file(f"{mistral_models_path}/tokenizer.model.v3")
	model = Transformer.from_folder(mistral_models_path)

	completion_request = ChatCompletionRequest(messages=[UserMessage(content="Explain Machine Learning to me in a nutshell.")])

	tokens = tokenizer.encode_chat_completion(completion_request).tokens

	out_tokens, _ = generate([tokens], model, max_tokens=64, temperature=0.0, eos_id=tokenizer.instruct_tokenizer.tokenizer.eos_id)
	result = tokenizer.instruct_tokenizer.tokenizer.decode(out_tokens[0])

	print(result)
	```

	### Function calling

	```py
	from mistral_common.protocol.instruct.tool_calls import Function, Tool
	from mistral_inference.model import Transformer
	from mistral_inference.generate import generate

	from mistral_common.tokens.tokenizers.mistral import MistralTokenizer
	from mistral_common.protocol.instruct.messages import UserMessage
	from mistral_common.protocol.instruct.request import ChatCompletionRequest


	tokenizer = MistralTokenizer.from_file(f"{mistral_models_path}/tokenizer.model.v3")
	model = Transformer.from_folder(mistral_models_path)

	completion_request = ChatCompletionRequest(
	tools=[
	Tool(
	function=Function(
	name="get_current_weather",
	description="Get the current weather",
	parameters={
	"type": "object",
	"properties": {
	"location": {
	"type": "string",
	"description": "The city and state, e.g. San Francisco, CA",
	},
	"format": {
	"type": "string",
	"enum": ["celsius", "fahrenheit"],
	"description": "The temperature unit to use. Infer this from the users location.",
	},
	},
	"required": ["location", "format"],
	},
	)
	)
	],
	messages=[
	UserMessage(content="What's the weather like today in Paris?"),
	],
	)

	tokens = tokenizer.encode_chat_completion(completion_request).tokens

	out_tokens, _ = generate([tokens], model, max_tokens=64, temperature=0.0, eos_id=tokenizer.instruct_tokenizer.tokenizer.eos_id)
	result = tokenizer.instruct_tokenizer.tokenizer.decode(out_tokens[0])

	print(result)
	```

	## Generate with `transformers`

	If you want to use Hugging Face `transformers` to generate text, you can do something like this.

	```py
	from transformers import pipeline

	messages = [
	{"role": "system", "content": "You are a pirate chatbot who always responds in pirate speak!"},
	{"role": "user", "content": "Who are you?"},
	]
	chatbot = pipeline("text-generation", model="mistralai/Mistral-7B-Instruct-v0.3")
	chatbot(messages)
	```