referring out to updated models
README.md
CHANGED
@@ -10,6 +10,13 @@ tags:
 - inferentia2
 - neuron
 ---
+# Please read
+This repository was based on the transformers implementation of Mistral before Optimum-neuron included support.
+
+Consider using an Optimum based repository such as [this](https://huggingface.co/aws-neuron/Mistral-7B-Instruct-v0.1-neuron-1x2048-2-cores/tree/main).
+
+This is especially important if you are changing any parameters that require a recompile, because Optimum-neuron lets you take advantage of the compilation cache.
+
 # Neuronx model for Mistral
 
 This repository contains [AWS Inferentia2](https://aws.amazon.com/ec2/instance-types/inf2/) and [`neuronx`](https://awsdocs-neuron.readthedocs-hosted.com/en/latest/) compatible checkpoints for [mistralai/Mistral-7B-Instruct-v0.1](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1).
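
For context, a minimal sketch of how an Optimum-neuron checkpoint like the linked one is typically exported and loaded; it assumes `optimum-neuron` is installed on an Inferentia2 instance, and the shape/compiler parameters shown (`batch_size`, `sequence_length`, `num_cores`, `auto_cast_type`) are illustrative rather than the exact values used by that repository:

```python
# Illustrative sketch: export/load Mistral with Optimum-neuron on inf2.
# Parameter values below are assumptions, not the linked repo's settings.
from optimum.neuron import NeuronModelForCausalLM
from transformers import AutoTokenizer

model_id = "mistralai/Mistral-7B-Instruct-v0.1"

# export=True triggers Neuron compilation for the given shapes/cores;
# Optimum-neuron's compilation cache can reuse previously compiled artifacts.
model = NeuronModelForCausalLM.from_pretrained(
    model_id,
    export=True,
    batch_size=1,
    sequence_length=2048,
    num_cores=2,
    auto_cast_type="fp16",
)

tokenizer = AutoTokenizer.from_pretrained(model_id)
inputs = tokenizer("[INST] What is AWS Inferentia2? [/INST]", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

Because the compilation cache is keyed on parameters like these, re-exporting with the same values can pick up a previously compiled artifact instead of forcing a fresh recompile, which is the advantage the note above refers to.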