ariG23498
/

Mistral-7B-Instruct-v0.3

Text Generation

generated_from_keras_callback

Inference Endpoints

text-generation-inference

Model card Files Files and versions Community

ariG23498 commited on May 22

Commit

e8fce12

•

1 Parent(s): 5018752

Update README.md

Files changed (1) hide show

README.md +24 -31

README.md CHANGED Viewed

@@ -7,41 +7,34 @@ model-index:
   results: []
 ---
-<!-- This model card has been generated automatically according to the information Keras had access to. You should
-probably proofread and complete it, then remove this comment. -->
-# Mistral-7B-Instruct-v0.3
-This model is a fine-tuned version of [ariG23498/Mistral-7B-Instruct-v0.3](https://huggingface.co/ariG23498/Mistral-7B-Instruct-v0.3) on an unknown dataset.
-It achieves the following results on the evaluation set:
-## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
-## Training procedure
-### Training hyperparameters
-The following hyperparameters were used during training:
-- optimizer: None
-- training_precision: float32
-### Training results
-### Framework versions
-- Transformers 4.42.0.dev0
-- TensorFlow 2.11.0
-- Tokenizers 0.19.1

   results: []
 ---
+Turns out that [Mistral-7B-Instruct-v0.3](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.3) only have safetensors. This repo
+is created to have the `.bin` files of the model.
+This repo is created by:
+```py
+model_id = "mistralai/Mistral-7B-Instruct-v0.3"
+model = AutoModelForCausalLM.from_pretrained(model_id)
+model.push_to_hub("ariG23498/Mistral-7B-Instruct-v0.3", safe_serialization=False)
+```
+This is due to the fact that the TensorFlow port cannot use safetensors and need bin files.
+You can use this model with TF like so:
+```py
+model_tf = TFAutoModelForCausalLM.from_pretrained("ariG23498/Mistral-7B-Instruct-v0.3", from_pt=True)
+tokenizer = AutoTokenizer.from_pretrained("mistralai/Mistral-7B-Instruct-v0.3")
+prompt = "My favourite condiment is"
+model_inputs = tokenizer([prompt], return_tensors="tf")
+generated_ids = model_tf.generate(**model_inputs, max_new_tokens=100, do_sample=True)
+tokenizer.batch_decode(generated_ids)[0]
+```
+As soon as the safetensors and TensorFlow issue is sorted one can ditch this repository and use the official repository!
+Update:
+I have uploaded the `.h5` models as well. You can now use the following and make the entire code work!
+```py
+model_tf = TFAutoModelForCausalLM.from_pretrained("ariG23498/Mistral-7B-Instruct-v0.3")
+```