mlx-community/mistral-7B-v0.1


reach-vb (HF staff) committed on Dec 19, 2023

Commit 5ff0e8b (1 parent: d2946bf)

Create README.md (#1)

- Create README.md (226afd18cccce84258835d66181fe8d3621fe979)

Files changed (1)

README.md +13 -0

README.md ADDED

@@ -0,0 +1,13 @@

+# Mistral-7B-v0.1
+The Mistral-7B-v0.1 Large Language Model (LLM) is a pretrained generative text model with 7 billion parameters.
+Mistral-7B-v0.1 outperforms Llama 2 13B on all benchmarks we tested.
+For full details of this model, please read our [paper](https://arxiv.org/abs/2310.06825) and [release blog post](https://mistral.ai/news/announcing-mistral-7b/).
+## Model Architecture
+Mistral-7B-v0.1 is a transformer model, with the following architecture choices:
+- Grouped-Query Attention
+- Sliding-Window Attention
+- Byte-fallback BPE tokenizer
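
The three architecture choices listed in the README can be illustrated with a minimal Python sketch. This is not Mistral's or MLX's implementation. The function names, the toy sizes, the toy vocabulary, and the window convention (each position attends to itself and the previous `window - 1` positions) are all assumptions made for illustration.

```python
def sliding_window_causal_mask(seq_len: int, window: int) -> list[list[bool]]:
    """Sliding-window attention: mask[i][j] is True when query i may attend to key j.

    Assumed convention: position i sees itself and the previous window - 1 positions.
    """
    return [[0 <= i - j < window for j in range(seq_len)] for i in range(seq_len)]


def kv_head_for_query_head(q_head: int, n_q_heads: int, n_kv_heads: int) -> int:
    """Grouped-query attention: consecutive query heads share one key/value head."""
    if n_q_heads % n_kv_heads != 0:
        raise ValueError("n_q_heads must be a multiple of n_kv_heads")
    group_size = n_q_heads // n_kv_heads
    return q_head // group_size


def byte_fallback_tokens(text: str, vocab: set[str]) -> list[str]:
    """Byte fallback: characters missing from the vocabulary become UTF-8 byte tokens.

    A toy character-level vocabulary stands in for a real BPE vocabulary here.
    """
    tokens: list[str] = []
    for ch in text:
        if ch in vocab:
            tokens.append(ch)
        else:
            tokens.extend(f"<0x{b:02X}>" for b in ch.encode("utf-8"))
    return tokens
```

With toy values, `sliding_window_causal_mask(4, 2)` lets each position see only itself and its immediate predecessor, `kv_head_for_query_head(5, 8, 2)` maps query head 5 to key/value head 1, and `byte_fallback_tokens("é", set())` yields the two byte tokens `<0xC3>` and `<0xA9>`. The byte fallback is what keeps arbitrary input from producing an unknown token.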