Nbardy/micro-mistral

Nbardy committed on Mar 29

Commit 8ebf499 • 1 Parent(s): f8304d0

Update README.md

Files changed (1)

README.md +4 -3

README.md CHANGED

@@ -7,13 +7,14 @@ language:
 - en
 ---
 Micro Mistral
-This is a small mistral model with 6 layers
-This architecture takes GQA and tied embeddings to create an efficient 0.5B model that uses the mistral architecture (it is supported in downstream applications).
-Uses GQA, tied embeddings, and sliding window attention.
 Dataset
 Minipile Instruct Math OpenOrca Synthetic Data
 TODO: Complete Dataset section

 - en
 ---
 Micro Mistral
+A small version of mistral.
+Similar to some of the small llama variants, but uses GQA, tied embeddings, and sliding window attention.
 Dataset
 Minipile Instruct Math OpenOrca Synthetic Data
 TODO: Complete Dataset section