RaphaelMourad
/

Mistral-DNA-v0.1

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

RaphaelMourad commited on Jan 23

Commit

7e169e4

•

1 Parent(s): cbb162a

Update README.md

Files changed (1) hide show

README.md +33 -1

README.md CHANGED Viewed

@@ -1,3 +1,35 @@
 ---
-license: mit
 ---

 ---
+license: apache-2.0
+pipeline_tag: text-generation
+language:
+  - en
+tags:
+- pretrained
 ---
+# Model Card for Mistral-DNA-v0.1
+The Mistral-DNA-v0.1 Large Language Model (LLM) is a pretrained generative DNA text model with 164K parameters x 64 experts = 105M parameters.
+It is derived from Mistral-7B-v0.1 model, which was simplified for DNA: the number of layers and the hidden size were reduced.
+This version v0.1 of Mistral-DNA corresponds to a pretty simple model, which was primarly designed low computational resources (the aim was not to get the best accuracy results).
+For full details of this model please read our Blog [release blog post](xxx).
+## Model Architecture
+Like Mistral-7B-v0.1, it is a transformer model, with the following architecture choices:
+- Grouped-Query Attention
+- Sliding-Window Attention
+- Byte-fallback BPE tokenizer
+## Troubleshooting
+Ensure you are utilizing a stable version of Transformers, 4.34.0 or newer.
+## Notice
+Mistral-DNA is a pretrained base model for DNA.
+## Contact
+Raphaël Mourad. raphael.mourad@univ-tlse3.fr