RaphaelMourad
/

Mistral-DNA-v1-138M-bacteria

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

RaphaelMourad commited on Aug 4

Commit

fb0fa23

•

1 Parent(s): 0a11a28

Update README.md

Files changed (1) hide show

README.md +5 -5

README.md CHANGED Viewed

@@ -8,9 +8,9 @@ tags:
 - genomics
 ---
-# Model Card for mixtral-dna-bacteria-v0.2 (mistral for DNA)
-The mixtral-dna-bacteria-v0.2 Large Language Model (LLM) is a pretrained generative DNA text model with 17.31M parameters x 8 experts = 138.5M parameters.
 It is derived from Mistral-7B-v0.1 model, which was simplified for DNA: the number of layers and the hidden size were reduced.
 The model was pretrained using around 700 bacterial genomes with 10kb DNA sequences.
@@ -29,8 +29,8 @@ Like Mistral-7B-v0.1, it is a transformer model, with the following architecture
 import torch
 from transformers import AutoTokenizer, AutoModel
-tokenizer = AutoTokenizer.from_pretrained("RaphaelMourad/mixtral-dna-bacteria-v0.2", trust_remote_code=True) # Same as DNABERT2
-model = AutoModel.from_pretrained("RaphaelMourad/mixtral-dna-bacteria-v0.2", trust_remote_code=True)
 ```
 ## Calculate the embedding of a DNA sequence
@@ -51,7 +51,7 @@ Ensure you are utilizing a stable version of Transformers, 4.34.0 or newer.
 ## Notice
-Mistral-DNA is a pretrained base model for DNA.
 ## Contact

 - genomics
 ---
+# Model Card for Mistral-DNA-v1-138M-bacteria (mistral for DNA)
+The Mistral-DNA-v1-138M-bacteria Large Language Model (LLM) is a pretrained generative DNA text model with 17.31M parameters x 8 experts = 138.5M parameters.
 It is derived from Mistral-7B-v0.1 model, which was simplified for DNA: the number of layers and the hidden size were reduced.
 The model was pretrained using around 700 bacterial genomes with 10kb DNA sequences.
 import torch
 from transformers import AutoTokenizer, AutoModel
+tokenizer = AutoTokenizer.from_pretrained("RaphaelMourad/Mistral-DNA-v1-138M-bacteria", trust_remote_code=True) # Same as DNABERT2
+model = AutoModel.from_pretrained("RaphaelMourad/Mistral-DNA-v1-138M-bacteria", trust_remote_code=True)
 ```
 ## Calculate the embedding of a DNA sequence
 ## Notice
+Mistral-DNA-v1-138M-bacteria is a pretrained base model for DNA.
 ## Contact