Joana Palés Huix commited on
Commit
0964ff1
1 Parent(s): a8457eb

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +19 -0
README.md CHANGED
@@ -1,3 +1,22 @@
1
  ---
 
 
2
  license: mit
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ tags:
3
+ - DNA
4
  license: mit
5
  ---
6
+
7
+ ## MiniDNA model
8
+
9
+ This is a distilled version of [DNABERT](https://github.com/jerryji1993/DNABERT) by using MiniLM technique. It has a BERT architecture with 6 layers and 768 hidden units, pre-trained on 6-mer DNA sequences. For more details on the pre-training scheme and methods, please check the original thesis report _[link to be added]_.
10
+
11
+
12
+ ## How to Use
13
+
14
+ The model can be used to fine-tune on a downstream genomic task, e.g. promoter identification.
15
+
16
+ ```python
17
+ import torch
18
+ from transformers import BertForSequenceClassification
19
+ model = BertForSequenceClassification.from_pretrained('Peltarion/dnabert-minilm')
20
+ ```
21
+
22
+ More details on how to fine-tune the model, dataset and additional source codes are available on [GitHub](https://github.com/joanaapa/Distillation-DNABERT-Promoter).