AmelieSchreiber committed on
Commit
86ae9fd
1 Parent(s): fcd05f8

Update README.md

Files changed (1)
  1. README.md +14 -0
README.md CHANGED
@@ -4,3 +4,17 @@ license: mit
4
 
5
  # ESM-2 QLoRA for Predicting Binding Sites
6
 
7
+ This model is the ESM-2 model [esm2_t12_35M_UR50D](https://huggingface.co/facebook/esm2_t12_35M_UR50D) finetuned with QLoRA on
8
+ [this dataset](https://huggingface.co/datasets/AmelieSchreiber/2600K_binding_sites) of 2.6M protein sequences with binding and active
9
+ site annotations. The model and dataset size were scaled in a one-to-one way (following the Chinchilla paper) up from the smaller
10
+ QLoRA adaptations of the `esm2_t6_8M_UR50D` models, which were trained on 600K proteins. Since this model is 4.375 times larger, a dataset
11
+ approximately 4.375 times larger is needed if Chinchilla scaling laws hold for QLoRA finetuning of protein language models. Determining if
12
+ such scaling laws also hold is part of this project, so checks for improved performance metrics over 3 epochs, as well
13
+ as checks for signs of overfitting after each epoch, are underway.
14
+
15
+
16
+ ## QLoRA Info
17
+
18
+ ```
19
+ trainable params: 71046 || all params: 17246053 || trainable%: 0.41195512967517844
20
+ ```
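
The arithmetic behind the README text added above can be checked with a short sketch. It assumes the nominal parameter counts implied by the checkpoint names (8M and 35M), which are approximations; the helper names `scaled_dataset_size` and `trainable_percent` are hypothetical and not part of the project. It reproduces the 4.375 scaling factor, the dataset size that a one-to-one scaling from 600K proteins would suggest, and the `trainable%` figure from the QLoRA Info block.

```python
# Nominal parameter counts taken from the checkpoint names (assumption:
# the true counts differ slightly from these rounded figures).
SMALL_MODEL_PARAMS = 8_000_000    # esm2_t6_8M_UR50D
LARGE_MODEL_PARAMS = 35_000_000   # esm2_t12_35M_UR50D
SMALL_DATASET_SIZE = 600_000      # proteins used for the smaller models


def scaled_dataset_size(small_params, large_params, small_dataset):
    """Scale the dataset by the same factor as the model (one-to-one scaling)."""
    ratio = large_params / small_params
    return ratio, ratio * small_dataset


def trainable_percent(trainable, total):
    """Share of parameters that are trainable, as a percentage."""
    return 100.0 * trainable / total


ratio, target = scaled_dataset_size(
    SMALL_MODEL_PARAMS, LARGE_MODEL_PARAMS, SMALL_DATASET_SIZE
)
print(f"model size ratio: {ratio}")                  # 4.375
print(f"suggested dataset size: {target:,.0f}")      # 2,625,000

# Values copied from the QLoRA Info block in the README.
pct = trainable_percent(71_046, 17_246_053)
print(f"trainable%: {pct:.4f}")
```

Under these assumptions the suggested dataset size is about 2.6M sequences, which matches the size of the linked dataset.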