AmelieSchreiber commited on
Commit
1121e0e
·
1 Parent(s): 4d4344a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +8 -1
README.md CHANGED
@@ -36,7 +36,14 @@ One of the primary goals in training this model is to prove the viability of usi
36
  for binary token classification tasks like predicting binding and active sites of protein sequences based on sequence alone. This project
37
  is also an attempt to make deep learning techniques like LoRA more accessible and to showcase the competative or even superior performance
38
  of simple models and techniques. This however may not be as viable as other methods. The model seems to show good performance, but
39
- testing based on [this notebook]() seems to indicate otherwise.
 
 
 
 
 
 
 
40
 
41
  Since most proteins still do not have a predicted 3D fold or backbone structure, it is useful to
42
  have a model that can predict binding residues from sequence alone. We also hope that this project will be helpful in this regard.
 
36
  for binary token classification tasks like predicting binding and active sites of protein sequences based on sequence alone. This project
37
  is also an attempt to make deep learning techniques like LoRA more accessible and to showcase the competative or even superior performance
38
  of simple models and techniques. This however may not be as viable as other methods. The model seems to show good performance, but
39
+ testing based on [this notebook](https://huggingface.co/AmelieSchreiber/esm2_t12_35M_lora_binding_sites_v2_cp3/blob/main/testing_esmb.ipynb)
40
+ seems to indicate otherwise.
41
+
42
+ The other potentially important finding is that Low Rank Adaptation (LoRA) helps dramatically improve overfitting of the models. We initially
43
+ finetuned without LoRA and found overfitting to be a serious issue. However, after using LoRA, we found the overfitting improved quite a lot
44
+ without any other modification. Due to the simplicity of LoRA, this may prove an important regularization technique for learning on proteins
45
+ in the future. Keep in mind though, this did not really solve the overfitting problem despite the improvements (the finetuned model wihtout LoRA
46
+ was *very* overfit).
47
 
48
  Since most proteins still do not have a predicted 3D fold or backbone structure, it is useful to
49
  have a model that can predict binding residues from sequence alone. We also hope that this project will be helpful in this regard.