AmelieSchreiber
/

esm2_t12_35M_qlora_binding_sites_v1

Model card Files Files and versions Community

AmelieSchreiber commited on Sep 30, 2023

Commit

7e9a955

•

1 Parent(s): f1199f7

Update README.md

Files changed (1) hide show

README.md +5 -0

README.md CHANGED Viewed

@@ -4,6 +4,11 @@ license: mit
 # ESM-2 QLoRA for Binding Sites Prediction
 ## Testing for Overfitting

 # ESM-2 QLoRA for Binding Sites Prediction
+In this model we added in more QLoRA adapter layers, modifying all of the weight matrices with QLoRA. The differences between the
+train and test metrics, again, are smaller for this model than for the model with fewer adapter layers (only using query, key, and value
+matrices). So, we see that adapting more of the weight matrices in this larger ESM-2 model decreases overfitting and serves as a better
+regularizer. For comparison, see [this model](https://huggingface.co/AmelieSchreiber/esm2_t12_35M_qlora_binding_sites_v0) which only
+has QLoRA adapters on the query, key, and value matrices.
 ## Testing for Overfitting