AmelieSchreiber
commited on
Commit
•
060e98e
1
Parent(s):
1121e0e
Update README.md
Browse files
README.md
CHANGED
@@ -55,7 +55,7 @@ comprehensive. Using the notebook linked above should help further evaluate the
|
|
55 |
|
56 |
This model is a finetuned version of the 35M parameter `esm2_t12_35M_UR50D` ([see here](https://huggingface.co/facebook/esm2_t12_35M_UR50D)
|
57 |
and [here](https://huggingface.co/docs/transformers/model_doc/esm) for more details). The model was finetuned with LoRA for
|
58 |
-
the
|
59 |
The model may need more training, however it still achieves better performance on the test set in terms of loss, accuracy,
|
60 |
precision, recall, F1 score, ROC_AUC, and Matthews Correlation Coefficient (MCC) compared to the models trained on the smaller
|
61 |
dataset [found here](https://huggingface.co/datasets/AmelieSchreiber/binding_sites_random_split_by_family) of ~209K protein sequences. Note,
|
|
|
55 |
|
56 |
This model is a finetuned version of the 35M parameter `esm2_t12_35M_UR50D` ([see here](https://huggingface.co/facebook/esm2_t12_35M_UR50D)
|
57 |
and [here](https://huggingface.co/docs/transformers/model_doc/esm) for more details). The model was finetuned with LoRA for
|
58 |
+
the binary token classification task of predicting binding sites (and active sites) of protein sequences based on sequence alone.
|
59 |
The model may need more training, however it still achieves better performance on the test set in terms of loss, accuracy,
|
60 |
precision, recall, F1 score, ROC_AUC, and Matthews Correlation Coefficient (MCC) compared to the models trained on the smaller
|
61 |
dataset [found here](https://huggingface.co/datasets/AmelieSchreiber/binding_sites_random_split_by_family) of ~209K protein sequences. Note,
|