AmelieSchreiber
/

esm2_t12_35M_lora_binding_sites_v2_cp3

Token Classification

protein language model

Model card Files Files and versions Community

AmelieSchreiber commited on Sep 19, 2023

Commit

060e98e

·

1 Parent(s): 1121e0e

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -55,7 +55,7 @@ comprehensive. Using the notebook linked above should help further evaluate the
 This model is a finetuned version of the 35M parameter `esm2_t12_35M_UR50D` ([see here](https://huggingface.co/facebook/esm2_t12_35M_UR50D)
 and [here](https://huggingface.co/docs/transformers/model_doc/esm) for more details). The model was finetuned with LoRA for
-the binay token classification task of predicting binding sites (and active sites) of protein sequences based on sequence alone.
 The model may need more training, however it still achieves better performance on the test set in terms of loss, accuracy,
 precision, recall, F1 score, ROC_AUC, and Matthews Correlation Coefficient (MCC) compared to the models trained on the smaller
 dataset [found here](https://huggingface.co/datasets/AmelieSchreiber/binding_sites_random_split_by_family) of ~209K protein sequences. Note,

 This model is a finetuned version of the 35M parameter `esm2_t12_35M_UR50D` ([see here](https://huggingface.co/facebook/esm2_t12_35M_UR50D)
 and [here](https://huggingface.co/docs/transformers/model_doc/esm) for more details). The model was finetuned with LoRA for
+the binary token classification task of predicting binding sites (and active sites) of protein sequences based on sequence alone.
 The model may need more training, however it still achieves better performance on the test set in terms of loss, accuracy,
 precision, recall, F1 score, ROC_AUC, and Matthews Correlation Coefficient (MCC) compared to the models trained on the smaller
 dataset [found here](https://huggingface.co/datasets/AmelieSchreiber/binding_sites_random_split_by_family) of ~209K protein sequences. Note,