AmelieSchreiber
commited on
Commit
•
8bb011f
1
Parent(s):
b3cf64d
Update README.md
Browse files
README.md
CHANGED
@@ -29,7 +29,8 @@ tags:
|
|
29 |
This model is trained to predict general binding sites of proteins using on the sequence. This is a finetuned version of
|
30 |
`esm2_t6_8M_UR50D`, trained on [this dataset](https://huggingface.co/datasets/AmelieSchreiber/general_binding_sites). The data is
|
31 |
not filtered by family, and thus the model may be overfit to some degree. In the Hugging Face Inference API widget to the right
|
32 |
-
there are three protein sequence examples. The first is a DNA binding protein
|
|
|
33 |
|
34 |
The second and third were obtained using [EvoProtGrad](https://github.com/Amelie-Schreiber/sampling_protein_language_models/blob/main/EvoProtGrad_copy.ipynb)
|
35 |
a Markov Chain Monte Carlo method of (in silico) directed evolution of proteins based on a form of Gibbs sampling. The mutatant-type
|
|
|
29 |
This model is trained to predict general binding sites of proteins using on the sequence. This is a finetuned version of
|
30 |
`esm2_t6_8M_UR50D`, trained on [this dataset](https://huggingface.co/datasets/AmelieSchreiber/general_binding_sites). The data is
|
31 |
not filtered by family, and thus the model may be overfit to some degree. In the Hugging Face Inference API widget to the right
|
32 |
+
there are three protein sequence examples. The first is a DNA binding protein truncated to the first 1022 amino acid residues
|
33 |
+
([see UniProt entry here](https://www.uniprot.org/uniprotkb/D3ZG52/entry)).
|
34 |
|
35 |
The second and third were obtained using [EvoProtGrad](https://github.com/Amelie-Schreiber/sampling_protein_language_models/blob/main/EvoProtGrad_copy.ipynb)
|
36 |
a Markov Chain Monte Carlo method of (in silico) directed evolution of proteins based on a form of Gibbs sampling. The mutatant-type
|