---
license: mit
language:
- en
library_name: peft
tags:
- ESM-2
- Proteins
- Binding Sites
- QLoRA
- biology
---

# ESM-2 QLoRA for Predicting Binding Sites

## QLoRA Info

```
trainable params: 208322 || all params: 17382365 || trainable%: 1.198467527289871
```
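The summary line above has the shape printed by PEFT's `print_trainable_parameters()`. As a quick sanity check, the percentage can be recomputed from the two counts. This is a minimal sketch; the variable names are illustrative and the numbers are copied from the line above.

```python
# Parameter counts copied from the QLoRA summary line above.
trainable_params = 208_322
all_params = 17_382_365

# Share of parameters updated during fine-tuning, as a percentage.
trainable_pct = 100 * trainable_params / all_params
print(f"trainable%: {trainable_pct:.6f}")
```

Only about 1.2% of the counted parameters are trained; the rest stay frozen.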

## Testing for Overfitting

```python
# Train metrics
{'eval_loss': 0.09572703391313553,
 'eval_accuracy': 0.9670769479865963,
 'eval_precision': 0.3970221190232079,
 'eval_recall': 0.9411011487595375,
 'eval_f1': 0.5584507515735834,
 'eval_auc': 0.9543828770020467,
 'eval_mcc': 0.5996252550053665}

# Test metrics
{'eval_loss': 0.1680256575345993,
 'eval_accuracy': 0.943313091525589,
 'eval_precision': 0.2342637814982173,
 'eval_recall': 0.7618306193745306,
 'eval_f1': 0.35833816875074714,
 'eval_auc': 0.8544971814140561,
 'eval_mcc': 0.40290081143832884}
```
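One way to read these numbers for overfitting is to look at the train/test gap for each metric. The sketch below is an illustration, not part of the original training code: it copies a few of the values reported above, recomputes F1 from precision and recall as a consistency check, and prints the gap per metric. The names `train`, `test`, `f1_from_pr`, and `gaps` are hypothetical.

```python
# Metrics copied from the train/test report above (subset).
train = {
    "eval_precision": 0.3970221190232079,
    "eval_recall": 0.9411011487595375,
    "eval_f1": 0.5584507515735834,
    "eval_auc": 0.9543828770020467,
    "eval_mcc": 0.5996252550053665,
}
test = {
    "eval_precision": 0.2342637814982173,
    "eval_recall": 0.7618306193745306,
    "eval_f1": 0.35833816875074714,
    "eval_auc": 0.8544971814140561,
    "eval_mcc": 0.40290081143832884,
}


def f1_from_pr(precision: float, recall: float) -> float:
    """F1 is the harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


# Consistency check: the reported F1 should follow from precision and recall.
train_f1 = f1_from_pr(train["eval_precision"], train["eval_recall"])
test_f1 = f1_from_pr(test["eval_precision"], test["eval_recall"])

# Train minus test for each metric; larger gaps suggest more overfitting.
gaps = {name: train[name] - test[name] for name in train}

for name, gap in gaps.items():
    print(f"{name}: train - test = {gap:.4f}")
```

Every gap is positive, so the model does better on the data it was trained on. In both splits recall is much higher than precision.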

Metrics on the PDB datasets from [this paper](https://github.com/hamzagamouh/pt-lm-gnn) are available in
[`pdb_structure_metrics.txt`](https://huggingface.co/AmelieSchreiber/esmt12_35M_qlora_binding_sites_v0/blob/main/pdb_structure_metrics.txt).