File size: 1,292 Bytes
6ac3f88
 
 
 
c5703ba
6ac3f88
 
 
 
7fe87ac
 
6ac3f88
62700c3
6ac3f88
7fe87ac
 
6ac3f88
7fe87ac
046b510
6ac3f88
d2b8317
 
62700c3
7fe87ac
 
6ac3f88
7fe87ac
 
62700c3
7fe87ac
6ac3f88
7fe87ac
 
6ac3f88
7fe87ac
6ac3f88
7fe87ac
6ac3f88
7fe87ac
6ac3f88
7fe87ac
 
6ac3f88
7fe87ac
6ac3f88
7fe87ac
6ac3f88
7fe87ac
6ac3f88
7fe87ac
6ac3f88
7fe87ac
6ac3f88
7fe87ac
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
---
base_model: westlake-repl/SaProt_35M_AF2
library_name: peft
---
# Base model: [westlake-repl/SaProt_35M_AF2](https://huggingface.co/westlake-repl/SaProt_35M_AF2)

# Model Card for Model ID

<!-- Provide a quick summary of what the model is/does. -->
This model is used to predict interaction of antigen-variable domain of heavy chain of heavy chain antibody (VHH) with 
severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) spike proteins. 

This model is trained on Alpha variants, thus can only be used to pridict interaction with Alpha variants.

### Task type
protein level classification

### Dataset description
The dataset is from [COGNANO/AVIDa-SARS-CoV-2](https://huggingface.co/datasets/COGNANO/AVIDa-SARS-CoV-2).

We collect all amino acid sequences whose antigen type is Alpha varitant.

Binary label represented by 1 for the binding pair and 0 for the non-binding pair.
### Model input type
Amino acid sequence

### Performance
test_acc: 0.95

test_loss: 0.32

### LoRA config
lora_dropout: 0.0

lora_alpha: 16

target_modules: ["query", "key", "value", "intermediate.dense", "output.dense"]

modules_to_save: ["classifier"]

### Training config
class: AdamW

betas: (0.9, 0.98)

weight_decay: 0.01

learning rate: 1e-4

epoch: 10

batch size: 200

precision: 16-mixed