File size: 1,245 Bytes
8ced595
 
 
 
b927ed9
8ced595
 
 
 
913cc34
8ced595
5e19d4c
 
8ced595
5e19d4c
 
5e3b044
f9bbb87
5e19d4c
 
8ced595
5e19d4c
3a3cc48
8ced595
5e19d4c
 
8ced595
5e19d4c
8ced595
5e19d4c
8ced595
5e19d4c
8ced595
5e19d4c
 
8ced595
5e19d4c
8ced595
5e19d4c
8ced595
5e19d4c
8ced595
5e19d4c
8ced595
5e19d4c
8ced595
5e19d4c
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
---
base_model: westlake-repl/SaProt_35M_AF2
library_name: peft
---
# Base model: [westlake-repl/SaProt_35M_AF2](https://huggingface.co./westlake-repl/SaProt_35M_AF2)

# Model Card for Model ID

<!-- Provide a quick summary of what the model is/does. -->
This model is used to predict the spike receptor-binding domain (RBD) expression of SARSCoV-2 Omicron XBB.1.5 variants.

### Task type
protein level regression

### Dataset description
The dataset is from [Deep mutational scans of XBB.1.5 and BQ.1.1 reveal ongoing epistatic drift during SARSCoV-2 evolution](https://doi.org/10.1371/journal.ppat.1011901).
Label means the delta level of RBD expression (log-mean fluorescence intensity) compre to wildtype, 
ranging from minus infinity to positive infinity. Zero means wildtype expression, lager means higher expression and smaller means lower expression.
### Model input type
Amino acid sequence

### Performance
0.68 Spearman's ρ

### LoRA config
lora_dropout: 0.0

lora_alpha: 16

target_modules: ["query", "key", "value", "intermediate.dense", "output.dense"]

modules_to_save: ["classifier"]

### Training config
class: AdamW

betas: (0.9, 0.98)

weight_decay: 0.01

learning rate: 5e-4

epoch: 100

batch size: 200

precision: 16-mixed