InstructPLM

InstructPLM is a state-of-the-art protein design model built on ProGen2 and ProteinMPNN and trained on the CATH 4.2 dataset. Given a backbone structure, it designs protein sequences that accurately conform to that structure.

Please visit our repo and paper for more information.

@article {Qiu2024.04.17.589642,
    author = {Jiezhong Qiu and Junde Xu and Jie Hu and Hanqun Cao and Liya Hou and Zijun Gao and Xinyi Zhou and Anni Li and Xiujuan Li and Bin Cui and Fei Yang and Shuang Peng and Ning Sun and Fangyu Wang and Aimin Pan and Jie Tang and Jieping Ye and Junyang Lin and Jin Tang and Xingxu Huang and Pheng Ann Heng and Guangyong Chen},
    title = {InstructPLM: Aligning Protein Language Models to Follow Protein Structure Instructions},
    elocation-id = {2024.04.17.589642},
    year = {2024},
    doi = {10.1101/2024.04.17.589642},
    publisher = {Cold Spring Harbor Laboratory},
    URL = {https://www.biorxiv.org/content/early/2024/04/20/2024.04.17.589642},
    eprint = {https://www.biorxiv.org/content/early/2024/04/20/2024.04.17.589642.full.pdf},
    journal = {bioRxiv}
}
Model size: 6.57B params (Safetensors)
Tensor types: F32, FP16, BOOL
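
As a rough guide to hardware requirements, the 6.57B parameter count and the F32 and FP16 tensor types listed above can be turned into an approximate weight-memory estimate. The sketch below is only back-of-the-envelope arithmetic: the helper name `weight_memory_gb` is hypothetical, it assumes every parameter is stored in a single dtype (the actual checkpoint mixes types, so its real size likely falls between the two figures), and it ignores activations and any other runtime overhead.

```python
# Bytes used per parameter for the floating-point tensor types listed on this card.
BYTES_PER_PARAM = {"F32": 4, "FP16": 2}

# Parameter count as reported on this card (6.57B params).
NUM_PARAMS = 6.57e9


def weight_memory_gb(num_params: float, dtype: str) -> float:
    """Approximate weight storage in gigabytes (1 GB = 10**9 bytes).

    Assumes all parameters share one dtype, which is a simplification.
    """
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


for dtype in ("F32", "FP16"):
    print(f"{dtype}: ~{weight_memory_gb(NUM_PARAMS, dtype):.1f} GB")
```

Under these assumptions the weights alone need roughly 26 GB in F32 and roughly 13 GB in FP16, so halving the precision halves the storage needed for weights.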