中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Accurate and robust protein sequence design with CarbonDesign

文献类型:期刊论文

作者Ren, Milong1,2; Yu, Chungong1,2,3; Bu, Dongbo1,2,3; Zhang, Haicang1,2,3
刊名NATURE MACHINE INTELLIGENCE
出版日期2024-05-01
卷号6期号:5页码:14
DOI10.1038/s42256-024-00838-2
英文摘要Protein sequence design is critically important for protein engineering. Despite recent advancements in deep learning-based methods, achieving accurate and robust sequence design remains a challenge. Here we present CarbonDesign, an approach that draws inspiration from successful ingredients of AlphaFold and which has been developed specifically for protein sequence design. At its core, CarbonDesign introduces Inverseformer, which learns representations from backbone structures and an amortized Markov random fields model for sequence decoding. Moreover, we incorporate other essential AlphaFold concepts into CarbonDesign: an end-to-end network recycling technique to leverage evolutionary constraints from protein language models and a multitask learning technique for generating side-chain structures alongside designed sequences. CarbonDesign outperforms other methods on independent test sets including the 15th Critical Assessment of protein Structure Prediction (CASP15) dataset, the Continuous Automated Model Evaluation (CAMEO) dataset and de novo proteins from RFDiffusion. Furthermore, it supports zero-shot prediction of the functional effects of sequence variants, making it a promising tool for applications in bioengineering. Deep learning has led to great advances in predicting protein structure from sequences. Ren and colleagues present here a method for the inverse problem of finding a sequence that results in a desired protein structure, which is inspired by various components of AlphaFold combined with Markov random fields to decode sequences more efficiently.
资助项目National Natural Science Foundation of China (National Science Foundation of China)[32271297] ; National Natural Science Foundation of China (National Science Foundation of China)[62072435] ; National Natural Science Foundation of China ; Project of Youth Innovation Promotion Association CAS[2020YFA0907000] ; Development Program of China
WOS研究方向Computer Science
语种英语
WOS记录号WOS:001230286600010
出版者NATURE PORTFOLIO
源URL[http://119.78.100.204/handle/2XEOYT63/40080]  
专题中国科学院计算技术研究所期刊论文_英文
通讯作者Bu, Dongbo; Zhang, Haicang
作者单位1.Chinese Acad Sci, Inst Comp Technol, SKLP, Beijing, Peoples R China
2.Univ Chinese Acad Sci, Beijing, Peoples R China
3.Cent China Inst Artificial Intelligence, Zhengzhou, Peoples R China
推荐引用方式
GB/T 7714
Ren, Milong,Yu, Chungong,Bu, Dongbo,et al. Accurate and robust protein sequence design with CarbonDesign[J]. NATURE MACHINE INTELLIGENCE,2024,6(5):14.
APA Ren, Milong,Yu, Chungong,Bu, Dongbo,&Zhang, Haicang.(2024).Accurate and robust protein sequence design with CarbonDesign.NATURE MACHINE INTELLIGENCE,6(5),14.
MLA Ren, Milong,et al."Accurate and robust protein sequence design with CarbonDesign".NATURE MACHINE INTELLIGENCE 6.5(2024):14.

入库方式: OAI收割

来源:计算技术研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。