中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Prediction of Enzyme Subfamily Class via Pseudo Amino Acid Composition by Incorporating the Conjoint Triad Feature

文献类型:期刊论文

作者Wang, Yong-Cui1,2; Wang, Xiao-Bo1; Yang, Zhi-Xia1,3; Deng, Nai-Yang1
刊名protein and peptide letters
出版日期2010-11-01
卷号17期号:11页码:1441-1449
关键词Enzyme subfamily class prediction conjoint triad feature imbalance problem support vector machine
ISSN号0929-8665
中文摘要predicting enzyme subfamily class is an imbalance multi-class classification problem due to the fact that the number of proteins in each subfamily makes a great difference. in this paper, we focus on developing the computational methods specially designed for the imbalance multi-class classification problem to predict enzyme subfamily class. we compare two support vector machine (svm)-based methods for the imbalance problem, adaboost algorithm with rbfsvm (svm with rbf kernel) and svm with arithmetic mean (am) offset (am-svm) in enzyme subfamily classification. as input features for our predictive model, we use the conjoint triad feature (ctf). we validate two methods on an enzyme benchmark dataset, which contains six enzyme main families with a total of thirty-four subfamily classes, and those proteins have less than 40% sequence identity to any other in a same functional class. in predicting oxidoreductases subfamilies, am-svm obtains the over 0.92 matthew's correlation coefficient (mcc) and over 93% accuracy, and in predicting lyases, isomerases and ligases subfamilies, it obtains over 0.73 mcc and over 82% accuracy. the improvement in the predictive performance suggests the am-svm might play a complementary role to the existing function annotation methods.
英文摘要predicting enzyme subfamily class is an imbalance multi-class classification problem due to the fact that the number of proteins in each subfamily makes a great difference. in this paper, we focus on developing the computational methods specially designed for the imbalance multi-class classification problem to predict enzyme subfamily class. we compare two support vector machine (svm)-based methods for the imbalance problem, adaboost algorithm with rbfsvm (svm with rbf kernel) and svm with arithmetic mean (am) offset (am-svm) in enzyme subfamily classification. as input features for our predictive model, we use the conjoint triad feature (ctf). we validate two methods on an enzyme benchmark dataset, which contains six enzyme main families with a total of thirty-four subfamily classes, and those proteins have less than 40% sequence identity to any other in a same functional class. in predicting oxidoreductases subfamilies, am-svm obtains the over 0.92 matthew's correlation coefficient (mcc) and over 93% accuracy, and in predicting lyases, isomerases and ligases subfamilies, it obtains over 0.73 mcc and over 82% accuracy. the improvement in the predictive performance suggests the am-svm might play a complementary role to the existing function annotation methods.
WOS标题词science & technology ; life sciences & biomedicine
类目[WOS]biochemistry & molecular biology
研究领域[WOS]biochemistry & molecular biology
关键词[WOS]support vector machines ; protein structural classes ; subcellular location prediction ; functional domain composition ; complexity measure factor ; apoptosis proteins ; cleavage sites ; graphic rules ; turn types ; kinetics
收录类别SCI
语种英语
WOS记录号WOS:000284651900017
公开日期2011-12-13
源URL[http://ir.nwipb.ac.cn//handle/363003/1640]  
专题西北高原生物研究所_中国科学院西北高原生物研究所
作者单位1.China Agr Univ, Coll Sci, Beijing 100083, Peoples R China
2.Chinese Acad Sci, NW Inst Plateau Biol, Key Lab Adaptat & Evolut Plateau Biota, Xining 810001, Peoples R China
3.Xinjiang Univ, Coll Math & Syst Sci, Urumuchi 830046, Peoples R China
推荐引用方式
GB/T 7714
Wang, Yong-Cui,Wang, Xiao-Bo,Yang, Zhi-Xia,et al. Prediction of Enzyme Subfamily Class via Pseudo Amino Acid Composition by Incorporating the Conjoint Triad Feature[J]. protein and peptide letters,2010,17(11):1441-1449.
APA Wang, Yong-Cui,Wang, Xiao-Bo,Yang, Zhi-Xia,&Deng, Nai-Yang.(2010).Prediction of Enzyme Subfamily Class via Pseudo Amino Acid Composition by Incorporating the Conjoint Triad Feature.protein and peptide letters,17(11),1441-1449.
MLA Wang, Yong-Cui,et al."Prediction of Enzyme Subfamily Class via Pseudo Amino Acid Composition by Incorporating the Conjoint Triad Feature".protein and peptide letters 17.11(2010):1441-1449.

入库方式: OAI收割

来源:西北高原生物研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。