Prediction of Enzyme Subfamily Class via Pseudo Amino Acid Composition by Incorporating the Conjoint Triad Feature
文献类型:期刊论文
作者 | Wang, Yong-Cui1,2; Wang, Xiao-Bo1; Yang, Zhi-Xia1,3; Deng, Nai-Yang1 |
刊名 | protein and peptide letters
![]() |
出版日期 | 2010-11-01 |
卷号 | 17期号:11页码:1441-1449 |
关键词 | Enzyme subfamily class prediction conjoint triad feature imbalance problem support vector machine |
ISSN号 | 0929-8665 |
中文摘要 | predicting enzyme subfamily class is an imbalance multi-class classification problem due to the fact that the number of proteins in each subfamily makes a great difference. in this paper, we focus on developing the computational methods specially designed for the imbalance multi-class classification problem to predict enzyme subfamily class. we compare two support vector machine (svm)-based methods for the imbalance problem, adaboost algorithm with rbfsvm (svm with rbf kernel) and svm with arithmetic mean (am) offset (am-svm) in enzyme subfamily classification. as input features for our predictive model, we use the conjoint triad feature (ctf). we validate two methods on an enzyme benchmark dataset, which contains six enzyme main families with a total of thirty-four subfamily classes, and those proteins have less than 40% sequence identity to any other in a same functional class. in predicting oxidoreductases subfamilies, am-svm obtains the over 0.92 matthew's correlation coefficient (mcc) and over 93% accuracy, and in predicting lyases, isomerases and ligases subfamilies, it obtains over 0.73 mcc and over 82% accuracy. the improvement in the predictive performance suggests the am-svm might play a complementary role to the existing function annotation methods. |
英文摘要 | predicting enzyme subfamily class is an imbalance multi-class classification problem due to the fact that the number of proteins in each subfamily makes a great difference. in this paper, we focus on developing the computational methods specially designed for the imbalance multi-class classification problem to predict enzyme subfamily class. we compare two support vector machine (svm)-based methods for the imbalance problem, adaboost algorithm with rbfsvm (svm with rbf kernel) and svm with arithmetic mean (am) offset (am-svm) in enzyme subfamily classification. as input features for our predictive model, we use the conjoint triad feature (ctf). we validate two methods on an enzyme benchmark dataset, which contains six enzyme main families with a total of thirty-four subfamily classes, and those proteins have less than 40% sequence identity to any other in a same functional class. in predicting oxidoreductases subfamilies, am-svm obtains the over 0.92 matthew's correlation coefficient (mcc) and over 93% accuracy, and in predicting lyases, isomerases and ligases subfamilies, it obtains over 0.73 mcc and over 82% accuracy. the improvement in the predictive performance suggests the am-svm might play a complementary role to the existing function annotation methods. |
WOS标题词 | science & technology ; life sciences & biomedicine |
类目[WOS] | biochemistry & molecular biology |
研究领域[WOS] | biochemistry & molecular biology |
关键词[WOS] | support vector machines ; protein structural classes ; subcellular location prediction ; functional domain composition ; complexity measure factor ; apoptosis proteins ; cleavage sites ; graphic rules ; turn types ; kinetics |
收录类别 | SCI |
语种 | 英语 |
WOS记录号 | WOS:000284651900017 |
公开日期 | 2011-12-13 |
源URL | [http://ir.nwipb.ac.cn//handle/363003/1640] ![]() |
专题 | 西北高原生物研究所_中国科学院西北高原生物研究所 |
作者单位 | 1.China Agr Univ, Coll Sci, Beijing 100083, Peoples R China 2.Chinese Acad Sci, NW Inst Plateau Biol, Key Lab Adaptat & Evolut Plateau Biota, Xining 810001, Peoples R China 3.Xinjiang Univ, Coll Math & Syst Sci, Urumuchi 830046, Peoples R China |
推荐引用方式 GB/T 7714 | Wang, Yong-Cui,Wang, Xiao-Bo,Yang, Zhi-Xia,et al. Prediction of Enzyme Subfamily Class via Pseudo Amino Acid Composition by Incorporating the Conjoint Triad Feature[J]. protein and peptide letters,2010,17(11):1441-1449. |
APA | Wang, Yong-Cui,Wang, Xiao-Bo,Yang, Zhi-Xia,&Deng, Nai-Yang.(2010).Prediction of Enzyme Subfamily Class via Pseudo Amino Acid Composition by Incorporating the Conjoint Triad Feature.protein and peptide letters,17(11),1441-1449. |
MLA | Wang, Yong-Cui,et al."Prediction of Enzyme Subfamily Class via Pseudo Amino Acid Composition by Incorporating the Conjoint Triad Feature".protein and peptide letters 17.11(2010):1441-1449. |
入库方式: OAI收割
来源:西北高原生物研究所
浏览0
下载0
收藏0
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。