Prediction of Protein-Peptide Interactions with a Nearest Neighbor Algorithm
文献类型:期刊论文
作者 | Li, Bi-Qing1,2; Cai, Yu-Dong2; Zhang, Yu-Hang3; Jin, Mei-Ling3; Huang, Tao3; , |
刊名 | CURRENT BIOINFORMATICS
![]() |
出版日期 | 2018 |
卷号 | 13期号:1页码:14-24 |
关键词 | Protein-peptide interactions maximum relevance minimum redundancy incremental feature selection functional domain composition pseudo-amino acid composition |
ISSN号 | 1574-8936 |
DOI | 10.2174/1574893611666160711162006 |
文献子类 | Article |
英文摘要 | Background: The post-translational modification of tyrosine nitration is an important covalently substituted process in many biochemical processes that are closely related to several human diseases. Objective: Therefore, the correct recognition of nitration sites is useful for diseases diagnosis and the design of effective treatments. However, traditional experimental techniques and methods for the identification of nitration sites are time-consuming, labor-intensive and expensive. Alternatively, effective computational methods can be designed to tackle this problem. Methods: In this study, we proposed a computational workflow to identify and analyze nitrated tyrosine residues in proteins. Specifically, each nitrated tyrosine was represented by features derived from a segment of an amino acid sequence of the protein containing the nitrated tyrosine site. A reliable feature selection method, minimum redundancy maximum relevance, was adopted to analyze these features, and incremental feature selection and a type of support vector machine, SMO (sequential minimal optimization), were employed to extract core features and build an optimal prediction classifier. Results: 223 features were extracted and used to build the optimal prediction classifier, with which the Matthew's correlation coefficient (MCC) of the training set was 0.717. The nitration sites in the testing set were extracted from the UniProt database based on the sequence similarity technique and were all denoted as positive samples. The sensitivity of the optimal classifier was 0.950 for the testing set. The results demonstrate the effectiveness and importance of optimal features and the classifier for the recognition of nitration sites. In addition, three other methods, the nearest neighbor algorithm (NNA), Dagging and random forest (RF) methods were also applied to the training and testing set, and the results were compared with those of the SMO. Conclusion: 61 core features of the 223 total features were analyzed, and this analysis revealed the essential residue types and conserved sites proximal to the central tyrosine residue. |
学科主题 | Biochemistry & Molecular Biology ; Mathematical & Computational Biology |
WOS关键词 | AMINO-ACID-COMPOSITION ; TERMINAL SH2 DOMAIN ; SUBCELLULAR LOCATION PREDICTION ; CRYSTAL-STRUCTURE ; FEATURE-SELECTION ; UBA DOMAIN ; C-ABL ; BINDING ; RECOGNITION ; SRC |
语种 | 英语 |
WOS记录号 | WOS:000425531200003 |
出版者 | BENTHAM SCIENCE PUBL LTD |
版本 | 出版稿 |
源URL | [http://202.127.25.144/handle/331004/705] ![]() |
专题 | 中国科学院上海生命科学研究院营养科学研究所 |
作者单位 | 1.Chinese Acad Sci, Shanghai Inst Biol Sci, Key Lab Syst Biol, Shanghai 200031, Peoples R China; 2.Shanghai Univ, Coll Life Sci, Shanghai 200444, Peoples R China; 3.Chinese Acad Sci, Shanghai Inst Biol Sci, Inst Hlth Sci, Shanghai 200031, Peoples R China, |
推荐引用方式 GB/T 7714 | Li, Bi-Qing,Cai, Yu-Dong,Zhang, Yu-Hang,et al. Prediction of Protein-Peptide Interactions with a Nearest Neighbor Algorithm[J]. CURRENT BIOINFORMATICS,2018,13(1):14-24. |
APA | Li, Bi-Qing,Cai, Yu-Dong,Zhang, Yu-Hang,Jin, Mei-Ling,Huang, Tao,&,.(2018).Prediction of Protein-Peptide Interactions with a Nearest Neighbor Algorithm.CURRENT BIOINFORMATICS,13(1),14-24. |
MLA | Li, Bi-Qing,et al."Prediction of Protein-Peptide Interactions with a Nearest Neighbor Algorithm".CURRENT BIOINFORMATICS 13.1(2018):14-24. |
入库方式: OAI收割
来源:上海营养与健康研究所
浏览0
下载0
收藏0
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。