中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Identifying the Characteristics of the Hypusination Sites Using SMOTE and SVM Algorithm with Feature Selection

文献类型:期刊论文

作者Sun, XiJun2; Li, JiaRui2; Wang, ShaoPeng2; Cai, Yu-Dong2; Gu, Lei1,3; Zhang, YuHang4; Huang, Tao4; ,
刊名CURRENT PROTEOMICS
出版日期2018
卷号15期号:2页码:111-118
关键词Hypusination SMOTE mRMR marchine-learning feature selection SMO
ISSN号1570-1646
DOI10.2174/1570164614666171109120615
文献子类Article
英文摘要Background: Hypusination is a unique modification on lysine residues in eukaryotic translation initiation factor 5A (eIF5A), which is essential and highly conserved in all kinds of eukaryotes. However, the mechanism of recognizing this particular hypusination site remains unclear. In this study, we first gave an attempt in uncovering the characteristics of the hypusination sites using computational methods. Method: The hypusination sites validated by experiments or predicted through sequence similarity that were retrieved from the UniProt database were selected for investigating. Each site was transformed into a peptide segment that contained the modification site and the residues around it. Four types of features were extracted from the peptide segments. Because the hypusination sites are much fewer than non-hypusination sites, the synthetic minority over-sampling technique (SMOTE) was performed to make the dataset containing them balanced. Then, some feature selection methods, including maximum relevance minimum redundancy (mRMR) and incremental feature selection (IFS), were used to analyze four types of features and build an optimal classifier that used support vector machine (SVM) as the prediction engine. Results: The obtained optimal SVM classifier harboring four amino acid features yielded a perfect Mathews' correlation coefficient (MCC) value of 1.000 on both training and testing sets, indicating these four features are hypusination specific characteristics. Conclusions: As a pioneer work, our analysis provides insight into the improvement of the understanding of hypusination mechanisms.
学科主题Biochemistry & Molecular Biology
WOS关键词TRANSLATION INITIATION-FACTOR ; AMINO-ACID ; FACTOR 5A ; POSTTRANSLATIONAL MODIFICATIONS ; PROTEIN ; PREDICTION ; EIF5A ; IDENTIFICATION ; BINDING ; EXPRESSION
语种英语
WOS记录号WOS:000427494700004
出版者BENTHAM SCIENCE PUBL LTD
版本出版稿
源URL[http://202.127.25.144/handle/331004/680]  
专题中国科学院上海生命科学研究院营养科学研究所
作者单位1.Harvard Med Sch, Cell Biol Dept, Boston, MA 02215 USA;
2.Shanghai Univ, Coll Life Sci, Shanghai 200444, Peoples R China;
3.Boston Childrens Hosp, Boston, MA 02215 USA;
4.Chinese Acad Sci, Shanghai Inst Biol Sci, Inst Hlth Sci, Shanghai 200031, Peoples R China,
推荐引用方式
GB/T 7714
Sun, XiJun,Li, JiaRui,Wang, ShaoPeng,et al. Identifying the Characteristics of the Hypusination Sites Using SMOTE and SVM Algorithm with Feature Selection[J]. CURRENT PROTEOMICS,2018,15(2):111-118.
APA Sun, XiJun.,Li, JiaRui.,Wang, ShaoPeng.,Cai, Yu-Dong.,Gu, Lei.,...&,.(2018).Identifying the Characteristics of the Hypusination Sites Using SMOTE and SVM Algorithm with Feature Selection.CURRENT PROTEOMICS,15(2),111-118.
MLA Sun, XiJun,et al."Identifying the Characteristics of the Hypusination Sites Using SMOTE and SVM Algorithm with Feature Selection".CURRENT PROTEOMICS 15.2(2018):111-118.

入库方式: OAI收割

来源:上海营养与健康研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。