中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
A method for handling metabonomics data from liquid chromatography/mass spectrometry: combinational use of support vector machine recursive feature elimination, genetic algorithm and random forest for feature selection

文献类型:期刊论文

作者Lin, Xiaohui1; Wang, Quancai1; Yin, Peiyuan2; Tang, Liang3; Tan, Yexiong3; Li, Hong1; Yan, Kang1; Xu, Guowang2
刊名metabolomics
出版日期2011-12-01
卷号7期号:4页码:549-558
关键词Support vector machine Genetic algorithm Random forest Liver diseases Metabonomics Metabolomics
英文摘要metabolic markers are the core of metabonomic surveys. hence selection of differential metabolites is of great importance for either biological or clinical purpose. here, a feature selection method was developed for complex metabonomic data set. as an effective tool for metabonomics data analysis, support vector machine (svm) was employed as the basic classifier. to find out meaningful features effectively, support vector machine recursive feature elimination (svm-rfe) was firstly applied. then, genetic algorithm (ga) and random forest (rf) which consider the interaction among the metabolites and independent performance of each metabolite in all samples, respectively, were used to obtain more informative metabolic difference and avoid the risk of false positive. a data set from plasma metabonomics study of rat liver diseases developed from hepatitis, cirrhosis to hepatocellular carcinoma was applied for the validation of the method. besides the good classification results for 3 kinds of liver diseases, 31 important metabolites including lysophosphatidylethanolamine (lpe) c16:0, palmitoylcarnitine, lysophosphatidylethanolamine (lpc) c18:0 were also selected for further studies. a better complementary effect of the three feature selection methods could be seen from the current results. the combinational method also represented more differential metabolites and provided more metabolic information for a "global" understanding of diseases than any single method. further more, this method is also suitable for other complex biological data sets.
WOS标题词science & technology ; life sciences & biomedicine
类目[WOS]endocrinology & metabolism
研究领域[WOS]endocrinology & metabolism
关键词[WOS]variable importance measures ; mass-spectrometry ; expression data ; classification ; metabolomics ; biomarkers ; hepatitis ; discovery
收录类别SCI
语种英语
WOS记录号WOS:000295991900009
公开日期2015-11-17
源URL[http://159.226.238.44/handle/321008/142714]  
专题大连化学物理研究所_中国科学院大连化学物理研究所
作者单位1.Dalian Univ Technol, Sch Comp Sci & Technol, Dalian 116024, Peoples R China
2.Chinese Acad Sci, Dalian Inst Chem Phys, CAS Key Lab Separat Sci Analyt Chem, Dalian 116023, Peoples R China
3.Second Mil Med Univ, Eastern Hepatobiliary Surg Inst, Int Cooperat Lab Signal Transduct, Shanghai, Peoples R China
推荐引用方式
GB/T 7714
Lin, Xiaohui,Wang, Quancai,Yin, Peiyuan,et al. A method for handling metabonomics data from liquid chromatography/mass spectrometry: combinational use of support vector machine recursive feature elimination, genetic algorithm and random forest for feature selection[J]. metabolomics,2011,7(4):549-558.
APA Lin, Xiaohui.,Wang, Quancai.,Yin, Peiyuan.,Tang, Liang.,Tan, Yexiong.,...&Xu, Guowang.(2011).A method for handling metabonomics data from liquid chromatography/mass spectrometry: combinational use of support vector machine recursive feature elimination, genetic algorithm and random forest for feature selection.metabolomics,7(4),549-558.
MLA Lin, Xiaohui,et al."A method for handling metabonomics data from liquid chromatography/mass spectrometry: combinational use of support vector machine recursive feature elimination, genetic algorithm and random forest for feature selection".metabolomics 7.4(2011):549-558.

入库方式: OAI收割

来源:大连化学物理研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。