中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Unbiased Feature Selection in Learning Random Forests for High Dimensional Data

文献类型:期刊论文

作者Thanh-Tung Nguyen; Joshua Zhexue Huang; Thuy Thi Nguyen
刊名The Scientific World Journal
出版日期2015
英文摘要Random forests (RFs) have been widely used as a powerful classification method. However, with the randomization in both bagging samples and feature selection, the trees in the forest tend to select uninformative features for node splitting. This makes RFs have poor accuracy when working with high-dimensional data. Besides that, RFs have bias in the feature selection process where multivalued features are favored. Aiming at debiasing feature selection in RFs, we propose a new RF algorithm, called xRF, to select good features in learning RFs for high-dimensional data. We first remove the uninformative features using -value assessment, and the subset of unbiased features is then selected based on some statistical measures. This feature subset is then partitioned into two subsets. A feature weighting sampling technique is used to sample features from these two subsets for building trees. This approach enables one to generate more accurate trees, while allowing one to reduce dimensionality and the amount of data needed for learning RFs. An extensive set of experiments has been conducted on 47 high-dimensional real-world datasets including image datasets. The experimental results have shown that RFs with the proposed approach outperformed the existing random forests in increasing the accuracy and the AUC measures.
收录类别其他
原文出处http://www.hindawi.com/journals/tswj/2015/471371/
语种英语
源URL[http://ir.siat.ac.cn:8080/handle/172644/6903]  
专题深圳先进技术研究院_数字所
作者单位The Scientific World Journal
推荐引用方式
GB/T 7714
Thanh-Tung Nguyen,Joshua Zhexue Huang,Thuy Thi Nguyen. Unbiased Feature Selection in Learning Random Forests for High Dimensional Data[J]. The Scientific World Journal,2015.
APA Thanh-Tung Nguyen,Joshua Zhexue Huang,&Thuy Thi Nguyen.(2015).Unbiased Feature Selection in Learning Random Forests for High Dimensional Data.The Scientific World Journal.
MLA Thanh-Tung Nguyen,et al."Unbiased Feature Selection in Learning Random Forests for High Dimensional Data".The Scientific World Journal (2015).

入库方式: OAI收割

来源:深圳先进技术研究院

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。