Chinese Academy of Sciences Institutional Repositories Grid
Exploiting word cluster information for unsupervised feature selection

Document type: Conference paper

Authors: Qingyao Wu; Yunming Ye; Michael Ng; Hanjing Su; Joshua Huang
Publication date: 2010
Conference: 11th Pacific Rim International Conference on Artificial Intelligence, PRICAI 2010
Abstract: This paper presents an approach for integrating word clustering information into unsupervised feature selection. In our scheme, the words in the whole feature space are clustered into groups based on their co-occurrence statistics. The resulting word cluster information is combined with the bag-of-words information to measure the goodness of each word, which is our basic metric for selecting discriminative features. By exploiting word cluster information, we extend three well-known unsupervised feature selection methods and propose three new methods. A series of experiments is performed on three benchmark text data sets (20 Newsgroups, Reuters-21578 and CLASSIC3). The experimental results show that the new unsupervised feature selection methods select more discriminative features and, in turn, improve clustering performance.
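Indexed by: EI
Language: English
Source URL: http://ir.siat.ac.cn:8080/handle/172644/3121
Collection: Shenzhen Institutes of Advanced Technology_Digital Institute
Author affiliation: 2010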
Recommended citation
GB/T 7714
Qingyao Wu, Yunming Ye, Michael Ng, et al. Exploiting word cluster information for unsupervised feature selection[C]. In: 11th Pacific Rim International Conference on Artificial Intelligence, PRICAI 2010.

Ingest method: OAI harvesting

Source: Shenzhen Institutes of Advanced Technology
