中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Ensemble subspace clustering of text data using two-level features

文献类型:期刊论文

作者He Zhao; Salman Salloum; Yeshou Cai; Joshua Zhexue Huang
刊名INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS
出版日期2017
文献子类期刊论文
英文摘要This paper proposes a new integrated method for ensemble subspace clustering of high dimensional sparse text data. Our method employs two-level feature representation of text data (words and topics) to generate clusters from subspaces. We also use ensemble clustering to increase the robustness of the clusters. This method depends on topic modeling to get the two-level feature representation of text data and to generate different ensemble components. By using both topics and words to cluster text data, we can get more interpretable clusters as we can measure the weight of words and topics in each cluster. In order to evaluate the proposed method, we have conducted several experiments on seven real-life data sets. While some of these data sets are easy to cluster, others are hard, and some others contain unbalanced data. Experimental results on this diversity of data sets show that our method outperforms other methods for ensemble clustering.
URL标识查看原文
语种英语
源URL[http://ir.siat.ac.cn:8080/handle/172644/12568]  
专题深圳先进技术研究院_数字所
作者单位INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS
推荐引用方式
GB/T 7714
He Zhao,Salman Salloum,Yeshou Cai,et al. Ensemble subspace clustering of text data using two-level features[J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS,2017.
APA He Zhao,Salman Salloum,Yeshou Cai,&Joshua Zhexue Huang.(2017).Ensemble subspace clustering of text data using two-level features.INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS.
MLA He Zhao,et al."Ensemble subspace clustering of text data using two-level features".INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS (2017).

入库方式: OAI收割

来源:深圳先进技术研究院

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。