中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
DenPEHC: Density peak based efficient hierarchical clustering

文献类型:期刊论文

作者Xu, Ji1,2,4; Wang, Guoyin3; Deng, Weihui2
刊名INFORMATION SCIENCES
出版日期2016-12-10
卷号373页码:200-218
ISSN号0020-0255
关键词Hierarchical Clustering Density Peaks Grid Granulation Granular Computing
DOI10.1016/j.ins.2016.08.086
英文摘要

Existing hierarchical clustering algorithms involve a flat clustering component and an additional agglomerative or divisive procedure. This paper presents a density peak based hierarchical clustering method (DenPEHC), which directly generates clusters on each possible clustering layer, and introduces a grid granulation framework to enable DenPEHC to cluster large-scale and high-dimensional (LSHD) datasets. This study consists of three parts: (1) utilizing the distribution of the parameter gamma, which is defined as the product of the local density rho and the minimal distance to data points with higher density delta in "clustering by fast search and find of density peaks" (DPClust), and a linear fitting approach to select clustering centers with the clustering hierarchy decided by finding the "stairs" in the gamma curve; (2) analyzing the leading tree (in which each node except the root is led by its parent to join the same cluster) as an intermediate result of DPClust, and constructing the clustering hierarchy efficiently based on the tree; and (3) designing a framework to enable DenPEHC to cluster LSHD datasets when a large number of attributes can be grouped by their semantics. The proposed method builds the clustering hierarchy by simply disconnecting the center points from their parents with a linear computational complexity 0(m), where m is the number of clusters. Experiments on synthetic and real datasets show that the proposed method has promising efficiency, accuracy and robustness compared to state-of-the-art methods. (C) 2016 Elsevier Inc. All rights reserved.

资助项目National Key Research and Development Program of China[2016YFB1000905] ; National Natural Science Foundation of China[61272060] ; National Natural Science Foundation of China[61572091]
WOS研究方向Computer Science
语种英语
出版者ELSEVIER SCIENCE INC
WOS记录号WOS:000385470400012
源URL[http://119.78.100.138/handle/2HOD01W0/2909]  
专题大数据挖掘及应用中心
作者单位1.Southwest Jiaotong Univ, Sch Informat Sci & Technol, Chengdu 610031, Peoples R China
2.Chinese Acad Sci, Chongqing Inst Green & Intelligent Technol, Chongqing 400714, Peoples R China
3.Chongqing Univ Posts & Telecommun, Chongqing Key Lab Computat Intelligence, Chongqing 400065, Peoples R China
4.Guizhou Univ Engn Sci, Sch Informat Engn, Bijie 551700, Peoples R China
推荐引用方式
GB/T 7714
Xu, Ji,Wang, Guoyin,Deng, Weihui. DenPEHC: Density peak based efficient hierarchical clustering[J]. INFORMATION SCIENCES,2016,373:200-218.
APA Xu, Ji,Wang, Guoyin,&Deng, Weihui.(2016).DenPEHC: Density peak based efficient hierarchical clustering.INFORMATION SCIENCES,373,200-218.
MLA Xu, Ji,et al."DenPEHC: Density peak based efficient hierarchical clustering".INFORMATION SCIENCES 373(2016):200-218.

入库方式: OAI收割

来源:重庆绿色智能技术研究院

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。