Title: CDNASA: Clustering data with noise and arbitrary shape
Authors: Niu, Zhong-Han; Fan, Jian-Cong; Liu, Wen-Hua; Tang, Liang; Tang, Shuai
Author affiliations: [Niu, Zhong-Han; Fan, Jian-Cong] Provincial Key Lab. for Information Technology of Wisdom Mining of Shandong Province, Shandong University of Science
Source: International Journal of Wireless and Mobile Computing
Abstract: In many data domains, especially spatial data, clusters are of arbitrary shape, size and density. Traditional clustering methods often fail to identify clusters efficiently or accurately in such situations, yet the rapid growth of spatial data in recent years has created a need for scalable spatial clustering algorithms. In this paper we propose a spatial clustering method, named CDNASA, based on the idea that each data object belongs to a certain space and that if two spaces have overlapping sections, they can be merged into one cluster. Data points that cannot be merged into any cluster are noise points. The effectiveness and efficiency of the proposed algorithm are tested on both synthetic and real data sets. Experimental results show that the quality of clusters discovered by CDNASA is much better than that of clusters discovered by existing algorithms, especially for arbitrarily shaped clusters. CDNASA is also noise-tolerant and has low time and space complexity.
Copyright © 2016 Inderscience Enterprises Ltd.
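
Illustrative sketch (not from the paper): the abstract's core idea, that overlapping "spaces" are merged into one cluster and unmerged points are noise, can be illustrated roughly as follows. The abstract does not define what a point's space is, so this sketch assumes a fixed-radius ball around each point; the names `cluster_by_overlap`, `radius` and `min_size` are hypothetical, and the pairwise loop is quadratic, so it does not reflect the low complexity claimed for CDNASA itself.

```python
import math


def cluster_by_overlap(points, radius, min_size=2):
    """Merge points whose radius-balls overlap; groups below min_size are noise (-1).

    Assumption: a point's "space" is a ball of the given radius, so two
    spaces overlap when the centres are at most 2 * radius apart.
    """
    n = len(points)
    parent = list(range(n))  # union-find forest, one tree per cluster

    def find(i):
        # Follow parent links to the root, halving the path as we go.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # Merge every pair of points whose balls overlap.
    for i in range(n):
        for j in range(i + 1, n):
            if math.dist(points[i], points[j]) <= 2 * radius:
                ri, rj = find(i), find(j)
                if ri != rj:
                    parent[rj] = ri

    roots = [find(i) for i in range(n)]
    sizes = {}
    for r in roots:
        sizes[r] = sizes.get(r, 0) + 1

    # Number clusters in order of first appearance; unmerged points get -1.
    labels, ids = [], {}
    for r in roots:
        if sizes[r] < min_size:
            labels.append(-1)
        else:
            ids.setdefault(r, len(ids))
            labels.append(ids[r])
    return labels


points = [(0, 0), (1, 0), (2, 0), (10, 10), (11, 10), (50, 50)]
print(cluster_by_overlap(points, radius=0.6))
```

In this toy input the first three points chain together even though the outer two do not overlap directly, which is how merging by overlap can follow elongated or curved shapes; the isolated point at (50, 50) overlaps nothing and is labelled noise.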