首页 | 本学科首页   官方微博 | 高级检索  
   检索      


Clustering in the presence of scatter
Authors:Maitra Ranjan  Ramler Ivan P
Institution:Department of Statistics, Iowa State University, Iowa 50011-1210, U.S.A.
Abstract:Summary :  A new methodology is proposed for clustering datasets in the presence of scattered observations. Scattered observations are defined as unlike any other, so traditional approaches that force them into groups can lead to erroneous conclusions. Our suggested approach is a scheme which, under assumption of homogeneous spherical clusters, iteratively builds cores around their centers and groups points within each core while identifying points outside as scatter. In the absence of scatter, the algorithm reduces to k -means. We also provide methodology to initialize the algorithm and to estimate the number of clusters in the dataset. Results in experimental situations show excellent performance, especially when clusters are elliptically symmetric. The methodology is applied to the analysis of the United States Environmental Protection Agency's Toxic Release Inventory reports on industrial releases of mercury for the year 2000.
Keywords:Bayes' information criterion  Biweight estimator  Exact-c-separation              k-clips              MCLUST            Methylmercury  Tight clustering
本文献已被 PubMed 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号