首页 | 本学科首页   官方微博 | 高级检索  
   检索      


Ensemble clustering for step data via binning
Authors:Ja‐Yoon Jang  Hee‐Seok Oh  Yaeji Lim  Ying Kuen Cheung
Abstract:This paper considers the clustering problem of physical step count data recorded on wearable devices. Clustering step data give an insight into an individual's activity status and further provide the groundwork for health‐related policies. However, classical methods, such as K‐means clustering and hierarchical clustering, are not suitable for step count data that are typically high‐dimensional and zero‐inflated. This paper presents a new clustering method for step data based on a novel combination of ensemble clustering and binning. We first construct multiple sets of binned data by changing the size and starting position of the bin, and then merge the clustering results from the binned data using a voting method. The advantage of binning, as a critical component, is that it substantially reduces the dimension of the original data while preserving the essential characteristics of the data. As a result, combining clustering results from multiple binned data can provide an improved clustering result that reflects both local and global structures of the data. Simulation studies and real data analysis were carried out to evaluate the empirical performance of the proposed method and demonstrate its general utility.
Keywords:binning  clustering  ensemble clustering  functional data  K‐means  step data  wearable device
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号