Optimal search space for clustering gene expression data via consensus. |
| |
Authors: | Michael Hirsch Stephen Swift Xiohui Liu |
| |
Institution: | Department of Intelligent Data Analysis, Brunel University, Uxbridge, United Kingdom. Michael.Hirsch@brunel.ac.uk |
| |
Abstract: | Ensemble clustering methods have become increasingly important to ease the task of choosing the most appropriate cluster algorithm for a particular data analysis problem. The consensus clustering (CC) algorithm is a recognized ensemble clustering method that uses an artificial intelligence technique to optimize a fitness function. We formally prove the existence of a subspace of the search space for CC, which contains all solutions of maximal fitness and suggests two greedy algorithms to search this subspace. We evaluate the algorithms on two gene expression data sets and one synthetic data set, and compare the result with the results of other ensemble clustering approaches. |
| |
Keywords: | |
|
|