Highest Density Difference Region Estimation with Application to Flow Cytometric Data |
| |
Authors: | Tarn Duong Inge Koch M. P. Wand |
| |
Affiliation: | 1. Institut Pasteur, Groupe Imagerie et Modélisation;2. CNRS, URA 2582, F‐75015 Paris, France;3. School of Mathematics and Statistics, The University of New South Wales, Sydney 2052, Australia;4. School of Mathematics and Applied Statistics, University of Wollongong, Wollongong 2522, Australia |
| |
Abstract: | Motivated by the needs of scientists using flow cytometry, we study the problem of estimating the region where two multivariate samples differ in density. We call this problem highest density difference region estimation and recognise it as a two‐sample analogue of highest density region or excess set estimation. Flow cytometry samples are typically in the order of 10 000 and 100 000 and with dimension ranging from about 3 to 20. The industry standard for the problem being studied is called Frequency Difference Gating, due to Roederer and Hardy ( 2001 ). After couching the problem in a formal statistical framework we devise an alternative estimator that draws upon recent statistical developments such as patient rule induction methods. Improved performance is illustrated in simulations. While motivated by flow cytometry, the methodology is suitable for general multivariate random samples where density difference regions are of interest. |
| |
Keywords: | Flow cytometry Frequency difference gating Highest density region Multivariate density estimation Patient rule induction method |
|
|