Variable selection in high-dimensional multivariate binary data with application to the analysis of microbial community DNA fingerprints |
| |
Authors: | Wilbur J D Ghosh J K Nakatsu C H Brouder S M Doerge R W |
| |
Institution: | Department of Statistics, Purdue University, West Lafayette, Indiana 47907-1399, USA. |
| |
Abstract: | In order to understand the relevance of microbial communities on crop productivity, the identification and characterization of the rhizosphere soil microbial community is necessary. Characteristic profiles of the microbial communities are obtained by denaturing gradient gel electrophoresis (DGGE) of polymerase chain reaction (PCR) amplified 16S rDNA from soil extracted DNA. These characteristic profiles, commonly called community DNA fingerprints, can be represented in the form of high-dimensional binary vectors. We address the problem of modeling and variable selection in high-dimensional multivariate binary data and present an application of our methodology in the context of a controlled agricultural experiment. |
| |
Keywords: | Classification DNA fingerprints High-dimensional data Microbial communities Multivariate binary data Permutation tests Variable selection |
本文献已被 PubMed 等数据库收录! |