首页 | 本学科首页   官方微博 | 高级检索  
     


Statistical significance for hierarchical clustering in genetic association and microarray expression studies
Authors:Mark?A?Levenstien  author-information"  >  author-information__contact u-icon-before"  >  mailto:markl@linkage.rockefeller.edu"   title="  markl@linkage.rockefeller.edu"   itemprop="  email"   data-track="  click"   data-track-action="  Email author"   data-track-label="  "  >Email author,Yaning?Yang,Jürg?Ott
Affiliation:(1) Laboratory of Statistical Genetics Rockefeller, University New York, New York, NY, 10021, United States
Abstract:

Background  

With the increasing amount of data generated in molecular genetics laboratories, it is often difficult to make sense of results because of the vast number of different outcomes or variables studied. Examples include expression levels for large numbers of genes and haplotypes at large numbers of loci. It is then natural to group observations into smaller numbers of classes that allow for an easier overview and interpretation of the data. This grouping is often carried out in multiple steps with the aid of hierarchical cluster analysis, each step leading to a smaller number of classes by combining similar observations or classes. At each step, either implicitly or explicitly, researchers tend to interpret results and eventually focus on that set of classes providing the "best" (most significant) result. While this approach makes sense, the overall statistical significance of the experiment must include the clustering process, which modifies the grouping structure of the data and often removes variation.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号