首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
由镰孢菌引起的赤霉病是小麦的重要病害,其抗性比较复杂。准确可靠的鉴定和评价方法是抗性改良成功的前提。评述了小麦赤霉病抗性类型及其不同赤霉病抗性鉴定和评价方法的优缺点,重点讨论了抗性类型与抗性鉴定方法的对应性。希望对赤霉病表型鉴定、不同抗性类型的理解和评价以及抗性改良有借鉴意义。  相似文献   

2.
Different analytical techniques used on the same data set may lead to different conclusions about the existence and strength of genetic structure. Therefore, reliable interpretation of the results from different methods depends on the efficacy and reliability of different statistical methods. In this paper, we evaluated the performance of multiple analytical methods to detect the presence of a linear barrier dividing populations. We were specifically interested in determining if simulation conditions, such as dispersal ability and genetic equilibrium, affect the power of different analytical methods for detecting barriers. We evaluated two boundary detection methods (Monmonier's algorithm and WOMBLING), two spatial Bayesian clustering methods (TESS and GENELAND), an aspatial clustering approach (STRUCTURE), and two recently developed, non-Bayesian clustering methods [PSMIX and discriminant analysis of principal components (DAPC)]. We found that clustering methods had higher success rates than boundary detection methods and also detected the barrier more quickly. All methods detected the barrier more quickly when dispersal was long distance in comparison to short-distance dispersal scenarios. Bayesian clustering methods performed best overall, both in terms of highest success rates and lowest time to barrier detection, with GENELAND showing the highest power. None of the methods suggested a continuous linear barrier when the data were generated under an isolation-by-distance (IBD) model. However, the clustering methods had higher potential for leading to incorrect barrier inferences under IBD unless strict criteria for successful barrier detection were implemented. Based on our findings and those of previous simulation studies, we discuss the utility of different methods for detecting linear barriers to gene flow.  相似文献   

3.
Family and twin studies consistently demonstrate a significant role for genetic factors in the aetiology of the reading disorder dyslexia. However, dyslexia is complex at both the genetic and phenotypic levels, and currently the nature of the core deficit or deficits remains uncertain. Traditional approaches for mapping disease genes, originally developed for single-gene disorders, have limited success when there is not a simple relationship between genotype and phenotype. Recent advances in high-throughput genotyping technology and quantitative statistical methods have made a new approach to identifying genes involved in complex disorders possible. The method involves assessing the genetic similarity of many sibling pairs along the lengths of all their chromosomes and attempting to correlate this similarity with that of their phenotypic scores. We are adopting this approach in an ongoing genome-wide search for genes involved in dyslexia susceptibility, and have already successfully applied the method by replicating results from previous studies suggesting that a quantitative trait locus at 6p21.3 influences reading disability.  相似文献   

4.
Organisms sampled for population‐level research are typically assigned to species by morphological criteria. However, if those criteria are limited to one sex or life stage, or the organisms come from a complex of closely related forms, the species assignments may misdirect analyses. The impact of such sampling can be assessed from the correspondence of genetic clusters, identified only from patterns of genetic variation, to the species identified using only phenotypic criteria. We undertook this protocol with the rock‐dwelling mbuna cichlids of Lake Malawi, for which species within genera are usually identified using adult male coloration patterns. Given high local endemism of male colour patterns, and considerable allele sharing among species, there persists considerable taxonomic uncertainty in these fishes. Over 700 individuals from a single transect were photographed, genotyped and separately assigned: (a) to morphospecies using photographs; and (b) to genetic clusters using five widely used methods. Overall, the correspondence between clustering methods was strong for larger clusters, but methods varied widely in estimated number of clusters. The correspondence between morphospecies and genetic clusters was also strong for larger clusters, as well as some smaller clusters for some methods. These analyses generally affirm (a) adult male‐limited sampling and (b) the taxonomic status of Lake Malawi mbuna, as the species in our study largely appear to be well‐demarcated genetic entities. More generally, our analyses highlight the challenges for clustering methods when the number of populations is unknown, especially in cases of highly uneven sample sizes.  相似文献   

5.
Recent advances in genotyping methodologies have allowed genome-wide association studies (GWAS) to accurately identify genetic variants that associate with common or pathological complex traits. Although most GWAS have focused on associations with single genetic variants, joint identification of multiple genetic variants, and how they interact, is essential for understanding the genetic architecture of complex phenotypic traits. Here, we propose an efficient stepwise method based on the Cochran-Mantel-Haenszel test (for stratified categorical data) to identify causal joint multiple genetic variants in GWAS. This method combines the CMH statistic with a stepwise procedure to detect multiple genetic variants associated with specific categorical traits, using a series of associated I × J contingency tables and a null hypothesis of no phenotype association. Through a new stratification scheme based on the sum of minor allele count criteria, we make the method more feasible for GWAS data having sample sizes of several thousands. We also examine the properties of the proposed stepwise method via simulation studies, and show that the stepwise CMH test performs better than other existing methods (e.g., logistic regression and detection of associations by Markov blanket) for identifying multiple genetic variants. Finally, we apply the proposed approach to two genomic sequencing datasets to detect linked genetic variants associated with bipolar disorder and obesity, respectively.  相似文献   

6.
Statistical species delimitation usually relies on singular data, primarily genetic, for detecting putative species and individual assignment to putative species. Given the variety of speciation mechanisms, singular data may not adequately represent the genetic, morphological and ecological diversity relevant to species delimitation. We describe a methodological framework combining multivariate and clustering techniques that uses genetic, morphological and ecological data to detect and assign individuals to putative species. Our approach recovers a similar number of species recognized using traditional, qualitative taxonomic approaches that are not detected when using purely genetic methods. Furthermore, our approach detects groupings that traditional, qualitative taxonomic approaches do not. This empirical test suggests that our approach to detecting and assigning individuals to putative species could be useful in species delimitation despite varying levels of differentiation across genetic, phenotypic and ecological axes. This work highlights a critical, and often overlooked, aspect of the process of statistical species delimitation—species detection and individual assignment. Irrespective of the species delimitation approach used, all downstream processing relies on how individuals are initially assigned, and the practices and statistical issues surrounding individual assignment warrant careful consideration.  相似文献   

7.
Nested effects models for high-dimensional phenotyping screens   总被引:2,自引:0,他引:2  
MOTIVATION: In high-dimensional phenotyping screens, a large number of cellular features is observed after perturbing genes by knockouts or RNA interference. Comprehensive analysis of perturbation effects is one of the most powerful techniques for attributing functions to genes, but not much work has been done so far to adapt statistical and computational methodology to the specific needs of large-scale and high-dimensional phenotyping screens. RESULTS: We introduce and compare probabilistic methods to efficiently infer a genetic hierarchy from the nested structure of observed perturbation effects. These hierarchies elucidate the structures of signaling pathways and regulatory networks. Our methods achieve two goals: (1) they reveal clusters of genes with highly similar phenotypic profiles, and (2) they order (clusters of) genes according to subset relationships between phenotypes. We evaluate our algorithms in the controlled setting of simulation studies and show their practical use in two experimental scenarios: (1) a data set investigating the response to microbial challenge in Drosophila melanogaster, and (2) a compendium of expression profiles of Saccharomyces cerevisiae knockout strains. We show that our methods identify biologically justified genetic hierarchies of perturbation effects. AVAILABILITY: The software used in our analysis is freely available in the R package 'nem' from www.bioconductor.org.  相似文献   

8.
We have developed an open-source database system named “Pheno-Pub” to support a series of data-handling and publication tasks, including statistical analyses, data review, and web site construction, for mouse phenotyping experiments. This system is composed of three applications. “Mou-Stat” provides semiautomatic statistical analyses for a batch of phenotypic data, including a variety of conditions for group comparisons (e.g., different scales of measurement parameters). “Genotype Viewer” and “Strain Viewer” provide representation of genotype-driven and measurement parameter-driven views of phenotypic data; they highlight significant differences in genotypes and between strains, respectively. Direct links from the Strain Viewer web site to the Genotype Viewer web site provide flexible navigation in the exploration of phenotypic data. With these publication tools, phenotypic data can be made available on the Internet by simple operations. This system is expandable for a wide range of uses in phenotypic comparative analyses, including comparisons among different genotypes and strains and comparisons among groups exposed to different environmental conditions. Finally, Pheno-Pub provides advanced usability for both producers of experimental data and consumers of phenotypic information. Therefore, Pheno-Pub contributes significantly to the publication of data in various fields of phenotyping research and to broad data sharing, thereby promoting the understanding of the functions of the entire mouse genome.  相似文献   

9.

Background

Although substance use disorders (SUDs) are heritable, few genetic risk factors for them have been identified, in part due to the small sample sizes of study populations. To address this limitation, researchers have aggregated subjects from multiple existing genetic studies, but these subjects can have missing phenotypic information, including diagnostic criteria for certain substances that were not originally a focus of study. Recent advances in addiction neurobiology have shown that comorbid SUDs (e.g., the abuse of multiple substances) have similar genetic determinants, which makes it possible to infer missing SUD diagnostic criteria using criteria from another SUD and patient genotypes through statistical modeling.

Results

We propose a new approach based on matrix completion techniques to integrate features of comorbid health conditions and individual’s genotypes to infer unreported diagnostic criteria for a disorder. This approach optimizes a bi-linear model that uses the interactions between known disease correlations and candidate genes to impute missing criteria. An efficient stochastic and parallel algorithm was developed to optimize the model with a speed 20 times greater than the classic sequential algorithm. It was tested on 3441 subjects who had both cocaine and opioid use disorders and successfully inferred missing diagnostic criteria with consistently better accuracy than other recent statistical methods.

Conclusions

The proposed matrix completion imputation method is a promising tool to impute unreported or unobserved symptoms or criteria for disease diagnosis. Integrating data at multiple scales or from heterogeneous sources may help improve the accuracy of phenotype imputation.
  相似文献   

10.
Functional dependencies between genes are a defining characteristic of gene networks underlying quantitative traits. However, recent studies show that the proportion of the genetic variation that can be attributed to statistical epistasis varies from almost zero to very high. It is thus of fundamental as well as instrumental importance to better understand whether different functional dependency patterns among polymorphic genes give rise to distinct statistical interaction patterns or not. Here we address this issue by combining a quantitative genetic model approach with genotype-phenotype models capable of translating allelic variation and regulatory principles into phenotypic variation at the level of gene expression. We show that gene regulatory networks with and without feedback motifs can exhibit a wide range of possible statistical genetic architectures with regard to both type of effect explaining phenotypic variance and number of apparent loci underlying the observed phenotypic effect. Although all motifs are capable of harboring significant interactions, positive feedback gives rise to higher amounts and more types of statistical epistasis. The results also suggest that the inclusion of statistical interaction terms in genetic models will increase the chance to detect additional QTL as well as functional dependencies between genetic loci over a broad range of regulatory regimes. This article illustrates how statistical genetic methods can fruitfully be combined with nonlinear systems dynamics to elucidate biological issues beyond reach of each methodology in isolation.  相似文献   

11.
More accurate and precise phenotyping strategies are necessary to empower high-resolution linkage mapping and genome-wide association studies and for training genomic selection models in plant improvement. Within this framework, the objective of modern phenotyping is to increase the accuracy, precision and throughput of phenotypic estimation at all levels of biological organization while reducing costs and minimizing labor through automation, remote sensing, improved data integration and experimental design. Much like the efforts to optimize genotyping during the 1980s and 1990s, designing effective phenotyping initiatives today requires multi-faceted collaborations between biologists, computer scientists, statisticians and engineers. Robust phenotyping systems are needed to characterize the full suite of genetic factors that contribute to quantitative phenotypic variation across cells, organs and tissues, developmental stages, years, environments, species and research programs. Next-generation phenotyping generates significantly more data than previously and requires novel data management, access and storage systems, increased use of ontologies to facilitate data integration, and new statistical tools for enhancing experimental design and extracting biologically meaningful signal from environmental and experimental noise. To ensure relevance, the implementation of efficient and informative phenotyping experiments also requires familiarity with diverse germplasm resources, population structures, and target populations of environments. Today, phenotyping is quickly emerging as the major operational bottleneck limiting the power of genetic analysis and genomic prediction. The challenge for the next generation of quantitative geneticists and plant breeders is not only to understand the genetic basis of complex trait variation, but also to use that knowledge to efficiently synthesize twenty-first century crop varieties.  相似文献   

12.
The availability of high-density single nucleotide polymorphisms (SNPs) data has made the human genetic association studies possible to identify common and rare variants underlying complex diseases in a genome-wide scale. A handful of novel genetic variants have been identified, which gives much hope and prospects for the future of genetic association studies. In this process, statistical and computational methods play key roles, among which information-based association tests have gained large popularity. This paper is intended to give a comprehensive review of the current literature in genetic association analysis casted in the framework of information theory. We focus our review on the following topics: (1) information theoretic approaches in genetic linkage and association studies; (2) entropy-based strategies for optimal SNP subset selection; and (3) the usage of theoretic information criteria in gene clustering and gene regulatory network construction.  相似文献   

13.
As the ability to measure dense genetic markers approaches the limit of the DNA sequence itself, taking advantage of possible clustering of genetic variants in, and around, a gene would benefit genetic association analyses, and likely provide biological insights. The greatest benefit might be realized when multiple rare variants cluster in a functional region. Several statistical tests have been developed, one of which is based on the popular Kulldorff scan statistic for spatial clustering of disease. We extended another popular spatial clustering method—Tango’s statistic—to genomic sequence data. An advantage of Tango’s method is that it is rapid to compute, and when single test statistic is computed, its distribution is well approximated by a scaled χ 2 distribution, making computation of p values very rapid. We compared the Type-I error rates and power of several clustering statistics, as well as the omnibus sequence kernel association test. Although our version of Tango’s statistic, which we call “Kernel Distance” statistic, took approximately half the time to compute than the Kulldorff scan statistic, it had slightly less power than the scan statistic. Our results showed that the Ionita-Laza version of Kulldorff’s scan statistic had the greatest power over a range of clustering scenarios.  相似文献   

14.
Stomatal density (SD) and stomatal complex area (SCA) are important traits that regulate gas exchange and abiotic stress response in plants. Despite sorghum (Sorghum bicolor) adaptation to arid conditions, the genetic potential of stomata-related traits remains unexplored due to challenges in available phenotyping methods. Hence, identifying loci that control stomatal traits is fundamental to designing strategies to breed sorghum with optimized stomatal regulation. We implemented both classical and deep learning methods to characterize genetic diversity in 311 grain sorghum accessions for stomatal traits at two different field environments. Nearly 12,000 images collected from abaxial (Ab) and adaxial (Ad) leaf surfaces revealed substantial variation in stomatal traits. Our study demonstrated significant accuracy between manual and deep learning methods in predicting SD and SCA. In sorghum, SD was 32%–39% greater on the Ab versus the Ad surface, while SCA on the Ab surface was 2%–5% smaller than on the Ad surface. Genome-Wide Association Study identified 71 genetic loci (38 were environment-specific) with significant genotype to phenotype associations for stomatal traits. Putative causal genes underlying the phenotypic variation were identified. Accessions with similar SCA but carrying contrasting haplotypes for SD were tested for stomatal conductance and carbon assimilation under field conditions. Our findings provide a foundation for further studies on the genetic and molecular mechanisms controlling stomata patterning and regulation in sorghum. An integrated physiological, deep learning, and genomic approach allowed us to unravel the genetic control of natural variation in stomata traits in sorghum, which can be applied to other plants.

High-throughput phenotyping using deep learning tools integrated with genome-wide association studies revealed genes that control SD and area in grain sorghum.  相似文献   

15.
We conducted a 10-centimorgan linkage autosomal genome scan in a set of 19 extended American pedigrees (219 subjects) ascertained through probands with panic disorder. Several anxiety disorders--including social phobia, agoraphobia, and simple phobia--in addition to panic disorder segregate in these families. In previous studies of this sample, linkage analyses were based separately on each of the individual categorical affection diagnoses. Given the substantial comorbidity between anxiety disorders and their probable shared genetic liability, it is clear that this method discards a considerable amount of information. In this article, we propose a new approach that considers panic disorder, simple phobia, social phobia, and agoraphobia as expressions of the same multivariate, putatively genetically influenced trait. We applied the most powerful multipoint Haseman-Elston method, using the grade of membership score generated from a fuzzy clustering of these phenotypes as the dependent variable in Haseman-Elston regression. One region on chromosome 4q31-q34, at marker D4S413 (with multipoint and single-point nominal P values < .00001), showed strong evidence of linkage (genomewide significance at P<.05). The same region is known to be the site of a neuropeptide Y receptor gene, NPY1R (4q31-q32), that was recently connected to anxiolytic-like effects in rats. Several other regions on four chromosomes (4q21.21-22.3, 5q14.2-14.3, 8p23.1, and 14q22.3-23.3) met criteria for suggestive linkage (multipoint nominal P values < .01). Family-by-family analysis did not show any strong evidence of heterogeneity. Our findings support the notion that the major anxiety disorders, including phobias and panic disorder, are complex traits that share at least one susceptibility locus. This method could be applied to other complex traits for which shared genetic-liability factors are thought to be important, such as substance dependencies.  相似文献   

16.
Statistical inference for simultaneous clustering of gene expression data   总被引:1,自引:0,他引:1  
Current methods for analysis of gene expression data are mostly based on clustering and classification of either genes or samples. We offer support for the idea that more complex patterns can be identified in the data if genes and samples are considered simultaneously. We formalize the approach and propose a statistical framework for two-way clustering. A simultaneous clustering parameter is defined as a function theta=Phi(P) of the true data generating distribution P, and an estimate is obtained by applying this function to the empirical distribution P(n). We illustrate that a wide range of clustering procedures, including generalized hierarchical methods, can be defined as parameters which are compositions of individual mappings for clustering patients and genes. This framework allows one to assess classical properties of clustering methods, such as consistency, and to formally study statistical inference regarding the clustering parameter. We present results of simulations designed to assess the asymptotic validity of different bootstrap methods for estimating the distribution of Phi(P(n)). The method is illustrated on a publicly available data set.  相似文献   

17.
S Wilkinson  C Haley  L Alderson  P Wiener 《Heredity》2011,106(2):261-269
Recently developed Bayesian genotypic clustering methods for analysing genetic data offer a powerful tool to evaluate the genetic structure of domestic farm animal breeds. The unit of study with these approaches is the individual instead of the population. We aimed to empirically evaluate various individual-based population genetic statistical methods for characterization of genetic diversity and structure of livestock breeds. Eighteen British pig populations, comprising 819 individuals, were genotyped at 46 microsatellite markers. Three Bayesian genotypic clustering approaches, principle component analysis (PCA) and phylogenetic reconstruction were applied to individual multilocus genotypes to infer the genetic structure and diversity of the British pig breeds. Comparisons of the three Bayesian genotypic clustering methods (, and ) revealed some broad similarities but also some notable differences. Overall, the methods agreed that majority of the British pig breeds are independent genetic units with little evidence of admixture. The three Bayesian genotypic clustering methods provided complementary, biologically credible clustering solutions but at different levels of resolution. detected finer genetic differentiation and in some cases, populations within breeds. Consequently, it estimated a greater number of underlying genetic populations (K, in the notation of Bayesian clustering methods). Two of the Bayesian methods ( and ) and phylogenetic reconstruction provided similar success in assignment of individuals, supporting the use of these methods for breed assignment.  相似文献   

18.
19.
OBJECTIVE: The potential value of haplotypes has attracted widespread interest in the mapping of complex traits. Haplotype sharing methods take the linkage disequilibrium information between multiple markers into account, and may have good power to detect predisposing genes. We present a new approach based on Mantel statistics for spacetime clustering, which is developed in order to improve the power of haplotype sharing analysis for gene mapping in complex disease. METHODS: The new statistic correlates genetic similarity and phenotypic similarity across pairs of haplotypes for case-only and case-control studies. The genetic similarity is measured as the shared length between haplotypes around a putative disease locus. The phenotypic similarity is measured as the mean-corrected cross-product based on the respective phenotypes. We analyzed two tests for statistical significance with respect to type I error: (1) assuming asymptotic normality, and (2) using a Monte Carlo permutation procedure. The results were compared to the chi(2) test for association based on 3-marker haplotypes. RESULTS: The results of the type I error rates for the Mantel statistics using the permutational procedure yielded pointwise valid tests. The approach based on the assumption of asymptotic normality was seriously liberal. CONCLUSION: Power comparisons showed that the Mantel statistics were better than or equal to the chi(2) test for all simulated disease models.  相似文献   

20.
Readily-accessible and standardised capture of genotypic variation has revolutionised our understanding of the genetic contribution to disease. Unfortunately, the corresponding systematic capture of patient phenotypic variation needed to fully interpret the impact of genetic variation has lagged far behind. Exploiting deep and systematic phenotyping of a cohort of 197 patients presenting with heterogeneous developmental disorders and whose genomes harbour de novo CNVs, we systematically applied a range of commonly-used functional genomics approaches to identify the underlying molecular perturbations and their phenotypic impact. Grouping patients into 408 non-exclusive patient-phenotype groups, we identified a functional association amongst the genes disrupted in 209 (51%) groups. We find evidence for a significant number of molecular interactions amongst the association-contributing genes, including a single highly-interconnected network disrupted in 20% of patients with intellectual disability, and show using microcephaly how these molecular networks can be used as baits to identify additional members whose genes are variant in other patients with the same phenotype. Exploiting the systematic phenotyping of this cohort, we observe phenotypic concordance amongst patients whose variant genes contribute to the same functional association but note that (i) this relationship shows significant variation across the different approaches used to infer a commonly perturbed molecular pathway, and (ii) that the phenotypic similarities detected amongst patients who share the same inferred pathway perturbation result from these patients sharing many distinct phenotypes, rather than sharing a more specific phenotype, inferring that these pathways are best characterized by their pleiotropic effects.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号