Similar literature
1.
Recent studies in populations of European ancestry have shown that 30%–50% of the heritability of human complex traits such as height and body mass index, and of common diseases such as schizophrenia and rheumatoid arthritis, can be captured by common SNPs, and that the genetic variation attributed to chromosomes is in proportion to their length. Using genome-wide estimation and partitioning approaches, we analysed 49 human quantitative traits, many of which are relevant to human diseases, in 7,170 unrelated Korean individuals genotyped on 326,262 SNPs. For 43 of the 49 traits, we estimated a nominally significant (P<0.05) proportion of variance explained by all SNPs on the Affymetrix 5.0 genotyping array. On average across the 47 of the 49 traits for which this estimate is non-zero, common SNPs explain approximately one-third (range of 7.8% to 76.8%) of narrow-sense heritability. The estimate is highly correlated with the proportion of SNPs with association P<0.031 (r² = 0.92). Longer genomic segments tend to explain more phenotypic variation, with a correlation of 0.78 between the estimate of variance explained by individual chromosomes and their physical length, and 1% of the genome explains approximately 1% of the genetic variance. Despite the fact that there are a few SNPs with large effects for some traits, these results suggest that polygenicity is ubiquitous for most human complex traits and that a substantial proportion of the “missing heritability” is captured by common SNPs.

2.
The underlying basis of genetic variation in quantitative traits, in terms of the number of causal variants and the size of their effects, is largely unknown in natural populations. The expectation is that complex quantitative trait variation is attributable to many, possibly interacting, causal variants, whose effects may depend upon the sex, age and the environment in which they are expressed. A recently developed methodology in animal breeding derives a value of relatedness among individuals from high‐density genomic marker data, to estimate additive genetic variance within livestock populations. Here, we adapt and test the effectiveness of these methods to partition genetic variation for complex traits across genomic regions within ecological study populations where individuals have varying degrees of relatedness. We then apply this approach for the first time to a natural population and demonstrate that genetic variation in wing length in the great tit (Parus major) reflects contributions from multiple genomic regions. We show that a polygenic additive mode of gene action best describes the patterns observed, and we find no evidence of dosage compensation for the sex chromosome. Our results suggest that most of the genomic regions that influence wing length have the same effects in both sexes. We found a limited amount of genetic variance in males that is attributed to regions that have no effects in females, which could facilitate the sexual dimorphism observed for this trait. Although this exploratory work focuses on one complex trait, the methodology is generally applicable to any trait for any laboratory or wild population, paving the way for investigating sex‐, age‐ and environment‐specific genetic effects and thus the underlying genetic architecture of phenotype in biological study systems.

3.
Domestic dogs exhibit tremendous phenotypic diversity, including a greater variation in body size than any other terrestrial mammal. Here, we generate a high density map of canine genetic variation by genotyping 915 dogs from 80 domestic dog breeds, 83 wild canids, and 10 outbred African shelter dogs across 60,968 single-nucleotide polymorphisms (SNPs). Coupling this genomic resource with external measurements from breed standards and individuals as well as skeletal measurements from museum specimens, we identify 51 regions of the dog genome associated with phenotypic variation among breeds in 57 traits. The complex traits include average breed body size and external body dimensions and cranial, dental, and long bone shape and size with and without allometric scaling. In contrast to the results from association mapping of quantitative traits in humans and domesticated plants, we find that across dog breeds, a small number of quantitative trait loci (≤3) explain the majority of phenotypic variation for most of the traits we studied. In addition, many genomic regions show signatures of recent selection, with most of the highly differentiated regions being associated with breed-defining traits such as body size, coat characteristics, and ear floppiness. Our results demonstrate the efficacy of mapping multiple traits in the domestic dog using a database of genotyped individuals and highlight the important role human-directed selection has played in altering the genetic architecture of key traits in this important species.

4.
Large genome-wide association studies (GWAS) have identified many genetic loci associated with risk for myocardial infarction (MI) and coronary artery disease (CAD). Concurrently, efforts such as the National Institutes of Health (NIH) Roadmap Epigenomics Project and the Encyclopedia of DNA Elements (ENCODE) Consortium have provided unprecedented data on functional elements of the human genome. In the present study, we systematically investigate the biological link between genetic variants associated with this complex disease and their impacts on gene function. First, we examined the heritability of MI/CAD according to genomic compartments. We observed that single nucleotide polymorphisms (SNPs) residing within nearby regulatory regions show significant polygenicity and contribute 59–71% of the heritability for MI/CAD. Second, we showed that the polygenicity and heritability explained by these SNPs are enriched in histone modification marks in specific cell types. Third, we found that, among 45 MI/CAD-associated SNPs identified in large-scale GWAS, a statistically significant excess reside within certain functional elements of the genome, particularly in active enhancer and promoter regions. Finally, we observed significant heterogeneity of this signal across cell types, with strong signals observed within adipose nuclei, as well as brain and spleen cell types. These results suggest that the genetic etiology of MI/CAD is largely explained by tissue-specific regulatory perturbation within the human genome.

5.
The interplay between dynamic models of biological systems and genomics is based on the assumption that genetic variation of the complex trait (i.e., outcome of model behavior) arises from component traits (i.e., model parameters) in lower hierarchical levels. In order to provide a proof of concept of this statement for a cattle growth model, we ask whether model parameters map genomic regions that harbor quantitative trait loci (QTLs) already described for the complex trait. We conducted a genome-wide association study (GWAS) with a Bayesian hierarchical LASSO method in two parameters of the Davis Growth Model, a system of three ordinary differential equations describing DNA accretion, protein synthesis and degradation, and fat synthesis. Phenotypic and genotypic data were available for 893 Nellore (Bos indicus) cattle. Computed values were 0.005 ± 0.003 for parameter k1 (DNA accretion rate) and 0.134 ± 0.024 for α (constant for energy for maintenance requirement). The expected biological interpretation of the parameters is confirmed by QTLs mapped for k1 and α. QTLs within genomic regions mapped for k1 are expected to be correlated with the DNA pool: body size and weight. Single nucleotide polymorphisms (SNPs) which were significant for α mapped QTLs that had already been associated with residual feed intake, feed conversion ratio, average daily gain (ADG), body weight, and also dry matter intake. SNPs identified for k1 were able to additionally explain 2.2% of the phenotypic variability of the complex ADG, even when SNPs for k1 did not match the genomic regions associated with ADG. Although improvements are needed, our findings suggest that genomic analysis on component traits may help to uncover the genetic basis of more complex traits, particularly when lower biological hierarchies are mechanistically described by mathematical simulation models.

6.
Natural populations exhibit substantial variation in quantitative traits. A quantitative trait is typically defined by its mean and variance, and to date most genetic mapping studies focus on loci altering trait means but not (co)variances. For single traits, the control of trait variance across genetic backgrounds is referred to as genetic canalization. With multiple traits, the genetic covariance among different traits in the same environment indicates the magnitude of potential genetic constraint, while genotype-by-environment interaction (GxE) concerns the same trait across different environments. While some have suggested that these three attributes of quantitative traits are different views of similar concepts, it is not yet clear whether they have the same underlying genetic mechanism. Here, we detect quantitative trait loci (QTL) influencing the (co)variance of phenological traits in six distinct environments in Boechera stricta, a close relative of Arabidopsis. We identified nFT as the QTL altering the magnitude of phenological trait canalization, genetic constraint, and GxE. Both the magnitude and direction of nFT's canalization effects depend on the environment, and to our knowledge, this reversibility of canalization across environments has not been reported previously. nFT's effects on trait covariance structure (genetic constraint and GxE) likely result from the variable and reversible canalization effects across different traits and environments, which can be explained by the interaction among nFT, genomic backgrounds, and environmental stimuli. This view is supported by experiments demonstrating significant nFT by genomic background epistatic interactions affecting phenological traits and expression of the candidate gene, FT. In contrast to the well-known canalization gene Hsp90, the case of nFT may exemplify an alternative mechanism: our results suggest that (at least in traits with major signal integrators such as flowering time) genetic canalization, genetic constraint, and GxE may have related genetic mechanisms resulting from interactions among major QTL, genomic backgrounds, and environments.

7.
Identifying causal genetic variants underlying heritable phenotypic variation is a long‐standing goal in evolutionary genetics. We previously identified several quantitative trait loci (QTL) for five morphological traits in a captive population of zebra finches (Taeniopygia guttata) by whole‐genome linkage mapping. We here follow up on these studies with the aim to narrow down on the quantitative trait variants (QTN) in one wild and three captive populations. First, we performed an association study using 672 single nucleotide polymorphisms (SNPs) within candidate genes located in the previously identified QTL regions in a sample of 939 wild‐caught zebra finches. Then, we validated the most promising SNP–phenotype associations (n = 25 SNPs) in 5228 birds from four populations. Genotype–phenotype associations were generally weak in the wild population, where linkage disequilibrium (LD) spans only short genomic distances. In contrast, in captive populations, where LD blocks are large, apparent SNP effects on morphological traits (i.e. associations) were highly repeatable with independent data from the same population. Most of those SNPs also showed significant associations with the same trait in other captive populations, but the direction and magnitude of these effects varied among populations. This suggests that the tested SNPs are not the causal QTN but rather physically linked to them, and that LD between SNPs and causal variants differs between populations due to founder effects. While the identification of QTN remains challenging in nonmodel organisms, we illustrate that it is indeed possible to confirm the location and magnitude of QTL in a population with stable linkage between markers and causal variants.

8.
Targeted genomic selection methodologies, or sequence capture, allow for DNA enrichment and large-scale resequencing and characterization of natural genetic variation in species with complex genomes, such as rapeseed canola (Brassica napus L., AACC, 2n=38). The main goal of this project was to combine sequence capture with next generation sequencing (NGS) to discover single nucleotide polymorphisms (SNPs) in specific areas of the B. napus genome historically associated (via quantitative trait loci, QTL, analysis) with traits of agronomical and nutritional importance. A 2.1 million feature sequence capture platform was designed to interrogate DNA sequence variation across 47 specific genomic regions, representing 51.2 Mb of the Brassica A and C genomes, in ten diverse rapeseed genotypes. All ten genotypes were sequenced using the 454 Life Sciences chemistry and, to assess the effect of increased sequence depth, two genotypes were also sequenced using Illumina HiSeq chemistry. As a result, 589,367 potentially useful SNPs were identified. Analysis of sequence coverage indicated a four-fold increased representation of target regions, with 57% of the filtered SNPs falling within these regions. Sixty percent of discovered SNPs corresponded to transitions while 40% were transversions. Interestingly, 58% of the SNPs were found in genic regions while 42% were found in intergenic regions. Further, a high percentage of genic SNPs was found in exons (65% and 64% for the A and C genomes, respectively). Two different genotyping assays were used to validate the discovered SNPs. Validation rates ranged from 61.5% to 84% of tested SNPs, underpinning the effectiveness of this SNP discovery approach. Most importantly, the discovered SNPs were associated with agronomically important regions of the B. napus genome, generating a novel data resource for research and breeding of this crop species.
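The transition/transversion split reported above can be illustrated with a short sketch. It assumes only the standard definition (a transition swaps purine for purine or pyrimidine for pyrimidine; a transversion swaps between the two classes). The function names `classify_snp` and `ts_tv_fractions` are hypothetical and are not taken from the study.

```python
# Minimal sketch, assuming the textbook definition of transitions and
# transversions; function names are illustrative, not from the study.
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}
BASES = PURINES | PYRIMIDINES


def classify_snp(ref, alt):
    """Return 'transition' or 'transversion' for a biallelic SNP."""
    ref, alt = ref.upper(), alt.upper()
    if ref not in BASES or alt not in BASES or ref == alt:
        raise ValueError(f"not a valid SNP: {ref}>{alt}")
    # Same chemical class on both sides means a transition.
    same_class = (ref in PURINES) == (alt in PURINES)
    return "transition" if same_class else "transversion"


def ts_tv_fractions(snps):
    """Fraction of transitions and transversions in (ref, alt) pairs."""
    counts = {"transition": 0, "transversion": 0}
    for ref, alt in snps:
        counts[classify_snp(ref, alt)] += 1
    total = sum(counts.values())
    return {kind: n / total for kind, n in counts.items()}
```

Applied to a list of (reference, alternate) allele pairs, `ts_tv_fractions` yields the kind of 60%/40% breakdown quoted in the abstract.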

9.
High-resolution genetic maps are essential for fine mapping of complex traits, genome assembly, and comparative genomic analysis. Single-nucleotide polymorphisms (SNPs) are the primary molecular markers used for genetic map construction. In this study, we identified 13,362 SNPs evenly distributed across the Japanese flounder (Paralichthys olivaceus) genome. Of these SNPs, 12,712 high-confidence SNPs were subjected to high-throughput genotyping and assigned to 24 consensus linkage groups (LGs). The total length of the genetic linkage map was 3,497.29 cM with an average distance of 0.47 cM between loci, thereby representing the densest genetic map currently reported for Japanese flounder. Nine positive quantitative trait loci (QTLs) forming two main clusters for Vibrio anguillarum disease resistance were detected. All QTLs could explain 5.1–8.38% of the total phenotypic variation. Synteny analysis of the QTL regions on the genome assembly revealed 12 immune-related genes, among them 4 genes strongly associated with V. anguillarum disease resistance. In addition, 246 genome assembly scaffolds with an average size of 21.79 Mb were anchored onto the LGs; these scaffolds, comprising 522.99 Mb, represented 95.78% of assembled genomic sequences. The mapped assembly scaffolds in Japanese flounder were used for genome synteny analyses against zebrafish (Danio rerio) and medaka (Oryzias latipes). Flounder and medaka were found to possess almost one-to-one synteny, whereas flounder and zebrafish exhibited a multi-syntenic correspondence. The newly developed high-resolution genetic map, which will facilitate QTL mapping, scaffold assembly, and genome synteny analysis of Japanese flounder, marks a milestone in the ongoing genome project for this species.

10.
The migration of maize from tropical to temperate climates was accompanied by a dramatic evolution in flowering time. To gain insight into the genetic architecture of this adaptive trait, we conducted a 50K SNP-based genome-wide association and diversity investigation on a panel of tropical and temperate American and European representatives. Eighteen genomic regions were associated with flowering time. The number of early alleles cumulated along these regions was highly correlated with flowering time. Polymorphism in the vicinity of the ZCN8 gene, which is the closest maize homologue to the Arabidopsis major flowering time (FT) gene, had the strongest effect. This polymorphism is in the vicinity of the causal factor of the Vgt2 QTL. Diversity was lower, whereas differentiation and LD were higher for associated loci compared to the rest of the genome, which is consistent with selection acting on flowering time during maize migration. Selection tests also revealed supplementary loci that were highly differentiated among groups and not associated with flowering time in our panel, whereas they were in other linkage-based studies. This suggests that allele fixation led to a lack of statistical power when structure and relatedness were taken into account in a linear mixed model. Complementary designs and analysis methods are necessary to unravel the architecture of complex traits. Based on linkage disequilibrium (LD) estimates corrected for population structure, we concluded that the number of SNPs genotyped should be at least doubled to capture all QTLs contributing to the genetic architecture of polygenic traits in this panel. These results show that maize flowering time is controlled by numerous QTLs of small additive effect and that strong polygenic selection occurred under cool climatic conditions. They should contribute to more efficient genomic predictions of flowering time and facilitate the dissemination of diverse maize genetic resources under a wide range of environments.

11.
12.
Kostem E, Lozano JA, Eskin E. Genetics, 2011, 188(2): 449-460
Genome-wide association studies (GWASs) have been effectively identifying the genomic regions associated with a disease trait. In a typical GWAS, an informative subset of the single-nucleotide polymorphisms (SNPs), called tag SNPs, is genotyped in case/control individuals. Once the tag SNP statistics are computed, the genomic regions that are in linkage disequilibrium (LD) with the most significantly associated tag SNPs are believed to contain the causal polymorphisms. However, such LD regions are often large and contain many additional polymorphisms. Following up all the SNPs included in these regions is costly and infeasible for biological validation. In this article we address how to characterize these regions cost effectively with the goal of providing investigators a clear direction for biological validation. We introduce a follow-up study approach for identifying all untyped associated SNPs by selecting additional SNPs, called follow-up SNPs, from the associated regions and genotyping them in the original case/control individuals. We introduce a novel SNP selection method with the goal of maximizing the number of associated SNPs among the chosen follow-up SNPs. We show how the observed statistics of the original tag SNPs and human genetic variation reference data such as the HapMap Project can be utilized to identify the follow-up SNPs. We use simulated and real association studies based on the HapMap data and the Wellcome Trust Case Control Consortium to demonstrate that our method shows superior performance to the correlation- and distance-based traditional follow-up SNP selection approaches. Our method is publicly available at http://genetics.cs.ucla.edu/followupSNPs.
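To make the idea of combining tag SNP statistics with reference LD concrete, here is a deliberately simplified sketch. It is not the authors' selection method, which the abstract does not spell out. It is a plain heuristic that scores each untyped candidate by the largest LD-scaled tag statistic |r × z| and keeps the top k. The function name `rank_followup_snps` and both input layouts are assumptions made for illustration.

```python
# Simplified stand-in heuristic, NOT the published method: score each untyped
# candidate SNP by its largest LD-scaled tag statistic and keep the best k.
def rank_followup_snps(tag_z, ld_r, k):
    """
    tag_z: {tag_snp: observed association z-score}            (assumed layout)
    ld_r:  {candidate_snp: {tag_snp: LD correlation r}}       (assumed layout,
           e.g. derived from a reference panel)
    Returns the k candidate SNP ids with the highest scores.
    """
    scores = {}
    for cand, r_by_tag in ld_r.items():
        scores[cand] = max(
            (abs(r * tag_z[tag]) for tag, r in r_by_tag.items() if tag in tag_z),
            default=0.0,  # candidate in LD with no genotyped tag
        )
    # Sort by descending score; break ties by id for reproducibility.
    ranked = sorted(scores, key=lambda c: (-scores[c], c))
    return ranked[:k]
```

A ranking of this kind is close to the correlation-based baselines that the abstract reports being outperformed, so it is best read as a reference point rather than as the proposed approach.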

13.
Recent technological developments have facilitated an increased focus on identifying genomic regions underlying adaptive trait variation in natural populations, and it has been advocated that this information should be important for designating population units for conservation. In marine fishes, phenotypic studies have suggested adaptation through divergence of life-history traits among natural populations, but the distribution of adaptive genetic variation in these species is still relatively poorly known. In this study, we extract information about the geographical distribution of genetic variation for 33 single nucleotide polymorphisms (SNPs) associated with life-history trait candidate genes, and compare this to variation in 70 putatively neutral SNPs in Atlantic cod (Gadus morhua). We analyse samples covering the major population complexes in the eastern Atlantic and find strong evidence for non-neutral levels and patterns of population structuring for several of the candidate gene-associated markers, including two SNPs in the growth hormone 1 gene. Thus, this study aligns with findings from phenotypic studies, providing molecular data strongly suggesting that these or closely linked genes are under selection in natural populations of Atlantic cod. Furthermore, we find that patterns of variation in outlier markers do not align with those observed at selectively neutral markers, and that outlier markers identify conservation units on finer geographical scales than those revealed when analysing only neutral markers. Accordingly, results also suggest that information about adaptive genetic variation will be useful for targeted conservation and management in this and other marine species.

14.
Clutch size and egg mass are life history traits that have been extensively studied in wild bird populations, as life history theory predicts a negative trade‐off between them, either at the phenotypic or at the genetic level. Here, we analyse the genomic architecture of these heritable traits in a wild great tit (Parus major) population, using three marker‐based approaches – chromosome partitioning, quantitative trait locus (QTL) mapping and a genome‐wide association study (GWAS). The variance explained by each great tit chromosome scales with predicted chromosome size, no location in the genome contains genome‐wide significant QTL, and no individual SNPs are associated with a large proportion of phenotypic variation, all of which may suggest that variation in both traits is due to many loci of small effect, located across the genome. There is no evidence that any regions of the genome contribute significantly to both traits, which combined with a small, nonsignificant negative genetic covariance between the traits, suggests the absence of genetic constraints on the independent evolution of these traits. Our findings support the hypothesis that variation in life history traits in natural populations is likely to be determined by many loci of small effect spread throughout the genome, which are subject to continued input of variation by mutation and migration, although we cannot exclude the possibility of an additional input of major effect genes influencing either trait.

15.
How predictable is the genetic basis of phenotypic adaptation? Answering this question begins by estimating the repeatability of adaptation at the genetic level. Here, we provide a comprehensive estimate of the repeatability of the genetic basis of adaptive phenotypic evolution in a natural system. We used quantitative trait locus (QTL) mapping to discover genomic regions controlling a large number of morphological traits that have diverged in parallel between pairs of threespine stickleback (Gasterosteus aculeatus species complex) in Paxton and Priest lakes, British Columbia. We found that nearly half of QTL affected the same traits in the same direction in both species pairs. Another 40% influenced a parallel phenotypic trait in one lake but not the other. The remaining 10% of QTL had phenotypic effects in opposite directions in the two species pairs. Similarity in the proportional contributions of all QTL to parallel trait differences was about 0.4. Surprisingly, QTL reuse was unrelated to phenotypic effect size. Our results indicate that repeated use of the same genomic regions is a pervasive feature of parallel phenotypic adaptation, at least in sticklebacks. Identifying the causes of this pattern would aid prediction of the genetic basis of phenotypic evolution.

16.
17.
The extraordinary phenotypic diversity of dog breeds has been sculpted by a unique population history accompanied by selection for novel and desirable traits. Here we perform a comprehensive analysis using multiple test statistics to identify regions under selection in 509 dogs from 46 diverse breeds using a newly developed high-density genotyping array consisting of >170,000 evenly spaced SNPs. We first identify 44 genomic regions exhibiting extreme differentiation across multiple breeds. Genetic variation in these regions correlates with variation in several phenotypic traits that vary between breeds, and we identify novel associations with both morphological and behavioral traits. We next scan the genome for signatures of selective sweeps in single breeds, characterized by long regions of reduced heterozygosity and fixation of extended haplotypes. These scans identify hundreds of regions, including 22 blocks of homozygosity longer than one megabase in certain breeds. Candidate selection loci are strongly enriched for developmental genes. We chose one highly differentiated region, associated with body size and ear morphology, and characterized it using high-throughput sequencing to provide a list of variants that may directly affect these traits. This study provides a catalogue of genomic regions showing extreme reduction in genetic variation or population differentiation in dogs, including many linked to phenotypic variation. The many blocks of reduced haplotype diversity observed across the genome in dog breeds are the result of both selection and genetic drift, but extended blocks of homozygosity on a megabase scale appear to be best explained by selection. Further elucidation of the variants under selection will help to uncover the genetic basis of complex traits and disease.

18.
Estimated breeding values for average daily feed intake (AFI; kg/day), residual feed intake (RFI; kg/day) and average daily gain (ADG; kg/day) were generated using a mixed linear model incorporating genomic relationships for 698 Angus steers genotyped with the Illumina BovineSNP50 assay. Association analyses of estimated breeding values (EBVs) were performed for 41,028 single nucleotide polymorphisms (SNPs), and permutation analysis was used to empirically establish the genome-wide significance threshold (P < 0.05) for each trait. SNPs significantly associated with each trait were used in a forward selection algorithm to identify genomic regions putatively harbouring genes with effects on each trait. A total of 53, 66 and 68 SNPs explained 54.12% (24.10%), 62.69% (29.85%) and 55.13% (26.54%) of the additive genetic variation (when accounting for the genomic relationships) in steer breeding values for AFI, RFI and ADG, respectively, within this population. Evaluation by pathway analysis revealed that many of these SNPs are in genomic regions that harbour genes with metabolic functions. The presence of genetic correlations between traits resulted in 13.2% of SNPs selected for AFI and 4.5% of SNPs selected for RFI also being selected for ADG in the analysis of breeding values. While our study identifies panels of SNPs significant for efficiency traits in our population, validation of all SNPs in independent populations will be necessary before commercialization.
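An empirical genome-wide threshold of the kind mentioned above can be sketched with a max-statistic permutation scheme. The abstract does not say which permutation design the authors used, so the scheme below is an assumption: shuffle the trait values, record the largest absolute SNP-trait correlation across all SNPs, repeat, and take the (1 − α) quantile of those maxima. The names `abs_corr` and `permutation_threshold` are hypothetical.

```python
# Minimal sketch of a max-statistic permutation threshold (assumed scheme;
# the abstract does not specify the authors' exact procedure).
import random
from statistics import mean


def abs_corr(x, y):
    """Absolute Pearson correlation; 0.0 if either vector is constant."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return abs(sxy) / (sxx * syy) ** 0.5


def permutation_threshold(genotypes, phenotype, n_perm=200, alpha=0.05, seed=0):
    """
    genotypes: list of SNPs, each a list of allele counts per individual.
    phenotype: list of trait values (e.g. EBVs), one per individual.
    Returns the empirical (1 - alpha) quantile of the permuted maximum statistic.
    """
    rng = random.Random(seed)
    shuffled = list(phenotype)
    max_stats = []
    for _ in range(n_perm):
        rng.shuffle(shuffled)  # breaks any genotype-phenotype link
        max_stats.append(max(abs_corr(snp, shuffled) for snp in genotypes))
    max_stats.sort()
    index = min(n_perm - 1, int((1 - alpha) * n_perm))
    return max_stats[index]
```

A SNP whose observed statistic exceeds the returned value would be declared genome-wide significant at level α under this scheme. Taking the maximum across SNPs in each permutation is what accounts for multiple testing.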

19.
Many traits of biological and agronomic significance in plants are controlled in a complex manner where multiple genes and environmental signals affect the expression of the phenotype. In Oryza sativa (rice), thousands of quantitative genetic signals have been mapped to the rice genome. In parallel, thousands of gene expression profiles have been generated across many experimental conditions. Through the discovery of networks with real gene co-expression relationships, it is possible to identify co-localized genetic and gene expression signals that implicate complex genotype-phenotype relationships. In this work, we used a knowledge-independent, systems genetics approach to discover a high-quality set of co-expression networks, termed Gene Interaction Layers (GILs). Twenty-two GILs were constructed from 1,306 Affymetrix microarray rice expression profiles that were pre-clustered to allow for improved capture of gene co-expression relationships. Functional genomic and genetic data, including over 8,000 QTLs and 766 phenotype-tagged SNPs (p-value ≤ 0.001) from genome-wide association studies, both covering over 230 different rice traits, were integrated with the GILs. An online systems genetics data-mining resource, the GeneNet Engine, was constructed to enable dynamic discovery of gene sets (i.e. network modules) that overlap with genetic traits. GeneNet Engine does not provide the exact set of genes underlying a given complex trait, but through the evidence of gene-marker correspondence, co-expression, and functional enrichment, site visitors can identify genes with potential shared causality for a trait which could then be used for experimental validation. A set of 2 million SNPs was incorporated into the database and serves as a potential set of testable biomarkers for genes in modules that overlap with genetic traits. Herein, we describe two modules found using GeneNet Engine, one with significant overlap with the trait amylose content and another with significant overlap with blast disease resistance.

20.
Surveys of genomic variation have improved our understanding of the relationship between fitness‐related phenotypes and their underlying genetic basis. In some cases, single large‐effect genes have been found to underlie important traits; however, complex traits are expected to be under polygenic control and elucidation of multiple gene interactions may be required to fully understand the genetic basis of the trait. In this study, we investigated the genetic basis of the ocean‐ and river‐maturing ecotypes in anadromous Pacific lamprey (Entosphenus tridentatus). In Pacific lamprey, the ocean‐maturing ecotype is distinguished by advanced maturity of females (e.g., large egg mass) at the onset of freshwater migration relative to immature females of the river‐maturing ecotype. We examined a total of 219 adult Pacific lamprey that were collected at‐entry to the Klamath River over a 12‐month period. Each individual was genotyped at 308 SNPs representing known neutral and adaptive loci and measured at morphological traits, including egg mass as an indicator of ocean‐ and river‐maturing ecotype for females. The two ecotypes did not exhibit genetic structure at 148 neutral loci, indicating that ecotypic diversity exists within a single population. In contrast, we identified the genetic basis of maturation ecotypes in Pacific lamprey as polygenic, involving two unlinked gene regions that have a complex epistatic relationship. Importantly, these gene regions appear to show stronger effects when considered in gene interaction models than if just considered additive, illustrating the importance of considering epistatic effects and gene networks when researching the genetic basis of complex traits in Pacific lamprey and other species.
