首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Variations and similarities in our individual genomes are part of our history, our heritage, and our identity. Some human genomic variants are associated with common traits such as hair and eye color, while others are associated with susceptibility to disease or response to drug treatment. Identifying the human variations producing clinically relevant phenotypic changes is critical for providing accurate and personalized diagnosis, prognosis, and treatment for diseases. Furthermore, a better understanding of the molecular underpinning of disease can lead to development of new drug targets for precision medicine. Several resources have been designed for collecting and storing human genomic variations in highly structured, easily accessible databases. Unfortunately, a vast amount of information about these genetic variants and their functional and phenotypic associations is currently buried in the literature, only accessible by manual curation or sophisticated text text-mining technology to extract the relevant information. In addition, the low cost of sequencing technologies coupled with increasing computational power has enabled the development of numerous computational methodologies to predict the pathogenicity of human variants. This review provides a detailed comparison of current human variant resources, including HGMD, OMIM, ClinVar, and UniProt/Swiss-Prot, followed by an overview of the computational methods and techniques used to leverage the available data to predict novel deleterious variants. We expect these resources and tools to become the foundation for understanding the molecular details of genomic variants leading to disease, which in turn will enable the promise of precision medicine.  相似文献   

2.
3.
BackgroundThe success of collapsing methods which investigate the combined effect of rare variants on complex traits has so far been limited. The manner in which variants within a gene are selected prior to analysis has a crucial impact on this success, which has resulted in analyses conventionally filtering variants according to their consequence. This study investigates whether an alternative approach to filtering, using annotations from recently developed bioinformatics tools, can aid these types of analyses in comparison to conventional approaches.ConclusionIncorporating variant annotations from non-coding bioinformatics tools should prove to be a valuable asset for rare variant analyses in the future. Filtering by variant consequence is only possible in coding regions of the genome, whereas utilising non-coding bioinformatics annotations provides an opportunity to discover unknown causal variants in non-coding regions as well. This should allow studies to uncover a greater number of causal variants for complex traits and help elucidate their functional role in disease.  相似文献   

4.
The human genome encodes a limited number of genes yet contributes to individual differences in a vast array of heritable traits. A possible explanation for the capacity our genome to generate this virtually unlimited range of phenotypic variation in complex traits is to assume functional interactions between genes. Therefore we searched two mammalian genomes to identify potential epistatic interactions by looking for co-adapted genes marked by excess two-locus genetic differentiation between populations/lineages using publicly available SNP genotype data. The practical motivation for this effort is to reduce the number of pair-wise tests that need to be performed in genome-wide association studies aimed at detecting GxG interactions, by focusing on pairs predicted to be more likely to jointly affect variation in complex traits. Hence, this approach generates a list of candidate interactions that can be empirically tested. In both the mouse and human data we observed two-locus genetic differentiation in excess of what can be expected from chance alone based on simulations. In an attempt to validate our hypothesis that pairs of genes showing excess genetic divergence represent potential functional interactions, we selected a small set of gene combinations postulated to be interacting based on our analyses and looked for a combined effect of the selected genes on variation in complex traits in both mice and man. In both cases the individual effect of the genes were not significant, instead we observed marginally significant interaction effects. These results show that genome wide searches for gene-gene interactions based on population genetic data are feasible and can generate interesting candidate gene pairs to be further tested for their contribution to phenotypic variation in complex traits.  相似文献   

5.
MOTIVATION: The past decade has seen the introduction of fast and relatively inexpensive methods to detect genetic variation across the genome and exponential growth in the number of known single nucleotide variants (SNVs). There is increasing interest in bioinformatics approaches to identify variants that are functionally important from millions of candidate variants. Here, we describe the essential components of bioinformatics tools that predict functional SNVs. RESULTS: Bioinformatics tools have great potential to identify functional SNVs, but the black box nature of many tools can be a pitfall for researchers. Understanding the underlying methods, assumptions and biases of these tools is essential to their intelligent application.  相似文献   

6.
Expression quantitative trait loci (eQTLs) are currently the most abundant and systematically-surveyed class of functional consequence for genetic variation. Recent genetic studies of gene expression have identified thousands of eQTLs in diverse tissue types for the majority of human genes. Application of this large eQTL catalog provides an important resource for understanding the molecular basis of common genetic diseases. However, only now has both the availability of individuals with full genomes and corresponding advances in functional genomics provided the opportunity to dissect eQTLs to identify causal regulatory variants. Resolving the properties of such causal regulatory variants is improving understanding of the molecular mechanisms that influence traits and guiding the development of new genome-scale approaches to variant interpretation. In this review, we provide an overview of current computational and experimental methods for identifying causal regulatory variants and predicting their phenotypic consequences.  相似文献   

7.
Recent advances in genotyping methodologies have allowed genome-wide association studies (GWAS) to accurately identify genetic variants that associate with common or pathological complex traits. Although most GWAS have focused on associations with single genetic variants, joint identification of multiple genetic variants, and how they interact, is essential for understanding the genetic architecture of complex phenotypic traits. Here, we propose an efficient stepwise method based on the Cochran-Mantel-Haenszel test (for stratified categorical data) to identify causal joint multiple genetic variants in GWAS. This method combines the CMH statistic with a stepwise procedure to detect multiple genetic variants associated with specific categorical traits, using a series of associated I × J contingency tables and a null hypothesis of no phenotype association. Through a new stratification scheme based on the sum of minor allele count criteria, we make the method more feasible for GWAS data having sample sizes of several thousands. We also examine the properties of the proposed stepwise method via simulation studies, and show that the stepwise CMH test performs better than other existing methods (e.g., logistic regression and detection of associations by Markov blanket) for identifying multiple genetic variants. Finally, we apply the proposed approach to two genomic sequencing datasets to detect linked genetic variants associated with bipolar disorder and obesity, respectively.  相似文献   

8.
Genome-wide association studies (GWAS) have had a tremendous success in the identification of common DNA sequence variants associated with complex human diseases and traits. However, because of their design, GWAS are largely inappropriate to characterize the role of rare and low-frequency DNA variants on human phenotypic variation. Rarer genetic variation is geographically more restricted, supporting the need for local whole-genome sequencing (WGS) efforts to study these variants in specific populations. Here, we present the first large-scale low-pass WGS of the French-Canadian population. Specifically, we sequenced at ~5.6× coverage the whole genome of 1970 French Canadians recruited by the Montreal Heart Institute Biobank and identified 29 million bi-allelic variants (31 % novel), including 19 million variants with a minor allele frequency (MAF) <0.5 %. Genotypes from the WGS data are highly concordant with genotypes obtained by exome array on the same individuals (99.8 %), even when restricting this analysis to rare variants (MAF <0.5, 99.9 %) or heterozygous sites (98.9 %). To further validate our data set, we showed that we can effectively use it to replicate several genetic associations with myocardial infarction risk and blood lipid levels. Furthermore, we analyze the utility of our WGS data set to generate a French-Canadian-specific imputation reference panel and to infer population structure in the Province of Quebec. Our results illustrate the value of low-pass WGS to study the genetics of human diseases in the founder French-Canadian population.  相似文献   

9.
Significantly improved crop varieties are urgently needed to feed the rapidly growing human population under changing climates. While genome sequence information and excellent genomic tools are in place for major crop species, the systematic quantification of phenotypic traits or components thereof in a high-throughput fashion remains an enormous challenge. In order to help bridge the genotype to phenotype gap, we developed a comprehensive framework for high-throughput phenotype data analysis in plants, which enables the extraction of an extensive list of phenotypic traits from nondestructive plant imaging over time. As a proof of concept, we investigated the phenotypic components of the drought responses of 18 different barley (Hordeum vulgare) cultivars during vegetative growth. We analyzed dynamic properties of trait expression over growth time based on 54 representative phenotypic features. The data are highly valuable to understand plant development and to further quantify growth and crop performance features. We tested various growth models to predict plant biomass accumulation and identified several relevant parameters that support biological interpretation of plant growth and stress tolerance. These image-based traits and model-derived parameters are promising for subsequent genetic mapping to uncover the genetic basis of complex agronomic traits. Taken together, we anticipate that the analytical framework and analysis results presented here will be useful to advance our views of phenotypic trait components underlying plant development and their responses to environmental cues.  相似文献   

10.
A considerable and unanticipated plasticity of the human genome, manifested as inter-individual copy number variation, has been discovered. These structural changes constitute a major source of inter-individual genetic variation that could explain variable penetrance of inherited (Mendelian and polygenic) diseases and variation in the phenotypic expression of aneuploidies and sporadic traits, and might represent a major factor in the aetiology of complex, multifactorial traits. For these reasons, an effort should be made to discover all common and rare copy number variants (CNVs) in the human population. This will also enable systematic exploration of both SNPs and CNVs in association studies to identify the genomic contributors to the common disorders and complex traits.  相似文献   

11.
One of the longest running debates in evolutionary biology concerns the kind of genetic variation that is primarily responsible for phenotypic variation in species. Here, we address this question for humans specifically from the perspective of population allele frequency of variants across the complete genome, including both coding and noncoding regions. We establish simple criteria to assess the likelihood that variants are functional based on their genomic locations and then use whole-genome sequence data from 29 subjects of European origin to assess the relationship between the functional properties of variants and their population allele frequencies. We find that for all criteria used to assess the likelihood that a variant is functional, the rarer variants are significantly more likely to be functional than the more common variants. Strikingly, these patterns disappear when we focus on only those variants in which the major alleles are derived. These analyses indicate that the majority of the genetic variation in terms of phenotypic consequence may result from a mutation-selection balance, as opposed to balancing selection, and have direct relevance to the study of human disease.  相似文献   

12.
Rapid advances and cost erosion in exome and genome analysis of patients with both rare and common genetic disorders have accelerated gene discovery and illuminated fundamental biological mechanisms. The thrill of discovery has been accompanied, however, with the sobering appreciation that human genomes are burdened with a large number of rare and ultra rare variants, thereby posing a significant challenge in dissecting both the effect of such alleles on protein function and also the biological relevance of these events to patient pathology. In an effort to develop model systems that are able to generate surrogates of human pathologies, a powerful suite of tools have been developed in zebrafish, taking advantage of the relatively small (compared to invertebrate models) evolutionary distance of that genome to humans, the orthology of several organs and signaling processes, and the suitability of this organism for medium and high throughput phenotypic screening. Here we will review the use of this model organism in dissecting human genetic disorders; we will highlight how diverse strategies have informed disease causality and genetic architecture; and we will discuss relative strengths and limitations of these approaches in the context of medical genome sequencing. This article is part of a Special Issue entitled: From Genome to Function.  相似文献   

13.
Morphological divergence among species may be constrained by the pattern of genetic variances and covariances among traits within species. Assessing the existence of such a relationship in nature requires information on the stability of intraspecific correlation and covariance structure and the correspondence of this structure to the pattern of evolutionary divergence within a lineage. Here, we investigate these issues for nine morphological traits and 15 species of stalk-eyed flies in the genus Diasemopsis. Within-species matrices for these traits were generated from phenotypic data for all the Diasemopsis species and from genetic data for a single Diasemopsis species, D. dubia. The among-species pattern of divergence was assessed by calculating the evolutionary correlations for all pairwise combinations of the morphological traits along the phylogeny of these species. Comparisons of intraspecific matrices reveal significant similarity among all species in the phenotypic correlations matrices but not the covariance matrices. In addition, the differences in correlation structure that do exist among species are not related to their phylogenetic placement or change in the means of the traits. Comparisons of the phenotypic and phylogenetic matrices suggest a strong relationship between the pattern of evolutionary change among species and both the intraspecific correlation structure and the stability of this structure among species. The phenotypic and the phylogenetic matrices are significantly similar, and pairs of traits whose intraspecific correlations are more stable across taxa exhibit stronger coevolution on the phylogeny. These results suggest either the existence of strong constraints on the pattern of evolutionary change or a consistent pattern of correlated selection shaping both the phenotypic and phylogenetic matrices. The genetic correlation structure for D. dubia, however, does not correspond with patterns found in the phenotypic and phylogenetic data. Possible reasons for this disagreement are discussed.  相似文献   

14.
It is a fundamental challenge to discover the association of genetic traits with phenotypic traits. In this study, we aimed to identify possible genetic traits related to horse temperament. Based on previous findings, we selected 71 candidate genes related to temperamental trait and examined them in the human and horse reference genomes (hg38 and equCab2, respectively). We found 16 orthologous genes and, by comparing with the human reference genome, 17 homologous genes in the horse reference genome. We designed probes specific for the 33 horse genes. Using the probes, we built sequencing libraries of the genomic DNA samples from eight aggressive and eight docile horses, and sequenced the constructed libraries using the Illumina Hiseq2500 platform. Through the analysis of the targeted exome sequences, we identified single nucleotide polymorphisms (SNPs) in the genes. SNPs could be served as genetic markers to evaluate aggressive or docile levels of horses. To examine whether any genetic variants are associated with horse temperament, we performed genome-wide association study (GWAS) using the SNP data. GWAS analysis identified ten variants (p-value?<0.05) which could be related to horse temperament. We validated the variants using Sanger sequencing. The most significant variants were found in MAOA (c.1164+41T>C) and AR (c.1047+27G>T) genes with 8.09?×?10?4 p-value. We suggest that the variants might be used to assess horse temperament and to determine superior horses for riding or racing.  相似文献   

15.
16.
Bioinformatics is central to biology education in the 21st century. With the generation of terabytes of data per day, the application of computer-based tools to stored and distributed data is fundamentally changing research and its application to problems in medicine, agriculture, conservation and forensics. In light of this 'information revolution,' undergraduate biology curricula must be redesigned to prepare the next generation of informed citizens as well as those who will pursue careers in the life sciences. The BEDROCK initiative (Bioinformatics Education Dissemination: Reaching Out, Connecting and Knitting together) has fostered an international community of bioinformatics educators. The initiative's goals are to: (i) Identify and support faculty who can take leadership roles in bioinformatics education; (ii) Highlight and distribute innovative approaches to incorporating evolutionary bioinformatics data and techniques throughout undergraduate education; (iii) Establish mechanisms for the broad dissemination of bioinformatics resource materials and teaching models; (iv) Emphasize phylogenetic thinking and problem solving; and (v) Develop and publish new software tools to help students develop and test evolutionary hypotheses. Since 2002, BEDROCK has offered more than 50 faculty workshops around the world, published many resources and supported an environment for developing and sharing bioinformatics education approaches. The BEDROCK initiative builds on the established pedagogical philosophy and academic community of the BioQUEST Curriculum Consortium to assemble the diverse intellectual and human resources required to sustain an international reform effort in undergraduate bioinformatics education.  相似文献   

17.
《Trends in genetics : TIG》2023,39(9):703-714
Phenotypic plasticity, the ability of an organism to display different phenotypes across environments, is widespread in nature. Plasticity aids survival in novel environments. Herein, we review studies from yeast that allow us to start uncovering the genetic architecture of phenotypic plasticity. Genetic variants and their interactions impact the phenotype in different environments, and distinct environments modulate the impact of genetic variants and their interactions on the phenotype. Because of this, certain hidden genetic variation is expressed in specific genetic and environmental backgrounds. A better understanding of the genetic mechanisms of phenotypic plasticity will help to determine short- and long-term responses to selection and how wide variation in disease manifestation occurs in human populations.  相似文献   

18.
Despite the evidence that human obesity has strong genetic determinants, efforts at identifying specific genes that influence human obesity have largely been unsuccessful. Using the sibship data obtained from 32 low income Mexican American pedigrees ascertained on a type II diabetic proband and a multipoint variance-components method, we tested for linkage between various obesity-related traits plus associated metabolic traits and 15 markers on human chromosome 7. We found evidence for linkage between markers in the OB gene region and various traits, as follows: D7S514 and extremity skinfolds (LOD = 3.1), human carboxypeptidase A1 (HCPA1) and 32,33-split proinsulin level (LOD = 4.2), and HCPA1 and proinsulin level (LOD = 3.2). A putative susceptibility locus linked to the marker D7S514 explained 56% of the total phenotypic variation in extremity skinfolds. Variation at the HCPA1 locus explained 64% of phenotypic variation in proinsulin level and approximately 73% of phenotypic variation in split proinsulin concentration, respectively. Weaker evidence for linkage to several other obesity-related traits (e.g., waist circumference, body-mass index, fat mass by bioimpedance, etc.) was observed for a genetic location, which is approximately 15 cM telomeric to OB. In conclusion, our study reveals that the OB region plays a significant role in determining the phenotypic variation of both insulin precursors and obesity-related traits, at least in Mexican Americans.  相似文献   

19.
Considerable clinical and molecular variations have been known in retinal blinding diseases in man and also in dogs. Different forms of retinal diseases occur in specific breed(s) caused by mutations segregating within each isolated breeding population. While molecular studies to find genes and mutations underlying retinal diseases in dogs have benefited largely from the phenotypic and genetic uniformity within a breed, within- and across-breed variations have often played a key role in elucidating the molecular basis. The increasing knowledge of phenotypic, allelic, and genetic heterogeneities in canine retinal degeneration has shown that the overall picture is rather more complicated than initially thought. Over the past 20?years, various approaches have been developed and tested to search for genes and mutations underlying genetic traits in dogs, depending on the availability of genetic tools and sample resources. Candidate gene, linkage analysis, and genome-wide association studies have so far identified 24 mutations in 18 genes underlying retinal diseases in at least 58 dog breeds. Many of these genes have been associated with retinal diseases in humans, thus providing opportunities to study the role in pathogenesis and in normal vision. Application in therapeutic interventions such as gene therapy has proven successful initially in a naturally occurring dog model followed by trials in human patients. Other genes whose human homologs have not been associated with retinal diseases are potential candidates to explain equivalent human diseases and contribute to the understanding of their function in vision.  相似文献   

20.
Although genome-wide association studies (GWASs) have discovered numerous novel genetic variants associated with many complex traits and diseases, those genetic variants typically explain only a small fraction of phenotypic variance. Factors that account for phenotypic variance include environmental factors and gene-by-environment interactions (GEIs). Recently, several studies have conducted genome-wide gene-by-environment association analyses and demonstrated important roles of GEIs in complex traits. One of the main challenges in these association studies is to control effects of population structure that may cause spurious associations. Many studies have analyzed how population structure influences statistics of genetic variants and developed several statistical approaches to correct for population structure. However, the impact of population structure on GEI statistics in GWASs has not been extensively studied and nor have there been methods designed to correct for population structure on GEI statistics. In this paper, we show both analytically and empirically that population structure may cause spurious GEIs and use both simulation and two GWAS datasets to support our finding. We propose a statistical approach based on mixed models to account for population structure on GEI statistics. We find that our approach effectively controls population structure on statistics for GEIs as well as for genetic variants.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号