首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
PURPOSE OF REVIEW: Recently, genome-wide genetic screening of common DNA sequence variants has proven a successful approach to identify novel genetic contributors to complex traits. This review summarizes recent genome-wide association studies for lipid phenotypes, and evaluates the next steps needed to obtain a full picture of genotype-phenotype correlation and apply these findings to inform clinical practice. RECENT FINDINGS: So far, genome-wide association studies have defined at least 19 genomic regions that contain common DNA single nucleotide polymorphisms associated with LDL cholesterol, HDL cholesterol and/or triglycerides. Of these, eight represent novel loci in humans, whereas 11 genes have been previously implicated in lipoprotein metabolism. Many of the same loci with common variants have already been shown to lead to monogenic lipid disorders in humans and/or mice, suggesting that a spectrum of common and rare alleles at each validated locus contributes to blood lipid concentrations. SUMMARY: At least 19 loci harbor common variations that contribute to blood lipid concentrations in humans. Larger scale genome-wide association studies should identify additional loci, and sequencing of these loci should pinpoint all relevant alleles. With a full catalog of DNA polymorphisms in hand, a panel of lipid-related variants can be studied to provide clinical risk stratification and targeting of therapeutic interventions.  相似文献   

2.
Linkage analysis is a successful procedure to associate diseases with specific genomic regions. These regions are often large, containing hundreds of genes, which make experimental methods employed to identify the disease gene arduous and expensive. We present two methods to prioritize candidates for further experimental study: Common Pathway Scanning (CPS) and Common Module Profiling (CMP). CPS is based on the assumption that common phenotypes are associated with dysfunction in proteins that participate in the same complex or pathway. CPS applies network data derived from protein–protein interaction (PPI) and pathway databases to identify relationships between genes. CMP identifies likely candidates using a domain-dependent sequence similarity approach, based on the hypothesis that disruption of genes of similar function will lead to the same phenotype. Both algorithms use two forms of input data: known disease genes or multiple disease loci. When using known disease genes as input, our combined methods have a sensitivity of 0.52 and a specificity of 0.97 and reduce the candidate list by 13-fold. Using multiple loci, our methods successfully identify disease genes for all benchmark diseases with a sensitivity of 0.84 and a specificity of 0.63. Our combined approach prioritizes good candidates and will accelerate the disease gene discovery process.  相似文献   

3.
Gene prioritization through genomic data fusion   总被引:4,自引:0,他引:4  
The identification of genes involved in health and disease remains a challenge. We describe a bioinformatics approach, together with a freely accessible, interactive and flexible software termed Endeavour, to prioritize candidate genes underlying biological processes or diseases, based on their similarity to known genes involved in these phenomena. Unlike previous approaches, ours generates distinct prioritizations for multiple heterogeneous data sources, which are then integrated, or fused, into a global ranking using order statistics. In addition, it offers the flexibility of including additional data sources. Validation of our approach revealed it was able to efficiently prioritize 627 genes in disease data sets and 76 genes in biological pathway sets, identify candidates of 16 mono- or polygenic diseases, and discover regulatory genes of myeloid differentiation. Furthermore, the approach identified a novel gene involved in craniofacial development from a 2-Mb chromosomal region, deleted in some patients with DiGeorge-like birth defects. The approach described here offers an alternative integrative method for gene discovery.  相似文献   

4.
P Garg  C Borel  AJ Sharp 《PloS one》2012,7(8):e41695
Parent-of-origin (PofO) effects, such as imprinting are a phenomenon in which homologous chromosomes exhibit differential gene expression and epigenetic modifications according to their parental origin. Such non-Mendelian inheritance patterns are generally ignored by conventional association studies, as these tests consider the maternal and paternal alleles as equivalent. To identify regulatory regions that show PofO effects on gene expression (imprinted expression Quantitative Trait Loci, ieQTLs), here we have developed a novel method in which we associate SNP genotypes of defined parental origin with gene expression levels. We applied this method to study 59 HapMap phase II parent-offspring trios. By analyzing mother/father/child trios, rules of Mendelian inheritance allowed the parental origin to be defined for ~95% of SNPs in each child. We used 680,475 informative SNPs and corresponding expression data for 92,167 probe sets from Affymetrix GeneChip Human Exon 1.0 ST arrays and performed four independent cis-association analyses with the expression level of RefSeq genes within 1 Mb using PLINK. Independent analyses of maternal and paternal genotypes identified two significant cis-ieQTLs (p<10(-7)) at which expression of genes SFT2D2 and SRRT associated exclusively with maternally inherited SNPs rs3753292 and rs6945374, respectively. 28 additional suggestive cis-associations with only maternal or paternal SNPs were found at a lower stringency threshold of p<10(-6), including associations with two known imprinted genes PEG10 and TRAPPC9, demonstrating the efficacy of our method. Furthermore, comparison of our method that utilizes independent analyses of maternal and paternal genotypes with the Likelihood Ratio Test (LRT) showed it to be more effective for detecting imprinting effects than the LRT. Our method represents a novel approach that can identify imprinted regulatory elements that control gene expression, suggesting novel PofO effects in the human genome.  相似文献   

5.
Cancer genomes frequently contain somatic copy number alterations (SCNA) that can significantly perturb the expression level of affected genes and thus disrupt pathways controlling normal growth. In melanoma, many studies have focussed on the copy number and gene expression levels of the BRAF, PTEN and MITF genes, but little has been done to identify new genes using these parameters at the genome-wide scale. Using karyotyping, SNP and CGH arrays, and RNA-seq, we have identified SCNA affecting gene expression ('SCNA-genes') in seven human metastatic melanoma cell lines. We showed that the combination of these techniques is useful to identify candidate genes potentially involved in tumorigenesis. Since few of these alterations were recurrent across our samples, we used a protein network-guided approach to determine whether any pathways were enriched in SCNA-genes in one or more samples. From this unbiased genome-wide analysis, we identified 28 significantly enriched pathway modules. Comparison with two large, independent melanoma SCNA datasets showed less than 10% overlap at the individual gene level, but network-guided analysis revealed 66% shared pathways, including all but three of the pathways identified in our data. Frequently altered pathways included WNT, cadherin signalling, angiogenesis and melanogenesis. Additionally, our results emphasize the potential of the EPHA3 and FRS2 gene products, involved in angiogenesis and migration, as possible therapeutic targets in melanoma. Our study demonstrates the utility of network-guided approaches, for both large and small datasets, to identify pathways recurrently perturbed in cancer.  相似文献   

6.
MOTIVATION: Discovery of regulatory motifs in unaligned DNA sequences remains a fundamental problem in computational biology. Two categories of algorithms have been developed to identify common motifs from a set of DNA sequences. The first can be called a 'multiple genes, single species' approach. It proposes that a degenerate motif is embedded in some or all of the otherwise unrelated input sequences and tries to describe a consensus motif and identify its occurrences. It is often used for co-regulated genes identified through experimental approaches. The second approach can be called 'single gene, multiple species'. It requires orthologous input sequences and tries to identify unusually well conserved regions by phylogenetic footprinting. Both approaches perform well, but each has some limitations. It is tempting to combine the knowledge of co-regulation among different genes and conservation among orthologous genes to improve our ability to identify motifs. RESULTS: Based on the Consensus algorithm previously established by our group, we introduce a new algorithm called PhyloCon (Phylogenetic Consensus) that takes into account both conservation among orthologous genes and co-regulation of genes within a species. This algorithm first aligns conserved regions of orthologous sequences into multiple sequence alignments, or profiles, then compares profiles representing non-orthologous sequences. Motifs emerge as common regions in these profiles. Here we present a novel statistic to compare profiles of DNA sequences and a greedy approach to search for common subprofiles. We demonstrate that PhyloCon performs well on both synthetic and biological data. AVAILABILITY: Software available upon request from the authors. http://ural.wustl.edu/softwares.html  相似文献   

7.
The orphan nuclear receptor Nurr1 is expressed in the developing and adult central nervous system. Previous studies have shown that Nurr1 is essential for the generation of midbrain dopamine neurons. Furthermore, Nurr1 is critical for respiratory functions associated with the brain stem. Very few Nurr1 regulated genes have been identified and it remains unclear how Nurr1 influences the function and development of neurons. To identify novel Nurr1 target genes we have searched for regulated genes in the dopaminergic MN9D cell line. These experiments identified Neuropilin-1 (Nrp1), a receptor protein involved in axon guidance and angiogenesis, as a novel Nurr1 target gene. Nrp1 expression was rapidly up-regulated by Nurr1 in MN9D cells and in situ hybridization analysis showed that Nrp1 was coexpressed with Nurr1 in the brain stem dorsal motor nucleus. Importantly, Nrp1 expression was down-regulated in this area in Nurr1 null mice. Moreover, two functional Nurr1 binding sites were identified in the Nrp1 promoter and Nurr1 was found to be recruited to these sites in MN9D cells, further supporting that Nrp1 is a direct downstream target of Nurr1. Taken together, our findings suggest that Nurr1 might influence the processes of axon guidance and/or angiogenesis via the regulation of Nrp1 expression.  相似文献   

8.
MOTIVATION: Genomic imprinting plays an important role in both normal development and diseases. Abnormal imprinting is strongly associated with several human diseases including cancers. Most of the imprinted genes were discovered in the neighborhood of the known imprinted genes. This approach is difficult to extend to analyze the whole genome. We have decided to take a computational approach to systematically search the whole genome for the presence of mono-allelic expressed genes and imprinted genes in human genome. RESULTS: A computational method was developed to identify novel imprinted or mono-allelic genes. Individuals represented in human cDNA libraries were genotyped using Bayesian statistics, and differential expression of polymorphic alleles was identified. A significant reduction in the number of libraries that expressed both alleles, measured by Z-statistics, is a strong indicator for an imprinted or a mono-allelic gene. AVAILABILITY: The data sets are available at http://leelab.nci.nih.gov/leelab/jsp/IGDM/IGDM.html  相似文献   

9.
MOTIVATION: Recent studies have shown that microarray gene expression data are useful for phenotype classification of many diseases. A major problem in this classification is that the number of features (genes) greatly exceeds the number of instances (tissue samples). It has been shown that selecting a small set of informative genes can lead to improved classification accuracy. Many approaches have been proposed for this gene selection problem. Most of the previous gene ranking methods typically select 50-200 top-ranked genes and these genes are often highly correlated. Our goal is to select a small set of non-redundant marker genes that are most relevant for the classification task. RESULTS: To achieve this goal, we developed a novel hybrid approach that combines gene ranking and clustering analysis. In this approach, we first applied feature filtering algorithms to select a set of top-ranked genes, and then applied hierarchical clustering on these genes to generate a dendrogram. Finally, the dendrogram was analyzed by a sweep-line algorithm and marker genes are selected by collapsing dense clusters. Empirical study using three public datasets shows that our approach is capable of selecting relatively few marker genes while offering the same or better leave-one-out cross-validation accuracy compared with approaches that use top-ranked genes directly for classification. AVAILABILITY: The HykGene software is freely available at http://www.cs.dartmouth.edu/~wyh/software.htm CONTACT: wyh@cs.dartmouth.edu SUPPLEMENTARY INFORMATION: Supplementary material is available from http://www.cs.dartmouth.edu/~wyh/hykgene/supplement/index.htm.  相似文献   

10.
Extravillous cytotrophoblasts isolated from first trimester placenta, and immortalised cell lines derived from them, have the intrinsic ability to form endothelial-like tubes when cultured on Matrigel™ extracellular matrix. This in vitro tube formation may model placental angiogenesis and/or endovascular differentiation by trophoblasts. To interpret the relevance of this phenomenon to placental development, we used a gene expression microarray approach to identify which genes and pathways are associated with the tube-forming phenotype of HTR8/SVneo first trimester trophoblasts (HTR8-M), compared with HTR8/SVneo not forming tubes on plastic culture surface (HTR8-P). Furthermore, we used weighted gene co-expression network analysis (WGCNA) of microarray data to identify modules of co-expressed genes underlying the biological processes. There were 481 genes differentially expressed between HTR8-M and HTR8-P and these were significantly enriched for blood vessel development and related gene ontologies. WGCNA clustered the genes into 9 co-expression modules. One module was significantly associated with HTR8-M (p = 1.15E-05) and contained genes involved in actin cytoskeleton organization, cell migration and blood vessel development, consistent with tube formation on Matrigel. Another module was significantly associated with HTR8-P (p = 1.94E-05) and was enriched for genes involved in mitosis, consistent with proliferation by cells on plastic which do not differentiate. Up-regulation of angiogenesis and vascular development pathways in endovascular trophoblasts in vivo could underpin spiral artery remodelling processes, which are defective in preeclamptic pregnancies.  相似文献   

11.
Reconstructing the duplication history of tandemly repeated genes   总被引:4,自引:0,他引:4  
We present a novel approach to deal with the problem of reconstructing the duplication history of tandemly repeated genes that are supposed to have arisen from unequal recombination. We first describe the mathematical model of evolution by tandem duplication and introduce duplication histories and duplication trees. We then provide a simple recursive algorithm which determines whether or not a given rooted phylogeny can be a duplication history and another algorithm that simulates the unequal recombination process and searches for the best duplication trees according to the maximum parsimony criterion. We use real data sets of human immunoglobulins and T-cell receptors to validate our methods and algorithms. Identity between most parsimonious duplication trees and most parsimonious phylogenies for the same data, combined with the agreement with additional knowledge about the sequences, such as the presence of polymorphisms, shows strong evidence that our reconstruction procedure provides good insights into the duplication histories of these loci.  相似文献   

12.
13.
MOTIVATION: Analysis of oligonucleotide array data, especially to select genes of interest, is a highly challenging task because of the large volume of information and various experimental factors. Moreover, interaction effect (i.e. expression changes depend on probe effects) complicates the analysis because current methods often use an additive model to analyze data. We propose an approach to address these issues with the aim of producing a more reliable selection of differentially expressed genes. The approach uses the rank for normalization, employs the percentile-range to measure expression variation, and applies various filters to monitor expression changes. RESULTS: We compare our approach with MAS and Dchip models. A data set from an angiogenesis study is used for illustration. Results show that our approach performs better than other methods either in identification of the positive control gene or in PCR confirmatory tests. In addition, the invariant set of genes in our approach provides an efficient way for normalization.  相似文献   

14.
Pathway analyses of genome-wide association studies aggregate information over sets of related genes, such as genes in common pathways, to identify gene sets that are enriched for variants associated with disease. We develop a model-based approach to pathway analysis, and apply this approach to data from the Wellcome Trust Case Control Consortium (WTCCC) studies. Our method offers several benefits over existing approaches. First, our method not only interrogates pathways for enrichment of disease associations, but also estimates the level of enrichment, which yields a coherent way to promote variants in enriched pathways, enhancing discovery of genes underlying disease. Second, our approach allows for multiple enriched pathways, a feature that leads to novel findings in two diseases where the major histocompatibility complex (MHC) is a major determinant of disease susceptibility. Third, by modeling disease as the combined effect of multiple markers, our method automatically accounts for linkage disequilibrium among variants. Interrogation of pathways from eight pathway databases yields strong support for enriched pathways, indicating links between Crohn''s disease (CD) and cytokine-driven networks that modulate immune responses; between rheumatoid arthritis (RA) and “Measles” pathway genes involved in immune responses triggered by measles infection; and between type 1 diabetes (T1D) and IL2-mediated signaling genes. Prioritizing variants in these enriched pathways yields many additional putative disease associations compared to analyses without enrichment. For CD and RA, 7 of 8 additional non-MHC associations are corroborated by other studies, providing validation for our approach. For T1D, prioritization of IL-2 signaling genes yields strong evidence for 7 additional non-MHC candidate disease loci, as well as suggestive evidence for several more. Of the 7 strongest associations, 4 are validated by other studies, and 3 (near IL-2 signaling genes RAF1, MAPK14, and FYN) constitute novel putative T1D loci for further study.  相似文献   

15.
16.
Asthma is a complex trait for which different strategies have been used to identify its environmental and genetic predisposing factors. Here, we describe a novel methodological approach to select candidate genes for asthma genetic association studies. In this regard, the Genes to Diseases (G2D) computational tool has been used in combination with a genome-wide scan performed in a sub-sample of the Saguenay-Lac-St-Jean (SLSJ) asthmatic familial collection (n = 609) to identify candidate genes located in two suggestive loci shown to be linked with asthma (6q26) and atopy (10q26.3), and presenting differential parent-of-origin effects. This approach combined gene selection based on the G2D data mining analysis of the bibliographic and protein public databases, or according to the genes already known to be associated with the same or a similar phenotype. Ten genes (LPA, NOX3, SNX9, VIL2, VIP, ADAM8, DOCK1, FANK1, GPR123 and PTPRE) were selected for a subsequent association study performed in a large SLSJ sample (n = 1167) of individuals tested for asthma and atopy related phenotypes. Single nucleotide polymorphisms (n = 91) within the candidate genes were genotyped and analysed using a family-based association test. The results suggest a protective association to allergic asthma for PTPRE rs7081735 in the SLSJ sample (p = 0.000463; corrected p = 0.0478). This association has not been replicated in the Childhood Asthma Management Program (CAMP) cohort. Sequencing of the regions around rs7081735 revealed additional polymorphisms, but additional genotyping did not yield new associations. These results demonstrate that the G2D tool can be useful in the selection of candidate genes located in chromosomal regions linked to a complex trait.  相似文献   

17.
18.

Background

Prioritizing genetic variants is a challenge because disease susceptibility loci are often located in genes of unknown function or the relationship with the corresponding phenotype is unclear. A global data-mining exercise on the biomedical literature can establish the phenotypic profile of genes with respect to their connection to disease phenotypes. The importance of protein-protein interaction networks in the genetic heterogeneity of common diseases or complex traits is becoming increasingly recognized. Thus, the development of a network-based approach combined with phenotypic profiling would be useful for disease gene prioritization.

Results

We developed a random-set scoring model and implemented it to quantify phenotype relevance in a network-based disease gene-prioritization approach. We validated our approach based on different gene phenotypic profiles, which were generated from PubMed abstracts, OMIM, and GeneRIF records. We also investigated the validity of several vocabulary filters and different likelihood thresholds for predicted protein-protein interactions in terms of their effect on the network-based gene-prioritization approach, which relies on text-mining of the phenotype data. Our method demonstrated good precision and sensitivity compared with those of two alternative complex-based prioritization approaches. We then conducted a global ranking of all human genes according to their relevance to a range of human diseases. The resulting accurate ranking of known causal genes supported the reliability of our approach. Moreover, these data suggest many promising novel candidate genes for human disorders that have a complex mode of inheritance.

Conclusion

We have implemented and validated a network-based approach to prioritize genes for human diseases based on their phenotypic profile. We have devised a powerful and transparent tool to identify and rank candidate genes. Our global gene prioritization provides a unique resource for the biological interpretation of data from genome-wide association studies, and will help in the understanding of how the associated genetic variants influence disease or quantitative phenotypes.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2105-15-315) contains supplementary material, which is available to authorized users.  相似文献   

19.
Angiogenesis is important for many physiological processes, diseases, and also regenerative medicine. Therapies that inhibit the vascular endothelial growth factor (VEGF) pathway have been used in the clinic for cancer and macular degeneration. In cancer applications, these treatments suffer from a "tumor escape phenomenon" where alternative pathways are upregulated and angiogenesis continues. The redundancy of angiogenesis regulation indicates the need for additional studies and new drug targets. We aimed to (i) identify novel and missing angiogenesis annotations and (ii) verify their significance to angiogenesis. To achieve these goals, we integrated the human interactome with known angiogenesis-annotated proteins to identify a set of 202 angiogenesis-associated proteins. Across endothelial cell lines, we found that a significant fraction of these proteins had highly perturbed gene expression during angiogenesis. After treatment with VEGF-A, we found increasing expression of HIF-1α, APP, HIV-1 tat interactive protein 2, and MEF2C, while endoglin, liprin β1 and HIF-2α had decreasing expression across three endothelial cell lines. The analysis showed differential regulation of HIF-1α and HIF-2α. The data also provided additional evidence for the role of endothelial cells in Alzheimer's disease.  相似文献   

20.
Formation of a functional vasculature during mammalian development is essential for embryonic survival. In addition, imbalance in blood vessel growth contributes to the pathogenesis of numerous disorders. Most of our understanding of vascular development and blood vessel growth comes from investigating the Vegf signaling pathway as well as the recent observation that molecules involved in axon guidance also regulate vascular patterning. In order to take an unbiased, yet focused, approach to identify novel genes regulating vascular development, we performed a three-step ENU mutagenesis screen in zebrafish. We first screened live embryos visually, evaluating blood flow in the main trunk vessels, which form by vasculogenesis, and the intersomitic vessels, which form by angiogenesis. Embryos that displayed reduced or absent circulation were fixed and stained for endogenous alkaline phosphatase activity to reveal blood vessel morphology. All putative mutants were then crossed into the Tg(flk1:EGFP)(s843) transgenic background to facilitate detailed examination of endothelial cells in live and fixed embryos. We screened 4015 genomes and identified 30 mutations affecting various aspects of vascular development. Specifically, we identified 3 genes (or loci) that regulate the specification and/or differentiation of endothelial cells, 8 genes that regulate vascular tube and lumen formation, 8 genes that regulate vascular patterning, and 11 genes that regulate vascular remodeling, integrity and maintenance. Only 4 of these genes had previously been associated with vascular development in zebrafish illustrating the value of this focused screen. The analysis of the newly defined loci should lead to a greater understanding of vascular development and possibly provide new drug targets to treat the numerous pathologies associated with dysregulated blood vessel growth.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号