首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Any given human individual carries multiple genetic variants that disrupt protein-coding genes, through structural variation, as well as nucleotide variants and indels. Predicting the phenotypic consequences of a gene disruption remains a significant challenge. Current approaches employ information from a range of biological networks to predict which human genes are haploinsufficient (meaning two copies are required for normal function) or essential (meaning at least one copy is required for viability). Using recently available study gene sets, we show that these approaches are strongly biased towards providing accurate predictions for well-studied genes. By contrast, we derive a haploinsufficiency score from a combination of unbiased large-scale high-throughput datasets, including gene co-expression and genetic variation in over 6000 human exomes. Our approach provides a haploinsufficiency prediction for over twice as many genes currently unassociated with papers listed in Pubmed as three commonly-used approaches, and outperforms these approaches for predicting haploinsufficiency for less-studied genes. We also show that fine-tuning the predictor on a set of well-studied ‘gold standard’ haploinsufficient genes does not improve the prediction for less-studied genes. This new score can readily be used to prioritize gene disruptions resulting from any genetic variant, including copy number variants, indels and single-nucleotide variants.  相似文献   

2.
A variety of single-gene human diseases are caused by haploinsufficiency, a genetic condition by which mutational inactivation of one allele leads to reduced protein levels and functional impairment. Translational enhancement of the spare allele could exert a therapeutic effect. Here we developed BOOST, a novel gene-editing approach to rescue haploinsufficiency loci by the change of specific single nucleotides in the Kozak sequence, which controls translation by regulating start codon recognition. We evaluated for translational strength 230 Kozak sequences of annotated human haploinsufficient genes and 4621 derived variants, which can be installed by base editing, by a high-throughput reporter assay. Of these variants, 149 increased the translation of 47 Kozak sequences, demonstrating that a substantial proportion of haploinsufficient genes are controlled by suboptimal Kozak sequences. Validation of 18 variants for 8 genes produced an average enhancement in an expression window compatible with the rescue of the genetic imbalance. Base editing of the NCF1 gene, whose monoallelic loss causes chronic granulomatous disease, resulted in the desired increase of NCF1 (p47phox) protein levels in a relevant cell model. We propose BOOST as a fine-tuned approach to modulate translation, applicable to the correction of dozens of haploinsufficient monogenic disorders independently of the causing mutation.  相似文献   

3.
Mechanisms of haploinsufficiency revealed by genome-wide profiling in yeast   总被引:16,自引:0,他引:16  
Haploinsufficiency is defined as a dominant phenotype in diploid organisms that are heterozygous for a loss-of-function allele. Despite its relevance to human disease, neither the extent of haploinsufficiency nor its precise molecular mechanisms are well understood. We used the complete set of Saccharomyces cerevisiae heterozygous deletion strains to survey the genome for haploinsufficiency via fitness profiling in rich (YPD) and minimal media to identify all genes that confer a haploinsufficient growth defect. This assay revealed that approximately 3% of all approximately 5900 genes tested are haploinsufficient for growth in YPD. This class of genes is functionally enriched for metabolic processes carried out by molecular complexes such as the ribosome. Much of the haploinsufficiency in YPD is alleviated by slowing the growth rate of each strain in minimal media, suggesting that certain gene products are rate limiting for growth only in YPD. Overall, our results suggest that the primary mechanism of haploinsufficiency in yeast is due to insufficient protein production. We discuss the relevance of our findings in yeast to human haploinsufficiency disorders.  相似文献   

4.
Autism Spectrum Disorders (ASD) are highly heritable and characterised by impairments in social interaction and communication, and restricted and repetitive behaviours. Considering four sets of de novo copy number variants (CNVs) identified in 181 individuals with autism and exploiting mouse functional genomics and known protein-protein interactions, we identified a large and significantly interconnected interaction network. This network contains 187 genes affected by CNVs drawn from 45% of the patients we considered and 22 genes previously implicated in ASD, of which 192 form a single interconnected cluster. On average, those patients with copy number changed genes from this network possess changes in 3 network genes, suggesting that epistasis mediated through the network is extensive. Correspondingly, genes that are highly connected within the network, and thus whose copy number change is predicted by the network to be more phenotypically consequential, are significantly enriched among patients that possess only a single ASD-associated network copy number changed gene (p = 0.002). Strikingly, deleted or disrupted genes from the network are significantly enriched in GO-annotated positive regulators (2.3-fold enrichment, corrected p = 2×10−5), whereas duplicated genes are significantly enriched in GO-annotated negative regulators (2.2-fold enrichment, corrected p = 0.005). The direction of copy change is highly informative in the context of the network, providing the means through which perturbations arising from distinct deletions or duplications can yield a common outcome. These findings reveal an extensive ASD-associated molecular network, whose topology indicates ASD-relevant mutational deleteriousness and that mechanistically details how convergent aetiologies can result extensively from CNVs affecting pathways causally implicated in ASD.  相似文献   

5.
High-throughput sequencing technologies have offered in recent years new opportunities to study genome variations. These studies have mostly focused on single nucleotide polymorphisms, small insertions or deletions and on copy number variants. Other structural variants, such as large insertions or deletions, tandem duplications, translocations, and inversions are less well-studied, despite that some have an important impact on phenotypes. In the present study, we performed a large-scale survey of structural variants in cattle. We report the identification of 6,426 putative structural variants in cattle extracted from whole-genome sequence data of 62 bulls representing the three major French dairy breeds. These genomic variants affect DNA segments greater than 50 base pairs and correspond to deletions, inversions and tandem duplications. Out of these, we identified a total of 547 deletions and 410 tandem duplications which could potentially code for CNVs. Experimental validation was carried out on 331 structural variants using a novel high-throughput genotyping method. Out of these, 255 structural variants (77%) generated good quality genotypes and 191 (75%) of them were validated. Gene content analyses in structural variant regions revealed 941 large deletions removing completely one or several genes, including 10 single-copy genes. In addition, some of the structural variants are located within quantitative trait loci for dairy traits. This study is a pan-genome assessment of genomic variations in cattle and may provide a new glimpse into the bovine genome architecture. Our results may also help to study the effects of structural variants on gene expression and consequently their effect on certain phenotypes of interest.  相似文献   

6.
Sequencing projects have identified large numbers of rare stop-gain and frameshift variants in the human genome. As most of these are observed in the heterozygous state, they test a gene’s tolerance to haploinsufficiency and dominant loss of function. We analyzed the distribution of truncating variants across 16,260 autosomal protein coding genes in 11,546 individuals. We observed 39,893 truncating variants affecting 12,062 genes, which significantly differed from an expectation of 12,916 genes under a model of neutral de novo mutation (p<10−4). Extrapolating this to increasing numbers of sequenced individuals, we estimate that 10.8% of human genes do not tolerate heterozygous truncating variants. An additional 10 to 15% of truncated genes may be rescued by incomplete penetrance or compensatory mutations, or because the truncating variants are of limited functional impact. The study of protein truncating variants delineates the essential genome and, more generally, identifies rare heterozygous variants as an unexplored source of diversity of phenotypic traits and diseases.  相似文献   

7.
Genomic disorders are a clinically diverse group of conditions caused by gain, loss or re-orientation of a genomic region containing dosage-sensitive genes. One class of genomic disorder is caused by hemizygous deletions resulting in haploinsufficiency of a single or, more usually, several genes. For example, the heterozygous contiguous gene deletion on chromosome 22q11.2 causing DiGeorge syndrome involves at least 20-30 genes. Determining how the copy number variation (CNV) affects human variation and contributes to the aetiology and progression of various genomic disorders represents important questions for the future. Here, I will discuss the functional significance of one form of CNV, haploinsufficiency (i.e. loss of a gene copy), of DNA damage response components and its association with certain genomic disorders. There is increasing evidence that haploinsufficiency for certain genes encoding key players in the cells response to DNA damage, particularly those of the Ataxia Telangiectasia and Rad3-related (ATR)-pathway, has a functional impact. I will review this evidence and present examples of some well known clinically similar genomic disorders that have recently been shown to be defective in the ATR-dependent DNA damage response. Finally, I will discuss the potential implications of a haploinsufficiency-induced defective DNA damage response for the clinical management of certain human genomic disorders.Key Words: DNA damage response, ATR, haploinsufficiency, genomic disorders.  相似文献   

8.
Branchio-oto-renal (BOR) syndrome is an autosomal dominant disorder characterized by branchial arch anomalies, hearing loss and renal dysmorphology. Although haploinsufficiency of EYA1 and SIX1 are known to cause BOR, copy number variation analysis has only been performed on a limited number of BOR patients. In this study, we used high-resolution array-based comparative genomic hybridization on 32 BOR probands negative for coding-sequence and splice-site mutations in known BOR-causing genes to identify potential disease-causing genomic rearrangements. Of the >1,000 rare and novel copy number variants we identified, four were heterozygous deletions of EYA1 and several downstream genes that had nearly identical breakpoints associated with retroviral sequence blocks, suggesting that non-allelic homologous recombination seeded by this recombination hotspot is important in the pathogenesis of BOR. A different heterozygous deletion removing the last exon of EYA1 was identified in an additional proband. Thus, in total five probands (14 %) had deletions of all or part of EYA1. Using a novel disease-gene prioritization strategy that includes network analysis of genes associated with other deletions suggests that SHARPIN (Sipl1), FGF3 and the HOXA gene cluster may contribute to the pathogenesis of BOR.  相似文献   

9.
Numerous genetic studies have established a role for rare genomic variants in Congenital Heart Disease (CHD) at the copy number variation (CNV) and de novo variant (DNV) level. To identify novel haploinsufficient CHD disease genes, we performed an integrative analysis of CNVs and DNVs identified in probands with CHD including cases with sporadic thoracic aortic aneurysm. We assembled CNV data from 7,958 cases and 14,082 controls and performed a gene-wise analysis of the burden of rare genomic deletions in cases versus controls. In addition, we performed variation rate testing for DNVs identified in 2,489 parent-offspring trios. Our analysis revealed 21 genes which were significantly affected by rare CNVs and/or DNVs in probands. Fourteen of these genes have previously been associated with CHD while the remaining genes (FEZ1, MYO16, ARID1B, NALCN, WAC, KDM5B and WHSC1) have only been associated in small cases series or show new associations with CHD. In addition, a systems level analysis revealed affected protein-protein interaction networks involved in Notch signaling pathway, heart morphogenesis, DNA repair and cilia/centrosome function. Taken together, this approach highlights the importance of re-analyzing existing datasets to strengthen disease association and identify novel disease genes and pathways.  相似文献   

10.
Inverse metabolic engineering based on elementary mode analysis was applied to maximize the biomass yield of Escherchia coli MG1655. Elementary mode analysis was previously employed to identify among 1691 possible pathways for cell growth the most efficient pathway with maximum biomass yield. The metabolic network analysis predicted that deletion of only 6 genes reduces the number of possible elementary modes to the most efficient pathway. We have constructed a strain containing these gene deletions and we evaluated its properties in batch and in chemostat growth experiments. The results show that the theoretical predictions are closely matched by the properties of the designed strain.  相似文献   

11.
The DNA damage response (DDR) can restrain the ability of oncogenes to cause genomic instability and drive malignant transformation. The gene encoding the histone H2AX DDR factor maps to 11q23, a region frequently altered in human cancers. Since H2ax functions as a haploinsufficient suppressor of B lineage lymphomas with c-Myc amplification and/or translocation, we determined the impact of H2ax expression on the ability of deregulated c-Myc expression to cause genomic instability and drive transformation of B cells. Neither H2ax deficiency nor haploinsufficiency affected the rate of mortality of Eμ-c-Myc mice from B lineage lymphomas with genomic deletions and amplifications. Yet H2ax functioned in a dosage-dependent manner to prevent unbalanced translocations in Eμ-c-Myc tumors, demonstrating that H2ax functions in a haploinsufficient manner to suppress allelic imbalances and limit molecular heterogeneity within and among Eμ-c-Myc lymphomas. Regardless of H2ax copy number, all Eμ-c-Myc tumors contained identical amplification of chromosome 19 sequences spanning 20 genes. Many of these genes encode proteins with tumor-promoting activities, including Cd274, which encodes the PD-L1 programmed death ligand that induces T cell apoptosis and enables cancer cells to escape immune surveillance. This amplicon was in non-malignant B and T cells and non-lymphoid cells, linked to the Eμ-c-Myc transgene, and associated with overexpression of PD-L1 on non-malignant B cells. Our data demonstrate that, in addition to deregulated c-Myc expression, non-malignant B lineage lymphocytes of Eμ-c-Myc transgenic mice may have constitutive amplification and increased expression of other tumor-promoting genes.  相似文献   

12.
We report a genome-wide assessment of single nucleotide polymorphisms (SNPs) and copy number variants (CNVs) in schizophrenia. We investigated SNPs using 871 patients and 863 controls, following up the top hits in four independent cohorts comprising 1,460 patients and 12,995 controls, all of European origin. We found no genome-wide significant associations, nor could we provide support for any previously reported candidate gene or genome-wide associations. We went on to examine CNVs using a subset of 1,013 cases and 1,084 controls of European ancestry, and a further set of 60 cases and 64 controls of African ancestry. We found that eight cases and zero controls carried deletions greater than 2 Mb, of which two, at 8p22 and 16p13.11-p12.4, are newly reported here. A further evaluation of 1,378 controls identified no deletions greater than 2 Mb, suggesting a high prior probability of disease involvement when such deletions are observed in cases. We also provide further evidence for some smaller, previously reported, schizophrenia-associated CNVs, such as those in NRXN1 and APBA2. We could not provide strong support for the hypothesis that schizophrenia patients have a significantly greater “load” of large (>100 kb), rare CNVs, nor could we find common CNVs that associate with schizophrenia. Finally, we did not provide support for the suggestion that schizophrenia-associated CNVs may preferentially disrupt genes in neurodevelopmental pathways. Collectively, these analyses provide the first integrated study of SNPs and CNVs in schizophrenia and support the emerging view that rare deleterious variants may be more important in schizophrenia predisposition than common polymorphisms. While our analyses do not suggest that implicated CNVs impinge on particular key pathways, we do support the contribution of specific genomic regions in schizophrenia, presumably due to recurrent mutation. On balance, these data suggest that very few schizophrenia patients share identical genomic causation, potentially complicating efforts to personalize treatment regimens.  相似文献   

13.
14.
Dilated cardiomyopathy commonly causes heart failure and is the most frequent precipitating cause of heart transplantation. Familial dilated cardiomyopathy has been shown to be caused by rare variant mutations in more than 30 genes but only ~35% of its genetic cause has been identified, principally by using linkage-based or candidate gene discovery approaches. In a multigenerational family with autosomal dominant transmission, we employed whole-exome sequencing in a proband and three of his affected family members, and genome-wide copy number variation in the proband and his affected father and unaffected mother. Exome sequencing identified 428 single point variants resulting in missense, nonsense, or splice site changes. Genome-wide copy number analysis identified 51 insertion deletions and 440 copy number variants > 1 kb. Of these, a 8733 bp deletion, encompassing exon 4 of the heat shock protein cochaperone BCL2-associated athanogene 3 (BAG3), was found in seven affected family members and was absent in 355 controls. To establish the relevance of variants in this protein class in genetic DCM, we sequenced the coding exons in BAG3 in 311 other unrelated DCM probands and identified one frameshift, two nonsense, and four missense rare variants absent in 355 control DNAs, four of which were familial and segregated with disease. Knockdown of bag3 in a zebrafish model recapitulated DCM and heart failure. We conclude that new comprehensive genomic approaches have identified rare variants in BAG3 as causative of DCM.  相似文献   

15.
The identification of genetic and epigenetic alterations from primary tumor cells has become a common method to identify genes critical to the development and progression of cancer. We seek to identify those genetic and epigenetic aberrations that have the most impact on gene function within the tumor. First, we perform a bioinformatic analysis of copy number variation (CNV) and DNA methylation covering the genetic landscape of ovarian cancer tumor cells. We separately examined CNV and DNA methylation for 42 primary serous ovarian cancer samples using MOMA-ROMA assays and 379 tumor samples analyzed by The Cancer Genome Atlas. We have identified 346 genes with significant deletions or amplifications among the tumor samples. Utilizing associated gene expression data we predict 156 genes with altered copy number and correlated changes in expression. Among these genes CCNE1, POP4, UQCRB, PHF20L1 and C19orf2 were identified within both data sets. We were specifically interested in copy number variation as our base genomic property in the prediction of tumor suppressors and oncogenes in the altered ovarian tumor. We therefore identify changes in DNA methylation and expression for all amplified and deleted genes. We statistically define tumor suppressor and oncogenic features for these modalities and perform a correlation analysis with expression. We predicted 611 potential oncogenes and tumor suppressors candidates by integrating these data types. Genes with a strong correlation for methylation dependent expression changes exhibited at varying copy number aberrations include CDCA8, ATAD2, CDKN2A, RAB25, AURKA, BOP1 and EIF2C3. We provide copy number variation and DNA methylation analysis for over 11,500 individual genes covering the genetic landscape of ovarian cancer tumors. We show the extent of genomic and epigenetic alterations for known tumor suppressors and oncogenes and also use these defined features to identify potential ovarian cancer gene candidates.  相似文献   

16.
Deletions spanning chromosome 5q31.2 are among the most common recurring cytogenetic abnormalities detectable in myelodysplastic syndromes (MDS). Prior genomic studies have suggested that haploinsufficiency of multiple 5q31.2 genes may contribute to MDS pathogenesis. However, this hypothesis has never been formally tested. Therefore, we designed this study to systematically and comprehensively evaluate all 28 chromosome 5q31.2 genes and directly test whether haploinsufficiency of a single 5q31.2 gene may result from a heterozygous nucleotide mutation or microdeletion. We selected paired tumor (bone marrow) and germline (skin) DNA samples from 46 de novo MDS patients (37 without a cytogenetic 5q31.2 deletion) and performed total exonic gene resequencing (479 amplicons) and array comparative genomic hybridization (CGH). We found no somatic nucleotide changes in the 46 MDS samples, and no cytogenetically silent 5q31.2 deletions in 20/20 samples analyzed by array CGH. Twelve novel single nucleotide polymorphisms were discovered. The mRNA levels of 7 genes in the commonly deleted interval were reduced by 50% in CD34+ cells from del(5q) MDS samples, and no gene showed complete loss of expression. Taken together, these data show that small deletions and/or point mutations in individual 5q31.2 genes are not common events in MDS, and implicate haploinsufficiency of multiple genes as the relevant genetic consequence of this common deletion.  相似文献   

17.
18.
Schizophrenia (SZ) and autism spectrum disorders (ASD) are highly heritable neuropsychiatric disorders, although environmental factors, such as maternal immune activation (MIA), play a role as well. Cytokines mediate the effects of MIA on neurogenesis and behavior in animal models. However, MIA stimulators can also induce a febrile reaction, which could have independent effects on neurogenesis through heat shock (HS)-regulated cellular stress pathways. However, this has not been well-studied. To help understand the role of fever in MIA, we used a recently described model of human brain development in which induced pluripotent stem cells (iPSCs) differentiate into 3-dimensional neuronal aggregates that resemble a first trimester telencephalon. RNA-seq was carried out on aggregates that were heat shocked at 39°C for 24 hours, along with their control partners maintained at 37°C. 186 genes showed significant differences in expression following HS (p<0.05), including known HS-inducible genes, as expected, as well as those coding for NGFR and a number of SZ and ASD candidates, including SMARCA2, DPP10, ARNT2, AHI1 and ZNF804A. The degree to which the expression of these genes decrease or increase during HS is similar to that found in copy loss and copy gain copy number variants (CNVs), although the effects of HS are likely to be transient. The dramatic effect on the expression of some SZ and ASD genes places HS, and perhaps other cellular stressors, into a common conceptual framework with disease-causing genetic variants. The findings also suggest that some candidate genes that are assumed to have a relatively limited impact on SZ and ASD pathogenesis based on a small number of positive genetic findings, such as SMARCA2 and ARNT2, may in fact have a much more substantial role in these disorders - as targets of common environmental stressors.  相似文献   

19.
Gene dosage and gene duplicability   总被引:2,自引:0,他引:2       下载免费PDF全文
Qian W  Zhang J 《Genetics》2008,179(4):2319-2324
The evolutionary process leading to the fixation of newly duplicated genes is not well understood. It was recently proposed that the fixation of duplicate genes is frequently driven by positive selection for increased gene dosage (i.e., the gene dosage hypothesis), because haploinsufficient genes were reported to have more paralogs than haplosufficient genes in the human genome. However, the previous analysis incorrectly assumed that the presence of dominant abnormal alleles of a human gene means that the gene is haploinsufficient, ignoring the fact that many dominant abnormal alleles arise from gain-of-function mutations. Here we show in both humans and yeast that haploinsufficient genes generally do not duplicate more frequently than haplosufficient genes. Yeast haploinsufficient genes do exhibit enhanced retention after whole-genome duplication compared to haplosufficient genes if they encode members of stable protein complexes, but the same phenomenon is absent if the genes do not encode protein complex members, suggesting that the dosage balance effect rather than the dosage effect is the underlying cause of the phenomenon. On the basis of these and other results, we conclude that selection for higher gene dosage does not play a major role in driving the fixation of duplication genes.  相似文献   

20.
Genomic structural variation is an important and abundant source of genetic and phenotypic variation. In this study, we performed an initial analysis of copy number variations (CNVs) using BovineHD SNP genotyping data from 147 Holstein cows identified as having high or low feed efficiency as estimated by residual feed intake (RFI). We detected 443 candidate CNV regions (CNVRs) that represent 18.4?Mb (0.6?%) of the genome. To investigate the functional impacts of CNVs, we created two groups of 30 individual animals with extremely low or high estimated breeding values (EBVs) for RFI, and referred to these groups as low intake (LI; more efficient) or high intake (HI; less efficient), respectively. We identified 240 (~9.0?Mb) and 274 (~10.2?Mb) CNVRs from LI and HI groups, respectively. Approximately 30–40?% of the CNVRs were specific to the LI group or HI group of animals. The 240 LI CNVRs overlapped with 137 Ensembl genes. Network analyses indicated that the LI-specific genes were predominantly enriched for those functioning in the inflammatory response and immunity. By contrast, the 274 HI CNVRs contained 177 Ensembl genes. Network analyses indicated that the HI-specific genes were particularly involved in the cell cycle, and organ and bone development. These results relate CNVs to two key variables, namely immune response and organ and bone development. The data indicate that greater feed efficiency relates more closely to immune response, whereas cattle with reduced feed efficiency may have a greater capacity for organ and bone development.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号