首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Paired-end sequencing is emerging as a key technique for assessing genome rearrangements and structural variation on a genome-wide scale. This technique is particularly useful for detecting copy-neutral rearrangements, such as inversions and translocations, which are common in cancer and can produce novel fusion genes. We address the question of how much sequencing is required to detect rearrangement breakpoints and to localize them precisely using both theoretical models and simulation. We derive a formula for the probability that a fusion gene exists in a cancer genome given a collection of paired-end sequences from this genome. We use this formula to compute fusion gene probabilities in several breast cancer samples, and we find that we are able to accurately predict fusion genes in these samples with a relatively small number of fragments of large size. We further demonstrate how the ability to detect fusion genes depends on the distribution of gene lengths, and we evaluate how different parameters of a sequencing strategy impact breakpoint detection, breakpoint localization, and fusion gene detection, even in the presence of errors that suggest false rearrangements. These results will be useful in calibrating future cancer sequencing efforts, particularly large-scale studies of many cancer genomes that are enabled by next-generation sequencing technologies.  相似文献   

2.
It is widely established that chromosomal rearrangements induce oncogenesis in solid tumors. However, discovering chromosomal rearrangements that are targetable and actionable remains a difficulty. Targeting gene fusion or chromosomal rearrangement seems to be a powerful strategy to address malignancies characterized by gene rearrangement. Oncogenic NRG1 fusions are relatively rare drivers that infrequently occur across most tumor types. NRG1 fusions exhibit unique biological properties and are difficult to identify owing to their large intronic regions. NRG1 fusions can be detected using a variety of techniques, including fluorescence in situ hybridization, immunohistochemistry, or next-generation sequencing (NGS), with NGS-based RNA sequencing being the most sensitive. Previous studies have shown that NRG1 fusion protein induces tumorigenesis, and numerous therapies targeting the ErbB signaling pathway, such as ErbB kinase inhibitors and monoclonal antibodies, have initially demonstrated encouraging anticancer efficacy in malignant tumors carrying NRG1 fusions. In this review, we present the characteristics and prevalence of NRG1 fusions in solid tumors. Additionally, we discuss the laboratory approaches for diagnosing NRG1 gene fusions. More importantly, we outline promising strategies for treating malignancies with NRG1 fusion.  相似文献   

3.
Fusion genes formed by chromosomal rearrangements are common drivers of cancer. Recent innovations in the field of next-generation sequencing (NGS) have seen a dynamic shift from traditional fusion detection approaches, such as visual characterization by fluorescence, to more precise multiplexed methods. There are many different NGS-based approaches to fusion gene detection and deciding on the most appropriate method can be difficult. Beyond the experimental approach, consideration needs to be given to factors such as the ease of implementation, processing time, associated costs, and the level of expertise required for data analysis. Here, the different NGS-based methods for fusion gene detection, the basic principles underlying the techniques, and the benefits and limitations of each approach are reviewed. This article concludes with a discussion of how NGS will impact fusion gene detection in a clinical context and from where the next innovations are evolving.  相似文献   

4.
The sequencing of the 12 genomes of members of the genus Drosophila was taken as an opportunity to reevaluate the genetic and physical maps for 11 of the species, in part to aid in the mapping of assembled scaffolds. Here, we present an overview of the importance of cytogenetic maps to Drosophila biology and to the concepts of chromosomal evolution. Physical and genetic markers were used to anchor the genome assembly scaffolds to the polytene chromosomal maps for each species. In addition, a computational approach was used to anchor smaller scaffolds on the basis of the analysis of syntenic blocks. We present the chromosomal map data from each of the 11 sequenced non-Drosophila melanogaster species as a series of sections. Each section reviews the history of the polytene chromosome maps for each species, presents the new polytene chromosome maps, and anchors the genomic scaffolds to the cytological maps using genetic and physical markers. The mapping data agree with Muller's idea that the majority of Drosophila genes are syntenic. Despite the conservation of genes within homologous chromosome arms across species, the karyotypes of these species have changed through the fusion of chromosomal arms followed by subsequent rearrangement events.  相似文献   

5.
Ambiguous genitalia or disorder of the sexual development is a birth defect where the external genitals do not have the typical appearance of either a male or female. Here we report a boy with ambiguous genitalia and short stature. The cytogenetic analysis by G-banding revealed a small Y chromosome and an additional material on the 15p arm. Further, molecular cytogenetic analysis by Fluorescence in situ hybridization (FISH) using whole chromosome paint probes showed the presence of Y sequences on the 15p arm, confirming that it is a Y;15 translocation. Subsequent, FISH with centromere probe Y showed two signals depicting the presence of two centromeres and differing with a balanced translocation. The dicentric nature of the derivative 15 chromosome was confirmed by FISH with both 15 and Y centromeric probes. Further, the delineation of the Y chromosomal DNA was also done by quantitative real time PCR. Additional Y-short tandem repeat typing was performed to find out the extent of deletion on small Y chromosome. Fine mapping was carried out with 8 Y specific BAC clones which helped in defining the breakpoint regions. MLPA was performed to check the presence or absence of subtelomeric regions and SHOX regions on Y. Finally array CGH helped us in confirming the breakpoint regions. In our study we identified and characterized a novel complex Y chromosomal rearrangement with a complete deletion of the Yq region and duplication of the Yp region with one copy being translocated onto the15p arm. This is the first report of novel and unique Y complex rearrangement showing a deletion, duplication and a translocation in the same patient. The possible mechanism of the rearrangement and the phenotype–genotype correlation are discussed.  相似文献   

6.
Next-generation sequencing (NGS) technologies have revolutionised the analysis of genomic structural variants (SVs), providing significant insights into SV de novo formation based on analyses of rearrangement breakpoint junctions. The short DNA reads generated by NGS, however, have also created novel obstacles by biasing the ascertainment of SVs, an aspect that we refer to as the 'short-read dilemma'. For example, recent studies have found that SVs are often complex, with SV formation generating large numbers of breakpoints in a single event (multi-breakpoint SVs) or structurally polymorphic loci having multiple allelic states (multi-allelic SVs). This complexity may be obscured in short reads, unless the data is analysed and interpreted within its wider genomic context. We discuss how novel approaches will help to overcome the short-read dilemma, and how integration of other sources of information, including the structure of chromatin, may help in the future to deepen the understanding of SV formation processes.  相似文献   

7.
L Pezzoli  ME Sana  P Ferrazzi  M Iascone 《Gene》2012,507(2):165-169
We describe a male patient affected by hypertrophic cardiomyopathy (HCM) with no point mutations in the eight sarcomeric genes most commonly involved in the disease. By multiple ligation-dependent probe amplification (MLPA) we have identified a multi-exons C-terminus deletion in the cardiac myosin binding protein C (MYBPC3) gene. The rearrangement has been confirmed by long PCR and breakpoints have been defined by sequencing. The 3.5kb terminal deletion is mediated by Alu-repeat elements and is predicted to result in haploinsufficiency of MYBPC3. To exclude the presence of other rare pathogenic variants in additional HCM genes, we performed targeted next-generation sequencing (NGS) of 88 cardiomyopathy-associated genes but we did not identify any further mutation. Interestingly, the MYBPC3 multi-exons deletion was detectable by NGS. This finding broadens the range of mutational spectrum observed in HCM, contributing to understanding the genetic basis of the most common inherited cardiovascular disease. Moreover, our data suggest that NGS may represent a new tool to achieve a deeper insight into molecular basis of complex diseases, allowing to detect in a single experiment both point mutations and gene rearrangements.  相似文献   

8.
Bhutkar A  Schaeffer SW  Russo SM  Xu M  Smith TF  Gelbart WM 《Genetics》2008,179(3):1657-1680
The availability of 12 complete genomes of various species of genus Drosophila provides a unique opportunity to analyze genome-scale chromosomal rearrangements among a group of closely related species. This article reports on the comparison of gene order between these 12 species and on the fixed rearrangement events that disrupt gene order. Three major themes are addressed: the conservation of syntenic blocks across species, the disruption of syntenic blocks (via chromosomal inversion events) and its relationship to the phylogenetic distribution of these species, and the rate of rearrangement events over evolutionary time. Comparison of syntenic blocks across this large genomic data set confirms that genetic elements are largely (95%) localized to the same Muller element across genus Drosophila species and paracentric inversions serve as the dominant mechanism for shuffling the order of genes along a chromosome. Gene-order scrambling between species is in accordance with the estimated evolutionary distances between them and we find it to approximate a linear process over time (linear to exponential with alternate divergence time estimates). We find the distribution of synteny segment sizes to be biased by a large number of small segments with comparatively fewer large segments. Our results provide estimated chromosomal evolution rates across this set of species on the basis of whole-genome synteny analysis, which are found to be higher than those previously reported. Identification of conserved syntenic blocks across these genomes suggests a large number of conserved blocks with varying levels of embryonic expression correlation in Drosophila melanogaster. On the other hand, an analysis of the disruption of syntenic blocks between species allowed the identification of fixed inversion breakpoints and estimates of breakpoint reuse and lineage-specific breakpoint event segregation.  相似文献   

9.
10.
Next‐generation sequencing (NGS) provides a powerful tool for the discovery of important genes and alleles in crop plants and their wild relatives. Despite great advances in NGS technologies, whole‐genome shotgun sequencing is cost‐prohibitive for species with complex genomes. An attractive option is to reduce genome complexity to a single chromosome prior to sequencing. This work describes a strategy for studying the genomes of distant wild relatives of wheat by isolating single chromosomes from addition or substitution lines, followed by chromosome sorting using flow cytometry and sequencing of chromosomal DNA by NGS technology. We flow‐sorted chromosome 5Mg from a wheat/Aegilops geniculata disomic substitution line [DS5Mg (5D)] and sequenced it using an Illumina HiSeq 2000 system at approximately 50 × coverage. Paired‐end sequences were assembled and used for structural and functional annotation. A total of 4236 genes were annotated on 5Mg, in close agreement with the predicted number of genes on wheat chromosome 5D (4286). Single‐gene FISH indicated no major chromosomal rearrangements between chromosomes 5Mg and 5D. Comparing chromosome 5Mg with model grass genomes identified synteny blocks in Brachypodium distachyon, rice (Oryza sativa), sorghum (Sorghum bicolor) and barley (Hordeum vulgare). Chromosome 5Mg‐specific SNPs and cytogenetic probe‐based resources were developed and validated. Deletion bin‐mapped and ordered 5Mg SNP markers will be useful to track 5M‐specific introgressions and translocations. This study provides a detailed sequence‐based analysis of the composition of a chromosome from a distant wild relative of bread wheat, and opens up opportunities to develop genomic resources for wild germplasm to facilitate crop improvement.  相似文献   

11.
12.
13.
Chronic myelogenous leukaemia (CML) has a special phenomenon of chromosome translocation, which is called Philadelphia chromosome translocation. However, the detailed connection of this structure is troublesome and expensive to be identified. Low‐coverage whole genome sequencing (LCWGS) could not only detect the previously unknown chromosomal translocation, but also provide the breakpoint candidate small region (with an accuracy of ±200 bases). Importantly, the sequencing cost of LCWGS is about US$300. Then, with the Sanger DNA sequencing, the precise breakpoint can be determined at a single base level. In our project, with LCWGS, BCR and ABL1 are successfully identified to be disrupted in three CML patients (at chr22:23,632,356 and chr9:133,590,450; chr22:23,633,748 and chr9:133,635,781; chr22: 23,631,831 and chr9:133,598,513, respectively). Due to the reconnection after chromosome breakage, classical fusion gene (BCR::ABL1) was found in bone marrow and peripheral blood. The precise breakpoints were helpful to investigate the pathogenic mechanism of CML and could better guide the classification of CML subtypes. This LCWGS method is universal and can be used to detect all diseases related to chromosome variation, such as solid tumours, liquid tumours and birth defects.  相似文献   

14.
Complex chromosomal rearrangements are very rare chromosomal abnormalities. Individuals with a complex chromosomal rearrangement can be phenotypically normal or display a clinical abnormality. It is believed that these abnormalities are due to either microdeletions or microduplications at the translocation breakpoints or as a result of disruption of the genes located in the breakpoints. In this study we describe a 2-year-old child with mental retardation and developmental delay in whom a de novo apparently balanced exceptional complex chromosomal rearrangement was found through conventional cytogenetic analysis. Using both cytogenetic and FISH analysis, the patient's karyotype was found to be: 46,XY,der(5)t(5;7)(p15.1;7q34),t(5;8)(q13.1;8q24.1)dn. A large, clinically significant deletion which encompassed 887.69 kb was detected at the 5q12.1–5q12.3 (chr5:62.886.523–63.774.210) genomic region using array-CGH. This deleted region includes the HTR1A and RNF180 genes. This is the first report of an individual with an apparently balanced complex chromosomal rearrangement in conjunction with a microdeletion at 5q12.1–5q12.3 in which there are both mental-motor retardation and dysmorphia.  相似文献   

15.

Background

A key challenge in the realm of human disease research is next generation sequencing (NGS) interpretation, whereby identified filtered variant-harboring genes are associated with a patient’s disease phenotypes. This necessitates bioinformatics tools linked to comprehensive knowledgebases. The GeneCards suite databases, which include GeneCards (human genes), MalaCards (human diseases) and PathCards (human pathways) together with additional tools, are presented with the focus on MalaCards utility for NGS interpretation as well as for large scale bioinformatic analyses.

Results

VarElect, our NGS interpretation tool, leverages the broad information in the GeneCards suite databases. MalaCards algorithms unify disease-related terms and annotations from 69 sources. Further, MalaCards defines hierarchical relatedness—aliases, disease families, a related diseases network, categories and ontological classifications. GeneCards and MalaCards delineate and share a multi-tiered, scored gene-disease network, with stringency levels, including the definition of elite status—high quality gene-disease pairs, coming from manually curated trustworthy sources, that includes 4500 genes for 8000 diseases. This unique resource is key to NGS interpretation by VarElect. VarElect, a comprehensive search tool that helps infer both direct and indirect links between genes and user-supplied disease/phenotype terms, is robustly strengthened by the information found in MalaCards. The indirect mode benefits from GeneCards’ diverse gene-to-gene relationships, including SuperPaths—integrated biological pathways from 12 information sources. We are currently adding an important information layer in the form of “disease SuperPaths”, generated from the gene-disease matrix by an algorithm similar to that previously employed for biological pathway unification. This allows the discovery of novel gene-disease and disease–disease relationships. The advent of whole genome sequencing necessitates capacities to go beyond protein coding genes. GeneCards is highly useful in this respect, as it also addresses 101,976 non-protein-coding RNA genes. In a more recent development, we are currently adding an inclusive map of regulatory elements and their inferred target genes, generated by integration from 4 resources.

Conclusions

MalaCards provides a rich big-data scaffold for in silico biomedical discovery within the gene-disease universe. VarElect, which depends significantly on both GeneCards and MalaCards power, is a potent tool for supporting the interpretation of wet-lab experiments, notably NGS analyses of disease. The GeneCards suite has thus transcended its 2-decade role in biomedical research, maturing into a key player in clinical investigation.
  相似文献   

16.
Cytogenetic analyses of constitutional diseases have disclosed several chromosomal rearrangements. At the molecular level, these rearrangements often result in the breakage of genes or alteration of genome architecture. Fluorescence in situ hybridization (FISH) and molecular investigations of a patient showing hypotonia and dysmorphic traits revealed a masked complex chromosome abnormality previously detected by G-banding as a simple 8qter deletion. To characterize the genetic rearrangements panels of bacterial artificial chromosomes (BACs) covering 8q24.22-->qter were constructed, and short tandem repeats (STRs) were used to refine the localization of the breakpoints and to assess the parental origin of the defect. Chromosome 8 displayed the breakpoint at 8q24.22 and an unexpected distal breakpoint at 8q24.23 resulting in unbalanced translocation of a small 8q genomic region on the chromosome 16qter. The study of the 16qter region revealed that the 16q subtelomere was retained and the translocated material of distal 8q was juxtaposed. Moreover, molecular analyses showed that part of the translocated 8qter segment on der(16) was partially duplicated, inverted and that the rearrangement arose in the paternal meiosis. These findings emphasize the complexity of some only apparently simple chromosomal rearrangements and suggest a subtelomeric FISH approach to enhance diagnostic care when a cytogenetic terminal deletion is found.  相似文献   

17.
Jin  Yaqiong  Chen  Geng  Xiao  Wenming  Hong  Huixiao  Xu  Joshua  Guo  Yongli  Xiao  Wenzhong  Shi  Tieliu  Shi  Leming  Tong  Weida  Ning  Baitang 《中国科学:生命科学英文版》2019,62(7):895-904
High-throughput next generation sequencing(NGS) is a shotgun approach applied in a parallel fashion by which the genome is fragmented and sequenced through small pieces and then analyzed either by aligning to a known reference genome or by de novo assembly without reference genome. This technology has led researchers to conduct an explosion of sequencing related projects in multidisciplinary fields of science. However, due to the limitations of sequencing-based chemistry, length of sequencing reads and the complexity of genes, it is difficult to determine the sequences of some portions of the human genome, leaving gaps in genomic data that frustrate further analysis. Particularly, some complex genes are difficult to be accurately sequenced or mapped because they contain high GC-content and/or low complexity regions, and complicated pseudogenes, such as the genes encoding xenobiotic metabolizing enzymes and transporters(XMETs). The genetic variants in XMET genes are critical to predicate interindividual variability in drug efficacy, drug safety and susceptibility to environmental toxicity. We summarized and discussed challenges, wet-lab methods, and bioinformatics algorithms in sequencing "complex" XMET genes, which may provide insightful information in the application of NGS technology for implementation in toxicogenomics and pharmacogenomics.  相似文献   

18.
19.
Gibbon species have accumulated an unusually high number of chromosomal changes since diverging from the common hominoid ancestor 15–18 million years ago. The cause of this increased rate of chromosomal rearrangements is not known, nor is it known if genome architecture has a role. To address this question, we analyzed sequences spanning 57 breaks of synteny between northern white-cheeked gibbons (Nomascus l. leucogenys) and humans. We find that the breakpoint regions are enriched in segmental duplications and repeats, with Alu elements being the most abundant. Alus located near the gibbon breakpoints (<150 bp) have a higher CpG content than other Alus. Bisulphite allelic sequencing reveals that these gibbon Alus have a lower average density of methylated cytosine that their human orthologues. The finding of higher CpG content and lower average CpG methylation suggests that the gibbon Alu elements are epigenetically distinct from their human orthologues. The association between undermethylation and chromosomal rearrangement in gibbons suggests a correlation between epigenetic state and structural genome variation in evolution.  相似文献   

20.
MOTIVATION: Loss of chromosomal material is often observed in cancer cells. In this situation the expression of genes is related to their position on the genome. Epigenetic phenomena may also silence several genes in the same region of a chromosome. While cytogenetic or other molecular genetic methods spot changes of DNA copy number, they cannot detect other causes of silencing. RESULTS: We propose a method that utilizes the link from expression information gained from high-density DNA microarrays to the gene locus according to current databases. Statistical methods adequate to spot conspicuous runs of non-expressed genes are introduced and compared to one another by merit of their power and robustness against false positives. AVAILABILITY: Code for the formulae can be obtained (R code) via http://www.panix.com/~derwisch/hannes/longrun  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号