首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.

Background

The detailed study of breakpoints associated with copy number variants (CNVs) can elucidate the mutational mechanisms that generate them and the comparison of breakpoints across species can highlight differences in genomic architecture that may lead to lineage-specific differences in patterns of CNVs. Here, we provide a detailed analysis of Drosophila CNV breakpoints and contrast it with similar analyses recently carried out for the human genome.

Results

By applying split-read methods to a total of 10x coverage of 454 shotgun sequence across nine lines of D. melanogaster and by re-examining a previously published dataset of CNVs detected using tiling arrays, we identified the precise breakpoints of more than 600 insertions, deletions, and duplications. Contrasting these CNVs with those found in humans showed that in both taxa CNV breakpoints fall into three classes: blunt breakpoints; simple breakpoints associated with microhomology; and breakpoints with additional nucleotides inserted/deleted and no microhomology. In both taxa CNV breakpoints are enriched with non-B DNA sequence structures, which may impair DNA replication and/or repair. However, in contrast to human genomes, non-allelic homologous-recombination (NAHR) plays a negligible role in CNV formation in Drosophila. In flies, non-homologous repair mechanisms are responsible for simple, recurrent, and complex CNVs, including insertions of de novo sequence as large as 60 bp.

Conclusions

Humans and Drosophila differ considerably in the importance of homology-based mechanisms for the formation of CNVs, likely as a consequence of the differences in the abundance and distribution of both segmental duplications and transposable elements between the two genomes.  相似文献   

2.
Genomic rearrangements involving the peripheral myelin protein gene (PMP22) in human chromosome 17p12 are associated with neuropathy: duplications cause Charcot-Marie-Tooth disease type 1A (CMT1A), whereas deletions lead to hereditary neuropathy with liability to pressure palsies (HNPP). Our previous studies showed that >99% of these rearrangements are recurrent and mediated by nonallelic homologous recombination (NAHR). Rare copy number variations (CNVs) generated by nonrecurrent rearrangements also exist in 17p12, but their underlying mechanisms are not well understood. We investigated 21 subjects with rare CNVs associated with CMT1A or HNPP by oligonucleotide-based comparative genomic hybridization microarrays and breakpoint sequence analyses, and we identified 17 unique CNVs, including two genomic deletions, ten genomic duplications, two complex rearrangements, and three small exonic deletions. Each of these CNVs includes either the entire PMP22 gene, or exon(s) only, or ultraconserved potential regulatory sequences upstream of PMP22, further supporting the contention that PMP22 is the critical gene mediating the neuropathy phenotypes associated with 17p12 rearrangements. Breakpoint sequence analysis reveals that, different from the predominant NAHR mechanism in recurrent rearrangement, various molecular mechanisms, including nonhomologous end joining, Alu-Alu-mediated recombination, and replication-based mechanisms (e.g., FoSTeS and/or MMBIR), can generate nonrecurrent 17p12 rearrangements associated with neuropathy. We document a multitude of ways in which gene function can be altered by CNVs. Given the characteristics, including small size, structural complexity, and location outside of coding regions, of selected rare CNVs, their identification remains a challenge for genome analysis. Rare CNVs may potentially represent an important portion of “missing heritability” for human diseases.  相似文献   

3.
The aims of this study were to create a copy number variant (CNV) profile of human chromosome 22 and to establish a genotype-phenotype correlation for patients with genomic abnormalities on chromosome 22. Thus, 1,654 consecutive pediatric patients with a diversity of clinical findings were evaluated by high-resolution chromosomal microarray analysis (CMA). We identified 25 individuals with abnormal CNVs on chromosome 22, representing 1.5% of the cases analyzed in this cohort. Meanwhile, we detected 1,298 benign CNVs on this chromosome in these individuals. Twenty-one of the 25 abnormal CNVs and the majority of the benign CNVs occurred through involvement of the 8 unstable genomic regions enriched with low copy repeats (LCR22A-H). The highly dynamic status of LCR22s within the 22q11 region facilitates the formation of diverse genomic abnormalities. This CNV profile provides a general perspective of the spectrum of chromosome 22 genomic imbalances and subsequently improves the CNV-phenotype correlations.  相似文献   

4.
Copy number variants (CNVs) are pervasive in several animal and plant genomes and contribute to shaping genetic diversity. In barley, there is evidence that changes in gene copy number underlie important agronomic traits. The recently released reference sequence of barley represents a valuable genomic resource for unveiling the incidence of CNVs that affect gene content and for identifying sequence features associated with CNV formation. Using exome sequencing and read count data, we detected 16 605 deletions and duplications that affect barley gene content by surveying a diverse panel of 172 cultivars, 171 landraces, 22 wild relatives and other 32 uncategorized domesticated accessions. The quest for segmental duplications (SDs) in the reference sequence revealed many low‐copy repeats, most of which overlap predicted coding sequences. Statistical analyses revealed that the incidence of CNVs increases significantly in SD‐rich regions, indicating that these sequence elements act as hot spots for the formation of CNVs. The present study delivers a comprehensive genome‐wide study of CNVs affecting barley gene content and implicates SDs in the molecular mechanisms that lead to the formation of this class of CNVs.  相似文献   

5.
Despite considerable excitement over the potential functional significance of copy-number variants (CNVs), we still lack knowledge of the fine-scale architecture of the large majority of CNV regions in the human genome. In this study, we used a high-resolution array-based comparative genomic hybridization (aCGH) platform that targeted known CNV regions of the human genome at approximately 1 kb resolution to interrogate the genomic DNAs of 30 individuals from four HapMap populations. Our results revealed that 1020 of 1153 CNV loci (88%) were actually smaller in size than what is recorded in the Database of Genomic Variants based on previously published studies. A reduction in size of more than 50% was observed for 876 CNV regions (76%). We conclude that the total genomic content of currently known common human CNVs is likely smaller than previously thought. In addition, approximately 8% of the CNV regions observed in multiple individuals exhibited genomic architectural complexity in the form of smaller CNVs within larger ones and CNVs with interindividual variation in breakpoints. Future association studies that aim to capture the potential influences of CNVs on disease phenotypes will need to consider how to best ascertain this previously uncharacterized complexity.  相似文献   

6.
Chromosome breakage in germline and somatic genomes gives rise to copy number variation (CNV) responsible for genomic disorders and tumorigenesis. DNA sequence is known to play an important role in breakage at chromosome fragile sites; however, the sequences susceptible to double-strand breaks (DSBs) underlying CNV formation are largely unknown. Here we analyze 140 germline CNV breakpoints from 116 individuals to identify DNA sequences enriched at breakpoint loci compared to 2800 simulated control regions. We find that, overall, CNV breakpoints are enriched in tandem repeats and sequences predicted to form G-quadruplexes. G-rich repeats are overrepresented at terminal deletion breakpoints, which may be important for the addition of a new telomere. Interstitial deletions and duplication breakpoints are enriched in Alu repeats that in some cases mediate non-allelic homologous recombination (NAHR) between the two sides of the rearrangement. CNV breakpoints are enriched in certain classes of repeats that may play a role in DNA secondary structure, DSB susceptibility and/or DNA replication errors.  相似文献   

7.
Lee JA  Carvalho CM  Lupski JR 《Cell》2007,131(7):1235-1247
The prevailing mechanism for recurrent and some nonrecurrent rearrangements causing genomic disorders is nonallelic homologous recombination (NAHR) between region-specific low-copy repeats (LCRs). For other nonrecurrent rearrangements, nonhomologous end joining (NHEJ) is implicated. Pelizaeus-Merzbacher disease (PMD) is an X-linked dysmyelinating disorder caused most frequently (60%-70%) by nonrecurrent duplication of the dosage-sensitive proteolipid protein 1 (PLP1) gene but also by nonrecurrent deletion or point mutations. Many PLP1 duplication junctions are refractory to breakpoint sequence analysis, an observation inconsistent with a simple recombination mechanism. Our current analysis of junction sequences in PMD patients confirms the occurrence of simple tandem PLP1 duplications but also uncovers evidence for sequence complexity at some junctions. These data are consistent with a replication-based mechanism that we term FoSTeS, for replication Fork Stalling and Template Switching. We propose that complex duplication and deletion rearrangements associated with PMD, and potentially other nonrecurrent rearrangements, may be explained by this replication-based mechanism.  相似文献   

8.
Copy number variants (CNVs) of the Williams–Beuren syndrome (WBS) 7q11.23 region are responsible for neurodevelopmental disorders with multi-system involvement and variable expressivity. Typical features of WBS microdeletion comprise a recognizable pattern of facial dysmorphisms, supravalvular aortic stenosis, connective tissue abnormalities, hypercalcemia, and a distinctive neurobehavioral phenotype. Conversely, the phenotype of patients carrying the 7q11.23 reciprocal duplications includes less distinctive facial dysmorphisms and prominent speech delay. The common deletion/duplication ranges in size from 1.5 to 1.8 Mb and encompasses approximately 28 genes. This region is flanked by low copy repeats (LCRs) with greater than ~97% identity, which can mediate non-allelic homologous recombination resulting from misalignment of LCRs during meiosis. A clear genotype–phenotype correlation has been established in WBS only for the elastin gene, which is responsible for the vascular and connective tissue abnormalities. The molecular substrates underlying the other clinical features of 7q11.23 CNVs, including the neurocognitive phenotypes, are still debated. Recent studies suggest that besides the role of the genes in the deleted/duplicated interval, multiple factors such as regulatory sequences, epigenetic mechanisms, parental origin of the CNV, and nucleotide variations in the non-deleted/duplicated allele may be important in determining the variable expressivity of 7q11.23 CNV phenotypes. Here, we review the clinical and molecular findings and the recent insights on genomic disorders associated with CNVs involving the 7q11.23 region.  相似文献   

9.
Genomic rearrangements can cause both Mendelian and complex disorders. Currently, several major mechanisms causing genomic rearrangements, such as non-allelic homologous recombination (NAHR), non-homologous end joining (NHEJ), fork stalling and template switching (FoSTeS), and microhomology-mediated break-induced replication (MMBIR), have been proposed. However, to what extent these mechanisms contribute to gene-specific pathogenic copy-number variations (CNVs) remains understudied. Furthermore, few studies have resolved these pathogenic alterations at the nucleotide-level. Accordingly, our aim was to explore which mechanisms contribute to a large, unique set of locus-specific non-recurrent genomic rearrangements causing the genetic neurocutaneous disorder neurofibromatosis type 1 (NF1). Through breakpoint-spanning PCR as well as array comparative genomic hybridization, we have identified the breakpoints in 85 unrelated individuals carrying an NF1 intragenic CNV. Furthermore, we characterized the likely rearrangement mechanisms of these 85 CNVs, along with those of two additional previously published NF1 intragenic CNVs. Unlike the most typical recurrent rearrangements mediated by flanking low-copy repeats (LCRs), NF1 intragenic rearrangements vary in size, location, and rearrangement mechanisms. We propose the DNA-replication-based mechanisms comprising both FoSTeS and/or MMBIR and serial replication stalling to be the predominant mechanisms leading to NF1 intragenic CNVs. In addition to the loop within a 197-bp palindrome located in intron 40, four Alu elements located in introns 1, 2, 3, and 50 were also identified as intragenic-rearrangement hotspots within NF1.  相似文献   

10.
ABSTRACT: BACKGROUND: Copy number variants (CNVs) account for substantial variation between genomes and are a major source of normal and pathogenic phenotypic differences. The dog is an ideal model to investigate mutational mechanisms that generate CNVs as its genome lacks a functional ortholog of the PRDM9 gene implicated in recombination and CNV formation in humans. Here we comprehensively assay CNVs using high-density array comparative genomic hybridization in 50 dogs from 17 dog breeds and 3 gray wolves. RESULTS: We use a stringent new method to identify a total of 430 high-confidence CNV loci, that range in size from 9 kb to 1.6 Mb and span 26.4 Mb, or 1.08%, of the assayed dog genome, overlapping 413 annotated genes. 98% of CNVs observed in each breed are also observed in multiple breeds. CNVs predicted to disrupt gene function are significantly less common than expected by chance. We identify a significant overrepresentation of peaks of GC content, previously shown to be enriched in dog recombination hotspots, in the vicinity of CNV breakpoints. CONCLUSIONS: A number of the CNVs identified by this study are candidates for generating breed-specific phenotypes. Purifying selection seems to be a major factor shaping structural variation in the dog genome, suggesting that many CNVs are deleterious. Localized peaks of GC content appear to be novel sites of CNV formation in the dog genome by non-allelic homologous recombination, potentially activated by the loss of PRDM9. These sequence features may have driven genome instability and chromosomal rearrangements throughout canid evolution.  相似文献   

11.

Background  

Algorithms and software for CNV detection have been developed, but they detect the CNV regions sample-by-sample with individual-specific breakpoints, while common CNV regions are likely to occur at the same genomic locations across different individuals in a homogenous population. Current algorithms to detect common CNV regions do not account for the varying reliability of the individual CNVs, typically reported as confidence scores by SNP-based CNV detection algorithms. General methodologies for identifying these recurrent regions, especially those directed at SNP arrays, are still needed.  相似文献   

12.
Copy number variants (CNVs) are widely distributed throughout the human genome, where they contribute to genetic variation and phenotypic diversity. De novo CNVs are also a major cause of numerous genetic and developmental disorders. However, unlike many other types of mutations, little is known about the genetic and environmental risk factors for new and deleterious CNVs. DNA replication errors have been implicated in the generation of a major class of CNVs, the nonrecurrent CNVs. We have found that agents that perturb normal replication and create conditions of replication stress, including hydroxyurea and aphidicolin, are potent inducers of nonrecurrent CNVs in cultured human cells. These findings have broad implications for identifying CNV risk factors and for hydroxyurea-related therapies in humans.  相似文献   

13.
Nonallelic homologous recombination (NAHR), occurring between low-copy repeats (LCRs) >10 kb in size and sharing >97% DNA sequence identity, is responsible for the majority of recurrent genomic rearrangements in the human genome. Recent studies have shown that transposable elements (TEs) can also mediate recurrent deletions and translocations, indicating the features of substrates that mediate NAHR may be significantly less stringent than previously believed. Using >4 kb length and >95% sequence identity criteria, we analyzed of the genome-wide distribution of long interspersed element (LINE) retrotransposon and their potential to mediate NAHR. We identified 17 005 directly oriented LINE pairs located <10 Mbp from each other as potential NAHR substrates, placing 82.8% of the human genome at risk of LINE–LINE-mediated instability. Cross-referencing these regions with CNVs in the Baylor College of Medicine clinical chromosomal microarray database of 36 285 patients, we identified 516 CNVs potentially mediated by LINEs. Using long-range PCR of five different genomic regions in a total of 44 patients, we confirmed that the CNV breakpoints in each patient map within the LINE elements. To additionally assess the scale of LINE–LINE/NAHR phenomenon in the human genome, we tested DNA samples from six healthy individuals on a custom aCGH microarray targeting LINE elements predicted to mediate CNVs and identified 25 LINE–LINE rearrangements. Our data indicate that LINE–LINE-mediated NAHR is widespread and under-recognized, and is an important mechanism of structural rearrangement contributing to human genomic variability.  相似文献   

14.
15.
Although copy number variation (CNV) has recently received much attention as a form of structure variation within the human genome, knowledge is still inadequate on fundamental CNV characteristics such as occurrence rate, genomic distribution and ethnic differentiation. In the present study, we used the Affymetrix GeneChip® Mapping 500K Array to discover and characterize CNVs in the human genome and to study ethnic differences of CNVs between Caucasians and Asians. Three thousand and nineteen CNVs, including 2381 CNVs in autosomes and 638 CNVs in X chromosome, from 985 Caucasian and 692 Asian individuals were identified, with a mean length of 296 kb. Among these CNVs, 190 had frequencies greater than 1% in at least one ethnic group, and 109 showed significant ethnic differences in frequencies (p<0.01). After merging overlapping CNVs, 1135 copy number variation regions (CNVRs), covering approximately 439 Mb (14.3%) of the human genome, were obtained. Our findings of ethnic differentiation of CNVs, along with the newly constructed CNV genomic map, extend our knowledge on the structural variation in the human genome and may furnish a basis for understanding the genomic differentiation of complex traits across ethnic groups.  相似文献   

16.
Copy number variants (CNVs) are genomic rearrangements resulting from gains or losses of DNA segments. Typically, the term refers to rearrangements of sequences larger than 1 kb. This type of polymorphism has recently been shown to be a key contributor to intra-species genetic variation, along with single-nucleotide polymorphisms and short insertion-deletion polymorphisms. Over the last decade, a growing number of studies have highlighted the importance of copy number variation (CNV) as a factor affecting human phenotype and individual CNVs have been linked to risks for severe diseases. In plants, the exploration of the extent and role of CNV is still just beginning. Initial genomic analyses indicate that CNVs are prevalent in plants and have greatly affected plant genome evolution. Many CNV events have been observed in outcrossing and autogamous species. CNVs are usually found on all chromosomes, with CNV hotspots interspersed with regions of very low genetic variation. Although CNV is mainly associated with intergenic regions, many CNVs encompass protein-coding genes. The collected data suggest that CNV mainly affects the members of large families of functionally redundant genes. Thus, the effects of individual CNV events on phenotype are usually modest. Nevertheless, there are many cases in which CNVs for specific genes have been linked to important traits such as flowering time, plant height and resistance to biotic and abiotic stress. Recent reports suggest that CNVs may form rapidly in response to stress.  相似文献   

17.
We describe genomic structures of 59 X-chromosome segmental duplications that include the proteolipid protein 1 gene (PLP1) in patients with Pelizaeus-Merzbacher disease. We provide the first report of 13 junction sequences, which gives insight into underlying mechanisms. Although proximal breakpoints were highly variable, distal breakpoints tended to cluster around low-copy repeats (LCRs) (50% of distal breakpoints), and each duplication event appeared to be unique (100 kb to 4.6 Mb in size). Sequence analysis of the junctions revealed no large homologous regions between proximal and distal breakpoints. Most junctions had microhomology of 1-6 bases, and one had a 2-base insertion. Boundaries between single-copy and duplicated DNA were identical to the reference genomic sequence in all patients investigated. Taken together, these data suggest that the tandem duplications are formed by a coupled homologous and nonhomologous recombination mechanism. We suggest repair of a double-stranded break (DSB) by one-sided homologous strand invasion of a sister chromatid, followed by DNA synthesis and nonhomologous end joining with the other end of the break. This is in contrast to other genomic disorders that have recurrent rearrangements formed by nonallelic homologous recombination between LCRs. Interspersed repetitive elements (Alu elements, long interspersed nuclear elements, and long terminal repeats) were found at 18 of the 26 breakpoint sequences studied. No specific motif that may predispose to DSBs was revealed, but single or alternating tracts of purines and pyrimidines that may cause secondary structures were common. Analysis of the 2-Mb region susceptible to duplications identified proximal-specific repeats and distal LCRs in addition to the previously reported ones, suggesting that the unique genomic architecture may have a role in nonrecurrent rearrangements by promoting instability.  相似文献   

18.
Germline copy number variation (CNV) is considered to be an important form of human genetic polymorphisms. Previous studies have identified amounts of CNVs in human genome by advanced technologies, such as comparative genomic hybridization, single nucleotide genotyping, and high-throughput sequencing. CNV is speculated to be derived from multiple mechanisms, such as nonallelic homologous recombination (NAHR) and nonhomologous end-joining (NHEJ). CNVs cover a much larger genome scale than single nucleotide polymorphisms (SNPs), and may alter gene expression levels by means of gene dosage, gene fusion, gene disruption, and long-range regulation effects, thus affecting individual phenotypes and playing crucial roles in human pathogenesis. The number of studies linking CNVs with common complex diseases has increased dramatically in recent years. Here, we provide a comprehensive review of the current understanding of germline CNVs, and summarize the association of germline CNVs with the susceptibility to a wide variety of human diseases that were identified in recent years. We also propose potential issues that should be addressed in future studies.  相似文献   

19.
DNA replication errors are a major driver of evolution—from single nucleotide polymorphisms to large-scale copy number variations (CNVs). Here we test a specific replication-based model to explain the generation of interstitial, inverted triplications. While no genetic information is lost, the novel inversion junctions and increased copy number of the included sequences create the potential for adaptive phenotypes. The model—Origin-Dependent Inverted-Repeat Amplification (ODIRA)—proposes that a replication error at pre-existing short, interrupted, inverted repeats in genomic sequences generates an extrachromosomal, inverted dimeric, autonomously replicating intermediate; subsequent genomic integration of the dimer yields this class of CNV without loss of distal chromosomal sequences. We used a combination of in vitro and in vivo approaches to test the feasibility of the proposed replication error and its downstream consequences on chromosome structure in the yeast Saccharomyces cerevisiae. We show that the proposed replication error—the ligation of leading and lagging nascent strands to create “closed” forks—can occur in vitro at short, interrupted inverted repeats. The removal of molecules with two closed forks results in a hairpin-capped linear duplex that we show replicates in vivo to create an inverted, dimeric plasmid that subsequently integrates into the genome by homologous recombination, creating an inverted triplication. While other models have been proposed to explain inverted triplications and their derivatives, our model can also explain the generation of human, de novo, inverted amplicons that have a 2:1 mixture of sequences from both homologues of a single parent—a feature readily explained by a plasmid intermediate that arises from one homologue and integrates into the other homologue prior to meiosis. Our tests of key features of ODIRA lend support to this mechanism and suggest further avenues of enquiry to unravel the origins of interstitial, inverted CNVs pivotal in human health and evolution.  相似文献   

20.

Background

There is growing evidence for the prevalence of copy number variation (CNV) and its role in phenotypic variation in many eukaryotic species. Here we use array comparative genomic hybridization to explore the extent of this type of structural variation in domesticated barley cultivars and wild barleys.

Results

A collection of 14 barley genotypes including eight cultivars and six wild barleys were used for comparative genomic hybridization. CNV affects 14.9% of all the sequences that were assessed. Higher levels of CNV diversity are present in the wild accessions relative to cultivated barley. CNVs are enriched near the ends of all chromosomes except 4H, which exhibits the lowest frequency of CNVs. CNV affects 9.5% of the coding sequences represented on the array and the genes affected by CNV are enriched for sequences annotated as disease-resistance proteins and protein kinases. Sequence-based comparisons of CNV between cultivars Barke and Morex provided evidence that DNA repair mechanisms of double-strand breaks via single-stranded annealing and synthesis-dependent strand annealing play an important role in the origin of CNV in barley.

Conclusions

We present the first catalog of CNVs in a diploid Triticeae species, which opens the door for future genome diversity research in a tribe that comprises the economically important cereal species wheat, barley, and rye. Our findings constitute a valuable resource for the identification of CNV affecting genes of agronomic importance. We also identify potential mechanisms that can generate variation in copy number in plant genomes.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号