Similar literature
Found 20 similar articles; search time: 312 ms
1.

Background

Copy number variations (CNVs) confer significant effects on genetic innovation and phenotypic variation. Previous CNV studies in swine seldom focused on in-depth characterization of global CNVs.

Results

Using whole-genome assembly comparison (WGAC) and whole-genome shotgun sequence detection (WSSD) approaches based on next generation sequencing (NGS), we probed formation signatures of both segmental duplications (SDs) and individualized CNVs in an integrated fashion, building the finest resolution CNV and SD maps of pigs so far. We obtained copy number estimates of all protein-coding genes with copy number variation carried by individuals, and further confirmed two genes with high copy numbers in Meishan pigs through an enlarged population. We determined genome-wide CNV hotspots, which were significantly enriched in SD regions, suggesting that the evolution of CNV hotspots may be affected by ancestral SDs. Through systematic enrichment analyses based on simulations and bioinformatics analyses, we revealed that CNV-related genes are under a different selective constraint from those in CNV-unrelated regions, and that CNVs may be associated with or affect pig health and production performance under recent selection.

Conclusions

Our studies lay out one way to characterize CNVs in the pig genome, provide insight into pig genome variation, and prompt studies of CNV mechanisms when using pigs as biomedical models for human diseases.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-593) contains supplementary material, which is available to authorized users.
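The enrichment of CNV hotspots in SD regions reported in this abstract is the kind of claim that is commonly checked by simulation. The following is a minimal sketch of one such check, a permutation test that randomly relocates hotspots along a chromosome. It assumes half-open intervals on a single chromosome; the abstract does not describe the authors' actual simulation procedure, and every name and coordinate here is hypothetical.

```python
import random

def overlaps_any(interval, others):
    """Return True if a half-open interval (start, end) overlaps any interval in others."""
    start, end = interval
    return any(start < o_end and o_start < end for o_start, o_end in others)

def count_overlapping(hotspots, sds):
    """Count hotspots that overlap at least one segmental duplication."""
    return sum(overlaps_any(h, sds) for h in hotspots)

def permutation_enrichment(hotspots, sds, chrom_length, n_perm=1000, seed=0):
    """Empirical p-value for hotspot/SD overlap under random hotspot placement.

    Assumption: uniform random relocation is an acceptable null model.
    """
    rng = random.Random(seed)
    observed = count_overlapping(hotspots, sds)
    at_least_as_extreme = 0
    for _ in range(n_perm):
        shuffled = []
        for start, end in hotspots:
            length = end - start
            new_start = rng.randrange(0, chrom_length - length + 1)
            shuffled.append((new_start, new_start + length))
        if count_overlapping(shuffled, sds) >= observed:
            at_least_as_extreme += 1
    # Add-one correction so the empirical p-value is never exactly zero.
    p_value = (at_least_as_extreme + 1) / (n_perm + 1)
    return observed, p_value

# Hypothetical toy coordinates, not data from the study.
sds = [(1_000, 2_000), (50_000, 52_000)]
hotspots = [(1_200, 1_500), (1_800, 2_300), (50_500, 51_000), (80_000, 80_400)]
observed, p_value = permutation_enrichment(hotspots, sds, chrom_length=100_000)
print(observed, p_value)
```

A genome-wide analysis would repeat this per chromosome and would probably need a null model that respects gaps and mappability, which this sketch ignores.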

2.
The primary objective of this study was to create a genome-wide high resolution map (i.e., >100 bp) of 'rearrangement hotspots' which can facilitate the identification of regions capable of mediating de novo deletions or duplications in humans. A hierarchical method was employed to fragment segmental duplications (SDs) into multiple smaller SD units. Combining an end-space-free pairwise alignment algorithm with a 'seed and extend' approach, we have exhaustively searched 409 million alignments to detect complex structural rearrangements within the reference-guided assembly of the NA18507 human genome (18× coverage), including the previously identified novel 4.8 Mb sequence from de novo assembly within this genome. We have identified 1,963 rearrangement hotspots within SDs which encompass 166 genes and display an enrichment of duplicated gene nucleotide variants (DNVs). These regions are correlated with increased non-allelic homologous recombination (NAHR) event frequency which presumably represents the origin of copy number variations (CNVs) and pathogenic duplications/deletions. Analysis revealed that 20% of the detected hotspots are clustered within the proximal and distal SD breakpoints flanked by the pathogenic deletions/duplications that have been mapped for 24 NAHR-mediated genomic disorders. FISH validation of selected complex regions revealed 94% concordance with in silico localization of the highly homologous derivatives. Other results from this study indicate that intra-chromosomal recombination is enhanced in genic compared with agenic duplicated regions, and that gene desert regions comprising SDs may represent reservoirs for creation of novel genes. The generation of genome-wide signatures of 'rearrangement hotspots', which likely serve as templates for NAHR, may provide a powerful approach towards understanding the underlying mutational mechanism(s) for development of constitutional and acquired diseases.

3.
Chiang CW, Derti A, Schwartz D, Chou MF, Hirschhorn JN, Wu CT. Genetics, 2008, 180(4): 2277-2293
Ultraconserved elements (UCEs) are sequences that are identical between reference genomes of distantly related species. As they are under negative selection and enriched near or in specific classes of genes, one explanation for their ultraconservation may be their involvement in important functions. Indeed, many UCEs can drive tissue-specific gene expression. We have demonstrated that nonexonic UCEs are depleted among segmental duplications (SDs) and copy number variants (CNVs) and proposed that their ultraconservation may reflect a mechanism of copy counting via comparison. Here, we report that nonexonic UCEs are also depleted among 10 of 11 recent genomewide data sets of human CNVs, including 3 obtained with strategies permitting greater precision in determining the extents of CNVs. We further present observations suggesting that nonexonic UCEs per se may contribute to this depletion and that their apparent dosage sensitivity was in effect when they became fixed in the last common ancestor of mammals, birds, and reptiles, consistent with dosage sensitivity contributing to ultraconservation. Finally, in searching for the mechanism(s) underlying the function of nonexonic UCEs, we have found that they are enriched in TAATTA, which is also the recognition sequence for the homeodomain DNA-binding module, and bounded by a change in A + T frequency.

4.
Copy number variations (CNVs) are gains and losses of genomic sequence greater than 50 bp between two individuals of a species. While single nucleotide polymorphisms (SNPs) are more frequent, CNVs impact a higher percentage of genomic sequence and have potentially greater effects, including the changing of gene structure and dosage, altering gene regulation and exposing recessive alleles. In particular, segmental duplications (SDs) were shown to be one of the catalysts and hotspots for CNV formation. Substantial progress has been made in understanding CNVs in mammals, especially in humans and rodents. CNVs have been shown to be important in both normal phenotypic variability and disease susceptibility. Recently, interest in CNV study has extended into domesticated animals, including cattle. Multiple genome-wide cattle CNV studies have been carried out using both microarray and next generation sequencing technologies. Integration of SD and CNV results with SNP and other datasets is beginning to reveal impacts of CNVs on cattle domestication, health, and production traits.

5.
Major depressive disorder (MDD) affects approximately 15 million Americans. Approximately 2 million of these are classified as being refractory to treatment (TR‐MDD). Because of the lack of available therapies for TR‐MDD, and the high risk of suicide, there is interest in identifying new treatment modalities and diagnostic methods. Understanding of the impact of genomic copy number variation in the etiology of a variety of neuropsychiatric phenotypes is increasing. Low copy repeat elements at 15q13.3 facilitate non‐allelic homologous recombination, resulting in recurrent copy number variants (CNVs). Numerous reports have described association between microdeletions in this region and a variety of neuropsychiatric phenotypes, with CHRNA7 implicated as a candidate gene. However, the pathogenicity of 15q13.3 duplications is less clear. As part of an ongoing study, in which we have identified a number of metabolomic anomalies in spinal fluid from TR‐MDD patients, we also evaluated genomic copy number variation in patients (n = 125) and controls (n = 26) via array‐based copy number genomic hybridization (CGH); the case frequency was compared with frequencies reported in a prior study as well as a larger population‐sized cohort. We identified five TR‐MDD patients with microduplications involving CHRNA7. CHRNA7 duplications are the most common CNVs identified by clinical CGH in this cohort. Therefore, this study provides insight into the potential involvement of CHRNA7 duplications in the etiology of TR‐MDD and informs those involved with care of affected individuals.

6.
Copy number variations (CNVs) are large insertions, deletions or duplications in the genome that vary between members of a species and are known to affect a wide variety of phenotypic traits. In this study, we identified CNVs in a population of bulls using low coverage next‐generation sequence data. First, in order to determine a suitable strategy for CNV detection in our data, we compared the performance of three distinct CNV detection algorithms on benchmark CNV datasets and concluded that using the multiple sample read depth approach was the best method for identifying CNVs in our sequences. Using this technique, we identified a total of 1341 copy number variable regions (CNVRs) from genome sequences of 154 purebred sires used in Cycle VII of the USMARC Germplasm Evaluation Project. These bulls represented the seven most popular beef breeds in the United States: Hereford, Charolais, Angus, Red Angus, Simmental, Gelbvieh and Limousin. The CNVRs covered 6.7% of the bovine genome and spanned 2465 protein‐coding genes and many known quantitative trait loci (QTL). Genes harbored in the CNVRs were further analyzed to determine their function as well as to find any breed‐specific differences that may shed light on breed differences in adaptation, health and production.
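To make the idea of a multiple-sample read depth approach concrete, the sketch below shows one simple way read counts in fixed windows could be turned into copy number estimates. It is an illustration under stated assumptions, not the algorithm the study used: each sample is normalised by its own median depth, the across-sample median in each window is taken as the diploid expectation, and all function names and numbers are hypothetical.

```python
from statistics import median

def estimate_copy_number(depth_by_sample, ploidy=2):
    """Estimate integer copy number per window from read counts.

    depth_by_sample maps a sample name to a list of read counts, one per
    genomic window; every sample must cover the same windows.
    """
    # Normalise each sample by its own median depth to remove coverage differences.
    normalised = {}
    for sample, depths in depth_by_sample.items():
        sample_median = median(depths)
        normalised[sample] = [d / sample_median for d in depths]

    n_windows = len(next(iter(normalised.values())))
    # Assumption: most samples are diploid in any window, so the across-sample
    # median approximates the expected normal signal there.
    window_reference = [
        median(values[i] for values in normalised.values())
        for i in range(n_windows)
    ]

    copy_number = {}
    for sample, values in normalised.items():
        copy_number[sample] = [
            round(ploidy * v / ref) if ref > 0 else None
            for v, ref in zip(values, window_reference)
        ]
    return copy_number

# Hypothetical read counts for three samples over five windows.
depths = {
    "bull_A": [30, 31, 29, 30, 30],
    "bull_B": [60, 58, 62, 90, 60],   # elevated depth in window 3
    "bull_C": [20, 20, 21, 20, 10],   # reduced depth in window 4
}
calls = estimate_copy_number(depths)
print(calls)
```

Comparing each window against the other samples, rather than against a single reference, is what lets low-coverage data share information across the population; real tools also correct for GC content and mappability, which this sketch omits.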

7.
The detection of copy number variants (CNV) by array-based platforms provides valuable insight into understanding human diversity. However, suboptimal study design and data processing negatively affect CNV assessment. We quantitatively evaluate their impact when short-sequence oligonucleotide arrays are applied (Affymetrix Genome-Wide Human SNP Array 6.0) by evaluating 42 HapMap samples for CNV detection. Several processing and segmentation strategies are implemented, and results are compared to CNV assessment obtained using an oligonucleotide array CGH platform designed to query CNVs at high resolution (Agilent). We quantitatively demonstrate that different reference models (e.g. single versus pooled sample reference) used to detect CNVs are a major source of inter-platform discrepancy (up to 30%) and that CNVs residing within segmental duplication regions (higher reference copy number) are significantly harder to detect (P < 0.0001). After adjusting Affymetrix data to mimic the Agilent experimental design (reference sample effect), we applied several common segmentation approaches and evaluated differential sensitivity and specificity for CNV detection, ranging 39–77% and 86–100% for non-segmental duplication regions, respectively, and 18–55% and 39–77% for segmental duplications. Our results are relevant to any array-based CNV study and provide guidelines to optimize performance based on study-specific objectives.
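The sensitivity and specificity figures quoted in this abstract can be read against the standard definitions. The sketch below computes both from per-probe labels, treating one platform's calls as the reference. The per-probe framing is an assumption made for illustration (the abstract does not state the unit of comparison), and the labels are invented.

```python
def sensitivity_specificity(reference, predicted):
    """Return (sensitivity, specificity) for two equal-length boolean label lists.

    reference[i] is True if probe i lies in a CNV according to the reference
    platform; predicted[i] is True if the evaluated method called a CNV there.
    """
    if len(reference) != len(predicted):
        raise ValueError("label lists must have the same length")
    tp = sum(r and p for r, p in zip(reference, predicted))
    fn = sum(r and not p for r, p in zip(reference, predicted))
    tn = sum(not r and not p for r, p in zip(reference, predicted))
    fp = sum(not r and p for r, p in zip(reference, predicted))
    sensitivity = tp / (tp + fn)   # share of true CNV probes that were called
    specificity = tn / (tn + fp)   # share of non-CNV probes left uncalled
    return sensitivity, specificity

# Hypothetical labels for ten probes.
reference = [True, True, True, True, False, False, False, False, False, False]
predicted = [True, True, True, False, False, False, False, False, False, True]
sens, spec = sensitivity_specificity(reference, predicted)
print(f"sensitivity={sens:.2f}, specificity={spec:.2f}")
```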

8.
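Genome-wide association analysis of copy number variation   Total citations: 3; self-citations: 0; citations by others: 3
Genomic copy number variations (CNVs) are insertions, deletions and/or amplifications of DNA segments ≥1 kb relative to the reference genome sequence, together with the complex variants derived from combinations of these. Because they are widely distributed, heritable, relatively stable and highly heterogeneous, CNVs are now regarded as a new type of genomic DNA polymorphism that can serve as a marker of disease susceptibility; the changes in gene dosage they cause can alter phenotype. Recently, a new CNV-based strategy for identifying disease susceptibility genes, CNV genome-wide association analysis, has begun to emerge. This strategy complements traditional association analysis based on single nucleotide polymorphisms, and understanding genomic structural variation can shed light on the molecular mechanisms and genetic basis of complex diseases.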

9.
In recent years, high-throughput sequencing technologies have offered new opportunities to study genome variations. These studies have mostly focused on single nucleotide polymorphisms, small insertions or deletions and on copy number variants. Other structural variants, such as large insertions or deletions, tandem duplications, translocations, and inversions are less well-studied, although some have an important impact on phenotypes. In the present study, we performed a large-scale survey of structural variants in cattle. We report the identification of 6,426 putative structural variants in cattle extracted from whole-genome sequence data of 62 bulls representing the three major French dairy breeds. These genomic variants affect DNA segments greater than 50 base pairs and correspond to deletions, inversions and tandem duplications. Out of these, we identified a total of 547 deletions and 410 tandem duplications which could potentially code for CNVs. Experimental validation was carried out on 331 structural variants using a novel high-throughput genotyping method. Out of these, 255 structural variants (77%) generated good quality genotypes and 191 (75%) of them were validated. Gene content analyses in structural variant regions revealed 941 large deletions completely removing one or several genes, including 10 single-copy genes. In addition, some of the structural variants are located within quantitative trait loci for dairy traits. This study is a pan-genome assessment of genomic variations in cattle and may provide a new glimpse into the bovine genome architecture. Our results may also help to study the effects of structural variants on gene expression and consequently their effect on certain phenotypes of interest.

10.
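Copy number variation: a new form of genomic diversity   Total citations: 1; self-citations: 0; citations by others: 1
Wu Zhijun, Jin Wei. Yi Chuan (《遗传》), 2009, 31(4): 339-347
Copy number variations are submicroscopic mutations involving DNA segments ranging in size from kilobases to megabases; they are genomic alterations that may be pathogenic, benign or of unknown clinical significance. Fosmid paired-end sequence comparison and array comparative genomic hybridization are currently the most widely used detection methods. Non-allelic homologous recombination, non-homologous mutation and non-B DNA structures are important causes of genomic copy number variation. Copy number variation can lead to differences in gene expression of varying degree and plays a role in shaping normal phenotypes and in the onset and progression of disease. Building on a summary of how copy number variation came to be understood and the strategies used to study it, this article analyzes the mechanisms by which copy number variations form and act, introduces the first-generation map of human genomic copy number variation, and discusses the clinical significance of copy number variation research. It argues that searches for disease-related genetic variation should not overlook copy number variation, a new form of genomic diversity.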

11.
12.

Background

The detailed study of breakpoints associated with copy number variants (CNVs) can elucidate the mutational mechanisms that generate them and the comparison of breakpoints across species can highlight differences in genomic architecture that may lead to lineage-specific differences in patterns of CNVs. Here, we provide a detailed analysis of Drosophila CNV breakpoints and contrast it with similar analyses recently carried out for the human genome.

Results

By applying split-read methods to a total of 10x coverage of 454 shotgun sequence across nine lines of D. melanogaster and by re-examining a previously published dataset of CNVs detected using tiling arrays, we identified the precise breakpoints of more than 600 insertions, deletions, and duplications. Contrasting these CNVs with those found in humans showed that in both taxa CNV breakpoints fall into three classes: blunt breakpoints; simple breakpoints associated with microhomology; and breakpoints with additional nucleotides inserted/deleted and no microhomology. In both taxa CNV breakpoints are enriched with non-B DNA sequence structures, which may impair DNA replication and/or repair. However, in contrast to human genomes, non-allelic homologous-recombination (NAHR) plays a negligible role in CNV formation in Drosophila. In flies, non-homologous repair mechanisms are responsible for simple, recurrent, and complex CNVs, including insertions of de novo sequence as large as 60 bp.

Conclusions

Humans and Drosophila differ considerably in the importance of homology-based mechanisms for the formation of CNVs, likely as a consequence of the differences in the abundance and distribution of both segmental duplications and transposable elements between the two genomes.

13.
Submicroscopic (less than 2 Mb) segmental DNA copy number changes are a recently recognized source of genetic variability between individuals. The biological consequences of copy number variants (CNVs) are largely undefined. In some cases, CNVs that cause gene dosage effects have been implicated in phenotypic variation. CNVs have been detected in diverse species, including mice and humans. Published studies in mice have been limited by resolution and strain selection. We chose to study 21 well-characterized inbred mouse strains that are the focus of an international effort to measure, catalog, and disseminate phenotype data. We performed comparative genomic hybridization using long oligomer arrays to characterize CNVs in these strains. This technique increased the resolution of CNV detection by more than an order of magnitude over previous methodologies. The CNVs range in size from 21 to 2,002 kb. Clustering strains by CNV profile recapitulates aspects of the known ancestry of these strains. Most of the CNVs (77.5%) contain annotated genes, and many (47.5%) colocalize with previously mapped segmental duplications in the mouse genome. We demonstrate that this technique can identify copy number differences associated with known polymorphic traits. The phenotype of previously uncharacterized strains can be predicted based on their copy number at these loci. Annotation of CNVs in the mouse genome combined with sequence-based analysis provides an important resource that will help define the genetic basis of complex traits.

14.
Shortened foetal femur length (FL) is a common abnormal phenotype that often causes anxiety in pregnant women, and standard clinical treatments remain unavailable. We investigated the clinical characteristics, genetic aetiology and obstetric pregnancy outcomes of foetuses with short FL and provided a reference for perinatal management of such cases. Chromosomal microarray analysis was used to analyse the copy number variations (CNVs) in short FL foetuses. Of the 218 foetuses with short FL, 33 foetuses exhibited abnormal CNVs, including 19 with pathogenic CNVs and 14 with variations of uncertain clinical significance. Of the 19 foetuses with pathogenic CNVs, four had aneuploidy, 14 had deletions/duplications, and one had pathogenic uniparental diploidy. The 7q11.23 microdeletion was detected in three foetuses. The severity of short FL was not associated with the rate of pathogenic CNVs. The duration of short FL for the intrauterine ultrasound phenotype in foetuses carrying a pathogenic CNV was independent of the gestational age. Further, maternal age was not associated with the incidence of foetal pathogenic CNVs. Adverse pregnancy outcomes occurred in 77 cases, including termination of pregnancy in 63 cases, postnatal dwarfism with intellectual disability in 11 cases, and three deaths within 3 months of birth. Pathogenic CNVs closely related to foetal short FL were identified, among which the 7q11.23 microdeletion was highly associated with short FL development. This study provides a reference for the perinatal management of foetuses with short FL.

15.
Summary: High‐density single‐nucleotide polymorphism (SNP) microarrays provide a useful tool for the detection of copy number variants (CNVs). The analysis of such large amounts of data is complicated, especially with regard to determining where copy numbers change and their corresponding values. In this article, we propose a Bayesian multiple change‐point model (BMCP) for segmentation and estimation of SNP microarray data. Segmentation concerns separating a chromosome into regions of equal copy number differences between the sample of interest and some reference, and involves the detection of locations of copy number difference changes. Estimation concerns determining true copy number for each segment. Our approach not only gives posterior estimates for the parameters of interest, namely locations for copy number difference changes and true copy number estimates, but also useful confidence measures. In addition, our algorithm can segment multiple samples simultaneously, and infer both common and rare CNVs across individuals. Finally, for studies of CNVs in tumors, we incorporate an adjustment factor for signal attenuation due to tumor heterogeneity or normal contamination that can improve copy number estimates.
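The segmentation task described in this abstract, finding where the copy number signal changes along a chromosome, can be illustrated with a much simpler technique than the Bayesian model the authors propose. The sketch below uses recursive least-squares binary segmentation. It is a stand-in chosen for brevity, not the BMCP method; it gives no posterior estimates or confidence measures, and the threshold and data are hypothetical.

```python
def sse(values):
    """Sum of squared deviations from the mean (0.0 for an empty list)."""
    if not values:
        return 0.0
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values)

def best_split(values, min_size=2):
    """Return (index, gain) for the split that most reduces squared error.

    Returns (None, 0.0) when no split leaves min_size points on each side
    or no split reduces the error.
    """
    total = sse(values)
    best_index, best_gain = None, 0.0
    for i in range(min_size, len(values) - min_size + 1):
        gain = total - sse(values[:i]) - sse(values[i:])
        if gain > best_gain:
            best_index, best_gain = i, gain
    return best_index, best_gain

def segment(values, min_gain=0.1, min_size=2, offset=0):
    """Recursively split the signal; return (start, end, mean) half-open segments."""
    index, gain = best_split(values, min_size)
    if index is None or gain < min_gain:
        return [(offset, offset + len(values), sum(values) / len(values))]
    left = segment(values[:index], min_gain, min_size, offset)
    right = segment(values[index:], min_gain, min_size, offset + index)
    return left + right

# Hypothetical log2 ratios (sample vs. reference) with a gain in probes 4-7.
log_ratios = [0.0, 0.1, -0.1, 0.0, 0.6, 0.5, 0.7, 0.6, 0.0, -0.1, 0.1, 0.0]
segments = segment(log_ratios)
for start, end, mean in segments:
    print(start, end, round(mean, 2))
```

The fixed `min_gain` cut-off is the weak point of this approach: it has to be tuned to the noise level by hand, which is one motivation for model-based methods that report uncertainty about each change point.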

16.
17.
Genomic rearrangements involving the peripheral myelin protein gene (PMP22) in human chromosome 17p12 are associated with neuropathy: duplications cause Charcot-Marie-Tooth disease type 1A (CMT1A), whereas deletions lead to hereditary neuropathy with liability to pressure palsies (HNPP). Our previous studies showed that >99% of these rearrangements are recurrent and mediated by nonallelic homologous recombination (NAHR). Rare copy number variations (CNVs) generated by nonrecurrent rearrangements also exist in 17p12, but their underlying mechanisms are not well understood. We investigated 21 subjects with rare CNVs associated with CMT1A or HNPP by oligonucleotide-based comparative genomic hybridization microarrays and breakpoint sequence analyses, and we identified 17 unique CNVs, including two genomic deletions, ten genomic duplications, two complex rearrangements, and three small exonic deletions. Each of these CNVs includes either the entire PMP22 gene, or exon(s) only, or ultraconserved potential regulatory sequences upstream of PMP22, further supporting the contention that PMP22 is the critical gene mediating the neuropathy phenotypes associated with 17p12 rearrangements. Breakpoint sequence analysis reveals that, different from the predominant NAHR mechanism in recurrent rearrangement, various molecular mechanisms, including nonhomologous end joining, Alu-Alu-mediated recombination, and replication-based mechanisms (e.g., FoSTeS and/or MMBIR), can generate nonrecurrent 17p12 rearrangements associated with neuropathy. We document a multitude of ways in which gene function can be altered by CNVs. Given the characteristics, including small size, structural complexity, and location outside of coding regions, of selected rare CNVs, their identification remains a challenge for genome analysis. Rare CNVs may potentially represent an important portion of “missing heritability” for human diseases.

18.
Background: We sought to characterize the landscape of structural variation associated with the subset of congenital cardiac defects characterized by left‐sided obstruction. Methods: Cases with left‐sided cardiac defects (LSCD) and pediatric controls were uniformly genotyped and assessed for copy number variant (CNV) calls. Significance testing was performed to ascertain differences in overall CNV incidence, and for CNV enrichment of specific genes and gene functions in LSCD cases relative to controls. Results: A total of 257 cases of European descent and 962 ethnically matched, disease‐free pediatric controls were included. Although there was no difference in CNV rate between cases and controls, a significant enrichment in rare LSCD CNVs was detected overall (p = 7.30 × 10⁻³, case/control ratio = 1.26) and when restricted either to deletions (p = 7.58 × 10⁻³, case/control ratio = 1.20) or duplications (p = 3.02 × 10⁻³, case/control ratio = 1.43). Neither gene‐based, functional nor knowledge‐based analyses identified genes, loci or pathways that were significantly enriched in cases as compared to controls when appropriate corrections for multiple tests were applied. However, several genes of interest were identified by virtue of their association with cardiac development, known human conditions, or reported disruption by CNVs in other patient cohorts. Conclusion: This study examines the largest cohort to date with LSCD for structural variation. These data suggest that CNVs play a role in disease risk and identify numerous genes disrupted by CNVs of potential disease relevance. These findings further highlight the genetic heterogeneity and complexity of these disorders. Birth Defects Research (Part A) 100:951–964, 2014. © 2014 Wiley Periodicals, Inc.

19.
Copy number variation refers to regions along chromosomes that harbor a type of structural variation, such as duplications or deletions. Copy number variants (CNVs) play a role in many important traits as well as in genetic diversity. Previous analyses of chickens using array comparative genomic hybridizations or single‐nucleotide polymorphism chip assays have been performed on various breeds and genetic lines to discover CNVs. In this study, we assessed individuals from two highly inbred (inbreeding coefficient > 99.99%) lines, Leghorn G‐B2 and Fayoumi M15.2, to discover novel CNVs in chickens. These lines have been previously studied for disease resistance, and to our knowledge, this represents the first global assessment of CNVs in the Fayoumi breed. Genomic DNA from individuals was examined using the Agilent chicken 244 K comparative genomic hybridization array and quantitative PCR. We identified a total of 273 CNVs overall, with 112 CNVs being novel and not previously reported. Quantitative PCR using the standard curve method validated a subset of our array data. Through enrichment analysis of genes within CNV regions, we observed multiple chromosomes, terms and pathways that were significantly enriched, largely dealing with the major histocompatibility complex and immune responsiveness. Using an additional round of computational and statistical analysis with a different bioinformatic pipeline, we identified 43 CNVs among these as high‐confidence regions, 14 of which were found to be novel. We further compared and contrasted individuals of the two inbred lines to discover regions that have a significant difference in copy number between lines. A total of 40 regions had significant deletions or duplications between the lines. Gene Ontology analysis of genomic regions containing CNVs between lines was also performed. This between‐line candidate CNV list will be useful in studies with these two unique genetic lines, which may harbor variations that underlie quantitative trait loci for disease resistance and other important traits. Through the global discovery of novel CNVs in chicken, these data also provide resources for further genetic and functional genomics studies.

20.
Summary: Analysis of mitochondrial DNAs (mtDNAs) from parthenogenetic lizards of the Heteronotia binoei complex with restriction enzymes revealed a 5-kb addition present in all 77 individuals. Cleavage site mapping suggested the presence of a direct tandem duplication spanning the 16S and 12S rRNA genes, the control region and most, if not all, of the gene for the subunit 1 of NADH dehydrogenase (ND1). The location of the duplication was confirmed by Southern hybridization. A restriction enzyme survey provided evidence for modifications to each copy of the duplicated sequence, including four large deletions. Each gene affected by a deletion was complemented by an intact version in the other copy of the sequence, although for one gene the functional copy was heteroplasmic for another deletion. Sequencing of a fragment from one copy of the duplication which encompassed the tRNA-Leu(UUR) and parts of the 16S rRNA and ND1 genes, revealed mutations expected to disrupt function. Thus, evolution subsequent to the duplication event has resulted in mitochondrial pseudogenes. The presence of duplications in all of these parthenogens, but not among representatives of their maternal sexual ancestors, suggests that the duplications arose in the parthenogenetic form. This provides the second instance in H. binoei of mtDNA duplication associated with the transition from sexual to parthenogenetic reproduction. The increased incidence of duplications in parthenogenetic lizards may be caused by errors in mtDNA replication due to either polyploidy or hybridity of their nuclear genomes.
