Similar articles
 Found 20 similar articles (search time: 46 ms)
1.
Although copy number variation (CNV) has recently received much attention as a form of structural variation within the human genome, knowledge of fundamental CNV characteristics such as occurrence rate, genomic distribution and ethnic differentiation is still inadequate. In the present study, we used the Affymetrix GeneChip® Mapping 500K Array to discover and characterize CNVs in the human genome and to study ethnic differences of CNVs between Caucasians and Asians. Three thousand and nineteen CNVs, including 2381 CNVs on autosomes and 638 CNVs on the X chromosome, from 985 Caucasian and 692 Asian individuals were identified, with a mean length of 296 kb. Among these CNVs, 190 had frequencies greater than 1% in at least one ethnic group, and 109 showed significant ethnic differences in frequencies (p<0.01). After merging overlapping CNVs, 1135 copy number variation regions (CNVRs), covering approximately 439 Mb (14.3%) of the human genome, were obtained. Our findings of ethnic differentiation of CNVs, along with the newly constructed CNV genomic map, extend our knowledge on the structural variation in the human genome and may furnish a basis for understanding the genomic differentiation of complex traits across ethnic groups.
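
The merging step mentioned in this abstract (overlapping CNVs collapsed into CNVRs) can be illustrated with a standard interval-merge routine. This is a minimal sketch, not the authors' pipeline: the function name `merge_cnvs`, the `(chrom, start, end)` tuple layout, and the rule that any shared position counts as overlap are all assumptions made for illustration.

```python
from collections import defaultdict


def merge_cnvs(cnvs):
    """Merge overlapping CNV intervals into CNV regions (CNVRs).

    cnvs: iterable of (chrom, start, end) tuples with start <= end.
    Returns a list of (chrom, start, end) tuples sorted by chromosome
    name and start position. Hypothetical helper, for illustration only.
    """
    by_chrom = defaultdict(list)
    for chrom, start, end in cnvs:
        by_chrom[chrom].append((start, end))

    regions = []
    for chrom in sorted(by_chrom):
        intervals = sorted(by_chrom[chrom])
        cur_start, cur_end = intervals[0]
        for start, end in intervals[1:]:
            if start <= cur_end:
                # Overlaps the region being built: extend it if needed.
                cur_end = max(cur_end, end)
            else:
                # Gap found: close the current region and start a new one.
                regions.append((chrom, cur_start, cur_end))
                cur_start, cur_end = start, end
        regions.append((chrom, cur_start, cur_end))
    return regions
```

Summing `end - start` over the returned regions would give the total genomic span covered, which is the kind of figure the abstract reports for its CNVRs.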

2.
Amazingly little sequence variation is reported for the kringle IV 2 copy number variation (KIV 2 CNV) in the human LPA gene. Apart from whole genome sequencing projects, this region has only been analyzed in some detail in samples of European populations. We have performed a systematic resequencing study of the exonic and flanking intron regions within the KIV 2 CNV in 90 alleles from Asian, European, and four different African populations. Alleles have been separated according to their CNV length by pulsed field gel electrophoresis prior to unbiased specific PCR amplification of the target regions. These amplicons covered all KIV 2 copies of an individual allele simultaneously. In addition, cloned amplicons from genomic DNA of an African individual were sequenced. Our data suggest that sequence variation in this genomic region may be higher than previously appreciated. Detection probability of variants appeared to depend on the KIV 2 copy number of the analyzed DNA and on the proportion of copies carrying the variant. Asians had a high frequency of so-called KIV 2 type B and type C (together 70% of alleles), which differ by three or two synonymous substitutions respectively from the reference type A. This is most likely explained by the strong bottleneck suggested to have occurred when modern humans migrated to East Asia. A higher frequency of variable sites was detected in the Africans. In particular, two previously unreported splice site variants were found. One was associated with non-detectable Lp(a). The other was observed at high population frequencies (10% to 40%). Like the KIV 2 type B and C variants, this latter variant was also found in a high proportion of KIV 2 repeats in the affected alleles and in alleles differing in copy numbers. Our findings may have implications for the interpretation of SNP analyses in other repetitive loci of the human genome.

3.
Copy number variations (CNVs) have provided a dynamic aspect to the apparently static human genome. We have analyzed CNVs larger than 100 kb in 477 healthy individuals from 26 diverse Indian populations of different linguistic, ethnic and geographic backgrounds. These CNVRs were identified using the Affymetrix 50K Xba 240 Array. We observed 1,425 and 1,337 CNVRs in the deletion and amplification sets, respectively, after pooling data from all the populations. More than 50% of the genes encompassed entirely in CNVs had both deletions and amplifications. There was wide variability across populations not only with respect to CNV extent (ranging from 0.04–1.14% of genome under deletion and 0.11–0.86% under amplification) but also in terms of functional enrichments of processes like keratinization, serine proteases and their inhibitors, cadherins, homeobox, olfactory receptors etc. These did not correlate with linguistic, ethnic or geographic background or with population size. Certain processes were near exclusive to deletion (serine proteases, keratinization, olfactory receptors, GPCRs) or duplication (homeobox, serine protease inhibitors, embryonic limb morphogenesis) datasets. Populations having the same enriched processes were observed to contain genes from different genomic loci. Comparison of polymorphic CNVRs (5% or more) with those cataloged in the Database of Genomic Variants revealed that 78% (2473) of the genes in CNVRs in Indian populations are novel. Validation of CNVs using Sequenom MassARRAY revealed extensive heterogeneity in CNV boundaries. Exploration of CNV profiles in such diverse populations would provide a widely valuable resource for understanding diversity in phenotypes and disease.

4.
Wang Y, Gu X, Feng C, Song C, Hu X, Li N. Animal Genetics, 2012, 43(3): 282-289
The discovery of copy number variation (CNV) in the genome has provided new insight into genomic polymorphism. Studies with chickens have identified a number of large CNV segments using a 385k comparative genomic hybridization (CGH) chip (mean length >140 kb). We present a detailed CNV map for local Chinese chicken breeds and commercial chicken lines using an Agilent 400k array CGH platform with custom-designed probes. We identified a total of 130 copy number variation regions (CNVRs; mean length = 25.70 kb). Of these, 104 (80.0%) were novel segments reported for the first time in chickens. Among the 104 novel CNVRs, 56 (53.8%) of the segments were non-coding sequences, 65 (62.5%) showed the gain of DNA and 40 (38.5%) showed the loss of DNA (one locus showed both loss and gain). Overlapping with the formal selective sweep data and the quantitative trait loci data, we identified four loci that might be considered to be high-confidence selective segments that arose during the domestication of chickens. Compared with the CNVRs reported previously, genes for the positive regulation of phospholipase A2 activity were discovered to be significantly over-represented in the novel CNVRs reported here by gene ontology analysis. Availability of our results should facilitate further research in the study of the genetic variability in chicken breeds.

5.
Copy number variants (CNVs) in the human genome contribute to both Mendelian and complex traits as well as to genomic plasticity in evolution. The investigation of mutational rates of CNVs is critical to understanding genomic instability and the etiology of copy number variation (CNV)-related traits. However, the evaluation of the CNV mutation rate at the genome level poses an insurmountable practical challenge that requires large samples and accurate typing. In this study, we show that an approximate estimation of the CNV mutation rate could be achieved by using the phylogeny information of flanking SNPs. This allows a genome-wide comparison of mutation rates between CNVs with the use of vast, readily available SNP genotyping data. A total of 4187 CNV regions (CNVRs) previously identified in HapMap populations were investigated in this study. We showed that the mutation rates for the majority of these CNVRs are on the order of 10⁻⁵ per generation, consistent with experimental observations at individual loci. Notably, the mutation rates of 104 (2.5%) CNVRs were estimated to be on the order of 10⁻³ per generation; therefore, they were identified as potential hotspots. Additional analyses revealed that genome architecture at CNV loci has a potential role in inciting mutational hotspots in the human genome. Interestingly, 49 (47%) CNV hotspots include human genes, some of which are known to be functional CNV loci (e.g., CNVs of C4 and β-defensin causing autoimmune diseases and CNVs of HYDIN with implication in control of cerebral cortex size), implicating the important role of CNV in human health and evolution, especially in common and complex diseases.

6.
Copy number variants (CNVs) are thought to play an important role in the predisposition to autism spectrum disorder (ASD). However, their relatively low frequency and widespread genomic distribution complicate their accurate characterization and utilization for clinical genetics purposes. Here we present a comprehensive analysis of multi-study, genome-wide CNV data from AutDB (http://mindspec.org/autdb.html), a genetic database that accommodates detailed annotations of published scientific reports of CNVs identified in ASD individuals. Overall, we evaluated 4,926 CNVs in 2,373 ASD subjects from 48 scientific reports, encompassing ∼2.12×10⁹ bp of genomic data. Remarkable variation was seen in CNV size, with duplications being significantly larger than deletions (P = 3×10⁻¹⁰⁵; Wilcoxon rank sum test). Examination of the CNV burden across the genome revealed 11 loci with a significant excess of CNVs among ASD subjects (P<7×10⁻⁷). Altogether, these loci covered 15,610 kb of the genome and contained 166 genes. Remarkable variation was seen both in locus size (20-4950 kb) and in gene content, with seven multigenic (≥3 genes) and four monogenic loci. CNV data from control populations were used to further refine the boundaries of these ASD susceptibility loci. Interestingly, our analysis indicates that 15q11.2-13.3, a genomic region prone to chromosomal rearrangements of various sizes, contains three distinct ASD susceptibility CNV loci that vary in their genomic boundaries, CNV types, inheritance patterns, and overlap with CNVs from control populations. In summary, our analysis of AutDB CNV data provides valuable insights into the genomic characteristics of ASD susceptibility CNV loci and could therefore be utilized in various clinical settings and facilitate future genetic research of this disorder.

7.
We carried out a comprehensive genomic analysis of porcine copy number variants (CNVs) based on whole‐genome SNP genotyping data and provided new measures of genomic diversity (number, length and distribution of CNV events) for a highly inbred strain (the Guadyerbas strain). This strain represents one of the most ancient surviving populations of the Iberian breed, and it is currently in serious danger of extinction. CNV detection was conducted on the complete Guadyerbas population, adjusted for genomic waves, and used strict quality criteria, pedigree information and the latest porcine genome annotation. The analysis led to the detection of 65 CNV regions (CNVRs). These regions cover 0.33% of the autosomal genome of this particular strain. Twenty‐nine of these CNVRs were identified here for the first time. The relatively low number of detected CNVRs is in line with the low variability and high inbreeding estimated previously for this Iberian strain using pedigree, microsatellite or SNP data. A comparison across different porcine studies has revealed that more than half of these regions overlap with previously identified CNVRs or multicopy regions. Also, a preliminary analysis of CNV detection using whole‐genome sequence data for four Guadyerbas pigs showed overlap for 16 of the CNVRs, supporting their reliability. Some of the identified CNVRs contain relevant functional genes (e.g., the SCD and USP15 genes), which are worth investigating further because of their importance in determining the quality of Iberian pig products. The CNVR data generated could be useful for improving the porcine genome annotation.

8.
To examine the performance and information content of different marker systems, comparative assessment of population genetic diversity was undertaken in nine populations of Athyrium distentifolium using nine genomic and 10 expressed sequence tag (EST) microsatellite (SSR) loci, and 265 amplified fragment length polymorphism (AFLP) loci from two primer combinations. In range-wide comparisons (European vs. North American populations), the EST-SSR loci showed more reliable amplification and produced more easily scorable bands than genomic simple sequence repeats (SSRs). Genomic SSRs showed significantly higher levels of allelic diversity than EST-SSRs, but there was a significant correlation in the rank order of population diversities revealed by both marker types. When AFLPs, genomic SSRs, and EST-SSRs are considered, comparisons of different population diversity metrics/markers revealed a mixture of significant and nonsignificant rank-order correlations. However, no hard incongruence was detected (in no pairwise comparison of populations did different marker systems or metrics detect opposingly significant different amounts of variation). Comparable population pairwise estimates of F(ST) were obtained for all marker types, but whilst absolute values for genomic and EST-SSRs were very similar (F(ST) = 0.355 and 0.342, respectively), differentiation was consistently higher for AFLPs in pairwise and global comparisons (global AFLP F(ST) = 0.496). The two AFLP primer combinations outperformed 18 SSR loci in assignment tests and discriminatory power in phenetic cluster analyses. The results from marker comparisons on A. distentifolium are discussed in the context of the few other studies on natural plant populations comparing microsatellite and AFLP variability.

9.
We attempt to address the issue of genetic variation and the pattern of male gene flow among and between five Indian population groups of two different geographic and linguistic affiliations using Y-chromosome markers. We studied 221 males at three Y-chromosome biallelic loci and 184 males for the five Y-chromosome STRs. We observed 111 Y-chromosome STR haplotypes. An analysis of molecular variance (AMOVA) based on Y-chromosome STRs showed that the variation observed between the population groups belonging to two major regions (western and southwestern India) was 0.17%, which was significantly lower than the level of genetic variance among the five populations (0.59%) considered as a single group. Combined haplotype analysis of the five STRs and the biallelic locus 92R7 revealed minimal sharing of haplotypes among these five ethnic groups, irrespective of the similar origin of the linguistic and geographic affiliations; this minimal sharing indicates restricted male gene flow. As a consequence, most of the haplotypes were population specific. Network analysis showed that the haplotypes, which were shared between the populations, seem to have originated from different mutational pathways at different loci. Biallelic markers showed that all five ethnic groups have a similar ancestral origin despite their geographic and linguistic diversity.

10.
Recent comparative genome hybridization studies revealed that hundreds to thousands of human genomic loci can have interindividual copy number variations (CNVs). One such CNV locus in the HLA codes for the immune effector protein complement component C4. Sensitive, specific, and accurate assays to interrogate the C4 CNV and its associated polymorphisms by using submicrogram quantities of genomic DNA are needed for high throughput epidemiologic studies of C4 CNVs in autoimmune, infectious, and neurological diseases. Quantitative real-time PCR (qPCR) assays were developed using TaqMan chemistry and based on sequences specific for C4A and C4B genes, structural characteristics corresponding to the long and short forms of C4 genes, and the breakpoint region of RP-C4-CYP21-TNX (RCCX) modular duplication. Assignments for gene copy numbers were achieved by relative standard curve methods using cloned C4 genomic DNA covering 6 logs of DNA concentrations for calibrations. The accuracies of test results were cross-confirmed internally in each sample, as the sum of C4A plus C4B equals the sum of C4L plus C4S or the total copy number of RCCX modules. These qPCR assays were applied to determine C4 CNVs from samples of 50 consanguineous subjects who were mostly homozygous in HLA genotypes. The results revealed eight HLA haplotypes with single C4 genes in monomodular RCCX that are associated with multiple autoimmune and infectious diseases and 32 bimodular, 4 trimodular, and one quadrimodular RCCX. These C4 qPCR assays are proven to be robust, sensitive, and reliable, as they have contributed to the elucidation of C4 CNVs in >1000 human samples with autoimmune and neurological diseases.
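
The internal cross-confirmation described in this abstract (C4A + C4B should equal C4L + C4S, and both should equal the number of RCCX modules) amounts to a simple equality check per sample. The sketch below assumes integer copy-number calls have already been made; the function and argument names are hypothetical and are not taken from the assay's software.

```python
def c4_calls_consistent(c4a, c4b, c4l, c4s, rccx):
    """Return True when the three independent totals agree.

    c4a, c4b: copy numbers of the C4A and C4B genes
    c4l, c4s: copy numbers of the long and short C4 forms
    rccx:     total number of RCCX modules
    All inputs are assumed to be integer calls for one sample.
    """
    return c4a + c4b == c4l + c4s == rccx
```

A sample failing this check would be a natural candidate for re-assay, since at least one of the five calls must be wrong.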

11.
Genomic copy number variation (CNV) is a recently identified form of global genetic variation in the human genome. The Affymetrix GeneChip 100 and 500 K SNP genotyping platforms were used to perform a large-scale population-based study of CNV frequency. We constructed a genomic map of 578 CNV regions, covering approximately 220 Mb (7.3%) of the human genome, identifying 183 previously unknown intervals. Copy number changes were observed to occur infrequently (<1%) in the majority (>93%) of these genomic regions, but encompass hundreds of genes and disease loci. This North American population-based map will be a useful resource for future genetic studies. Electronic supplementary material: The online version of this article (doi:) contains supplementary material, which is available to authorized users.

12.
Using an enriched genomic library, we developed seven (CT)n/(GA)n microsatellite loci for eelgrass Zostera marina L. Enrichment is described and highly recommended for genomes in which microsatellites are rare, such as in many plants. A test for polymorphism was performed on individuals from three geographically separated populations (N = 15/population) and revealed considerable genetic variation. The number of alleles per locus varied between five and 11 and the observed heterozygosities for single loci ranged from 0.16 to 0.81 within populations. Mean allele lengths were markedly different among populations, indicating that the identified loci will be useful in studying population structure in Z. marina. As the frequency of the most abundant multilocus genotype within populations was always < 1%, these loci have sufficient resolving power to address clone size in predominantly vegetatively reproducing populations.

13.
Detailed analyses of the population-genetic nature of copy number variations (CNVs) and the linkage disequilibrium between CNV and single nucleotide polymorphism (SNP) loci from high-throughput experimental data require a computational tool to accurately infer alleles of CNVs and haplotypes composed of both CNV alleles and SNP alleles. Here we developed a new tool to infer population frequencies of such alleles and haplotypes from observed copy numbers and SNP genotypes, using the expectation-maximization algorithm. This tool can also handle copy numbers ambiguously determined, such as 2 or 3 copies, due to experimental noise. Availability: http://emu.src.riken.jp/MOCSphaser/MOCSphaser.zip.
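
To make the expectation-maximization idea concrete, here is a reduced sketch that estimates CNV allele frequencies only (no SNP haplotypes) from diploid copy numbers, including ambiguous calls such as "2 or 3". It is not the tool's implementation: the function name, the representation of each observation as a set of possible totals, the cap `max_allele` on copies per chromosome, and the Hardy-Weinberg assumption are all choices made for this illustration.

```python
def em_cnv_allele_freqs(observations, max_allele=2, n_iter=200, tol=1e-10):
    """Estimate frequencies of CNV alleles carrying 0..max_allele copies.

    observations: list of sets; each set holds the diploid copy numbers
                  compatible with one individual, e.g. {2} or {2, 3}.
    Assumes random mating, so a pair of alleles (a, b) has prior
    probability p[a] * p[b].
    """
    alleles = range(max_allele + 1)
    p = [1.0 / (max_allele + 1)] * (max_allele + 1)  # uniform start

    for _ in range(n_iter):
        counts = [0.0] * (max_allele + 1)
        for obs in observations:
            # E-step: weight every ordered allele pair matching the call.
            pairs = [(a, b) for a in alleles for b in alleles if a + b in obs]
            weights = [p[a] * p[b] for a, b in pairs]
            total = sum(weights)
            if total == 0:
                raise ValueError(f"observation {obs} has no compatible allele pair")
            for (a, b), w in zip(pairs, weights):
                counts[a] += w / total
                counts[b] += w / total
        # M-step: expected allele counts divided by number of chromosomes.
        new_p = [c / (2 * len(observations)) for c in counts]
        converged = max(abs(x - y) for x, y in zip(p, new_p)) < tol
        p = new_p
        if converged:
            break
    return p
```

For example, `em_cnv_allele_freqs([{2}, {2, 3}, {1}, {4}])` returns three frequencies (for 0-, 1- and 2-copy alleles) that sum to 1. Extending the state space from alleles to CNV-SNP haplotypes follows the same E-step/M-step pattern with a larger set of compatible pairs.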

14.
Despite considerable excitement over the potential functional significance of copy-number variants (CNVs), we still lack knowledge of the fine-scale architecture of the large majority of CNV regions in the human genome. In this study, we used a high-resolution array-based comparative genomic hybridization (aCGH) platform that targeted known CNV regions of the human genome at approximately 1 kb resolution to interrogate the genomic DNAs of 30 individuals from four HapMap populations. Our results revealed that 1020 of 1153 CNV loci (88%) were actually smaller in size than what is recorded in the Database of Genomic Variants based on previously published studies. A reduction in size of more than 50% was observed for 876 CNV regions (76%). We conclude that the total genomic content of currently known common human CNVs is likely smaller than previously thought. In addition, approximately 8% of the CNV regions observed in multiple individuals exhibited genomic architectural complexity in the form of smaller CNVs within larger ones and CNVs with interindividual variation in breakpoints. Future association studies that aim to capture the potential influences of CNVs on disease phenotypes will need to consider how to best ascertain this previously uncharacterized complexity.

15.
Genomics, 2022, 114(4): 110430
Ribosomal DNA genes (rDNA) encode the major ribosomal RNAs and in eukaryotes typically form tandem repeat arrays. Species have characteristic rDNA copy numbers, but there is substantial intra-species variation in copy number that results from frequent rDNA recombination. Copy number differences can have phenotypic consequences; however, difficulties in quantifying copy number mean we lack a comprehensive understanding of how copy number evolves and what the consequences are. Here we present a genomic sequence read approach to estimate rDNA copy number based on modal coverage to help overcome limitations with existing mean coverage-based approaches. We validated our method using Saccharomyces cerevisiae strains with known rDNA copy numbers. Application of our pipeline to a global sample of S. cerevisiae isolates showed that different populations have different rDNA copy numbers. Our results demonstrate the utility of the modal coverage method, and highlight the high level of rDNA copy number variation within and between populations.
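
The contrast between modal and mean coverage can be shown with a toy calculation: copy number is taken as the most frequent read depth over the rDNA unit divided by the most frequent depth over the rest of the genome. The abstract does not give the pipeline's binning or windowing details, so the per-position integer depths, the tie-breaking rule, and both function names below are assumptions for illustration only.

```python
from collections import Counter


def modal_depth(depths):
    """Return the most frequent depth value; ties go to the smaller depth."""
    counts = Counter(depths)
    return min(counts, key=lambda d: (-counts[d], d))


def estimate_rdna_copy_number(rdna_depths, genome_depths):
    """Estimate rDNA copy number as a ratio of modal read depths.

    rdna_depths:   per-position read depths across one rDNA unit
    genome_depths: per-position read depths across the rest of the genome
    """
    genome_mode = modal_depth(genome_depths)
    if genome_mode == 0:
        raise ValueError("modal genome depth is zero; cannot normalise")
    return modal_depth(rdna_depths) / genome_mode
```

With `rdna_depths = [1500, 1500, 1480, 1500, 9000]` and `genome_depths = [10, 10, 11, 9, 10, 200]`, the estimate is 150.0; the outlying depths (9000 and 200) would shift a mean-based ratio but leave the modal ratio unchanged.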

16.
Copy number variation (CNV) is implicated in important traits in multiple crop plants, but can be challenging to genotype using conventional methods. The Rhg1 locus of soybean, which confers resistance to soybean cyst nematode (SCN), is a CNV of multiple 31.2‐kb genomic units each containing four genes. Reliable, high‐throughput methods to quantify Rhg1 and other CNVs for selective breeding were developed. The CNV genotyping assay described here uses a homeologous gene copy within the paleopolyploid soybean genome to provide the internal control for a single‐tube TaqMan copy number assay. Using this assay, CNV in breeding populations can be tracked with high precision. We also show that extensive CNV exists within Fayette, a released, inbred SCN‐resistant soybean cultivar with a high copy number at Rhg1 derived from a single donor parent. Copy number at Rhg1 is therefore unstable within a released variety over a relatively small number of generations. Using this assay to select for individuals with altered copy number, plants were obtained with both increased copy number and increased SCN resistance relative to control plants. Thus, CNV genotyping technologies can be used as a new type of marker‐assisted selection to select for desirable traits in breeding populations, and to control for undesirable variation within cultivars.

17.

Background

Copy number variations (CNVs) confer significant effects on genetic innovation and phenotypic variation. Previous CNV studies in swine seldom focused on in-depth characterization of global CNVs.

Results

Using whole-genome assembly comparison (WGAC) and whole-genome shotgun sequence detection (WSSD) approaches based on next generation sequencing (NGS), we probed formation signatures of both segmental duplications (SDs) and individualized CNVs in an integrated fashion, building the finest resolution CNV and SD maps of pigs so far. We obtained copy number estimates of all protein-coding genes with copy number variation carried by individuals, and further confirmed two genes with high copy numbers in Meishan pigs through an enlarged population. We determined genome-wide CNV hotspots, which were significantly enriched in SD regions, suggesting that the evolution of CNV hotspots may be affected by ancestral SDs. Through systematic enrichment analyses based on simulations and bioinformatics analyses, we revealed that CNV-related genes undergo a different selective constraint from that of CNV-unrelated regions, and that CNVs may be associated with or affect pig health and production performance under recent selection.

Conclusions

Our studies lay out one approach to characterizing CNVs in the pig genome, provide insight into pig genome variation and prompt studies of CNV mechanisms when pigs are used as biomedical models for human diseases.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-593) contains supplementary material, which is available to authorized users.

18.
Obesity is one of the most complex human diseases and has attracted wide concern and study. More recently, copy number variations (CNVs) have emerged as another important type of genetic marker influencing various human diseases. To elucidate the relationship between obesity and CNVs, the current study selected obesity-related candidate CNVs and analyzed their association with body mass index (BMI). Results showed that CNV frequency at one locus, 8q24.3, differed significantly (P = 0.0070) between obese subjects and healthy controls in a young eastern Chinese cohort, while no statistical significance was observed at the other seven candidate loci, including the well-reported 10q11.22 and 16p11.2 loci. The association of 8q24.3 CNVs with the BMI of the subjects showed only marginal significance, while the copy number (CN) of 5p15.33 had a significant correlation with the BMI of the subjects. These results suggested that 8q24.3 CN gains were associated with obesity, and that 5p15.33 might also contribute to obesity pathogenesis, highlighting the importance of these CNVs for obesity risk, as well as providing new evidence for CNVs in the pathology of common diseases.

19.
The majority of biological traits are genetically complex. Mapping the quantitative trait loci (QTL) that determine these phenotypes is a powerful means for estimating many parameters of the genetic architecture for a trait and potentially identifying the genes responsible for natural variation. Typically, such experiments are conducted in a single mapping population and, therefore, have only the potential to reveal genomic regions that are polymorphic between the progenitors of the population. What remains unclear is how well the QTL identified in any one mapping experiment characterize the genetics that underlie natural variation in traits. Here we provide QTL mapping data for trichome density from four recombinant inbred mapping populations of Arabidopsis thaliana. By aligning the linkage maps for these four populations onto a common physical map, the results from each experiment were directly compared. Seven of the nine QTL identified are population specific while two were mapped in all four populations. Our results show that many lineage-specific alleles that either increase or decrease trichome density persist in natural populations and that most of this genetic variation is additive. More generally, these findings suggest that the use of multiple populations holds great promise for better understanding the genetic architecture of natural variation.

20.
Array-based comparative genomic hybridization (aCGH) is a molecular cytogenetic technique used in detecting and mapping DNA copy number alterations. aCGH is able to interrogate the entire genome at a previously unattainable, high resolution and has directly led to the recent appreciation of a novel class of genomic variation: copy number variation (CNV) in mammalian genomes. All forms of DNA variation/polymorphism are important for studying the basis of phenotypic diversity among individuals. CNV research is still in its infancy, requiring careful collation and annotation of accumulating CNV data that will undoubtedly be useful for accurate interpretation of genomic imbalances identified during cancer research.
