首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Submicroscopic (less than 2 Mb) segmental DNA copy number changes are a recently recognized source of genetic variability between individuals. The biological consequences of copy number variants (CNVs) are largely undefined. In some cases, CNVs that cause gene dosage effects have been implicated in phenotypic variation. CNVs have been detected in diverse species, including mice and humans. Published studies in mice have been limited by resolution and strain selection. We chose to study 21 well-characterized inbred mouse strains that are the focus of an international effort to measure, catalog, and disseminate phenotype data. We performed comparative genomic hybridization using long oligomer arrays to characterize CNVs in these strains. This technique increased the resolution of CNV detection by more than an order of magnitude over previous methodologies. The CNVs range in size from 21 to 2,002 kb. Clustering strains by CNV profile recapitulates aspects of the known ancestry of these strains. Most of the CNVs (77.5%) contain annotated genes, and many (47.5%) colocalize with previously mapped segmental duplications in the mouse genome. We demonstrate that this technique can identify copy number differences associated with known polymorphic traits. The phenotype of previously uncharacterized strains can be predicted based on their copy number at these loci. Annotation of CNVs in the mouse genome combined with sequence-based analysis provides an important resource that will help define the genetic basis of complex traits.  相似文献   

2.
Copy number variation refers to regions along chromosomes that harbor a type of structural variation, such as duplications or deletions. Copy number variants (CNVs) play a role in many important traits as well as in genetic diversity. Previous analyses of chickens using array comparative genomic hybridizations or single‐nucleotide polymorphism chip assays have been performed on various breeds and genetic lines to discover CNVs. In this study, we assessed individuals from two highly inbred (inbreeding coefficiency > 99.99%) lines, Leghorn G‐B2 and Fayoumi M15.2, to discover novel CNVs in chickens. These lines have been previously studied for disease resistance, and to our knowledge, this represents the first global assessment of CNVs in the Fayoumi breed. Genomic DNA from individuals was examined using the Agilent chicken 244 K comparative genomic hybridization array and quantitative PCR. We identified a total of 273 CNVs overall, with 112 CNVs being novel and not previously reported. Quantitative PCR using the standard curve method validated a subset of our array data. Through enrichment analysis of genes within CNV regions, we observed multiple chromosomes, terms and pathways that were significantly enriched, largely dealing with the major histocompatibility complex and immune responsiveness. Using an additional round of computational and statistical analysis with a different bioinformatic pipeline, we identified 43 CNVs among these as high‐confidence regions, 14 of which were found to be novel. We further compared and contrasted individuals of the two inbred lines to discover regions that have a significant difference in copy number between lines. A total of 40 regions had significant deletions or duplications between the lines. Gene Ontology analysis of genomic regions containing CNVs between lines also was performed. This between‐line candidate CNV list will be useful in studies with these two unique genetic lines, which may harbor variations that underlie quantitative trait loci for disease resistance and other important traits. Through the global discovery of novel CNVs in chicken, these data also provide resources for further genetic and functional genomics studies.  相似文献   

3.
Copy number variants (CNVs) are genomic rearrangements resulting from gains or losses of DNA segments. Typically, the term refers to rearrangements of sequences larger than 1 kb. This type of polymorphism has recently been shown to be a key contributor to intra-species genetic variation, along with single-nucleotide polymorphisms and short insertion-deletion polymorphisms. Over the last decade, a growing number of studies have highlighted the importance of copy number variation (CNV) as a factor affecting human phenotype and individual CNVs have been linked to risks for severe diseases. In plants, the exploration of the extent and role of CNV is still just beginning. Initial genomic analyses indicate that CNVs are prevalent in plants and have greatly affected plant genome evolution. Many CNV events have been observed in outcrossing and autogamous species. CNVs are usually found on all chromosomes, with CNV hotspots interspersed with regions of very low genetic variation. Although CNV is mainly associated with intergenic regions, many CNVs encompass protein-coding genes. The collected data suggest that CNV mainly affects the members of large families of functionally redundant genes. Thus, the effects of individual CNV events on phenotype are usually modest. Nevertheless, there are many cases in which CNVs for specific genes have been linked to important traits such as flowering time, plant height and resistance to biotic and abiotic stress. Recent reports suggest that CNVs may form rapidly in response to stress.  相似文献   

4.
Copy number variants (CNVs) contribute to human genetic and phenotypic diversity. However, the distribution of larger CNVs in the general population remains largely unexplored. We identify large variants in ~2500 individuals by using Illumina SNP data, with an emphasis on “hotspots” prone to recurrent mutations. We find variants larger than 500 kb in 5%–10% of individuals and variants greater than 1 Mb in 1%–2%. In contrast to previous studies, we find limited evidence for stratification of CNVs in geographically distinct human populations. Importantly, our sample size permits a robust distinction between truly rare and polymorphic but low-frequency copy number variation. We find that a significant fraction of individual CNVs larger than 100 kb are rare and that both gene density and size are strongly anticorrelated with allele frequency. Thus, although large CNVs commonly exist in normal individuals, which suggests that size alone can not be used as a predictor of pathogenicity, such variation is generally deleterious. Considering these observations, we combine our data with published CNVs from more than 12,000 individuals contrasting control and neurological disease collections. This analysis identifies known disease loci and highlights additional CNVs (e.g., 3q29, 16p12, and 15q25.2) for further investigation. This study provides one of the first analyses of large, rare (0.1%–1%) CNVs in the general population, with insights relevant to future analyses of genetic disease.  相似文献   

5.
We conducted a comprehensive study of copy number variants (CNVs) well-tagged by SNPs (r(2)≥ 0.8) by analyzing their effect on gene expression and their association with disease susceptibility and other complex human traits. We tested whether these CNVs were more likely to be functional than frequency-matched SNPs as trait-associated loci or as expression quantitative trait loci (eQTLs) influencing phenotype by altering gene regulation. Our study found that CNV-tagging SNPs are significantly enriched for cis eQTLs; furthermore, we observed that trait associations from the NHGRI catalog show an overrepresentation of SNPs tagging CNVs relative to frequency-matched SNPs. We found that these SNPs tagging CNVs are more likely to affect multiple expression traits than frequency-matched variants. Given these findings on the functional relevance of CNVs, we created an online resource of expression-associated CNVs (eCNVs) using the most comprehensive population-based map of CNVs to inform future studies of complex traits. Although previous studies of common CNVs that can be typed on existing platforms and/or interrogated by SNPs in genome-wide association studies concluded that such CNVs appear unlikely to have a major role in the genetic basis of several complex diseases examined, our findings indicate that it would be premature to dismiss the possibility that even common CNVs may contribute to complex phenotypes and at least some common diseases.  相似文献   

6.
Autism spectrum disorders (ASDs) are highly heritable and characterised by deficits in social interaction and communication, as well as restricted and repetitive behaviours. Although a number of highly penetrant ASD gene variants have been identified, there is growing evidence to support a causal role for combinatorial effects arising from the contributions of multiple loci. By examining synaptic and circadian neurological phenotypes resulting from the dosage variants of unique human:fly orthologues in Drosophila, we observe numerous synergistic interactions between pairs of informatically-identified candidate genes whose orthologues are jointly affected by large de novo copy number variants (CNVs). These CNVs were found in the genomes of individuals with autism, including a patient carrying a 22q11.2 deletion. We first demonstrate that dosage alterations of the unique Drosophila orthologues of candidate genes from de novo CNVs that harbour only a single candidate gene display neurological defects similar to those previously reported in Drosophila models of ASD-associated variants. We then considered pairwise dosage changes within the set of orthologues of candidate genes that were affected by the same single human de novo CNV. For three of four CNVs with complete orthologous relationships, we observed significant synergistic effects following the simultaneous dosage change of gene pairs drawn from a single CNV. The phenotypic variation observed at the Drosophila synapse that results from these interacting genetic variants supports a concordant phenotypic outcome across all interacting gene pairs following the direction of human gene copy number change. We observe both specificity and transitivity between interactors, both within and between CNV candidate gene sets, supporting shared and distinct genetic aetiologies. We then show that different interactions affect divergent synaptic processes, demonstrating distinct molecular aetiologies. Our study illustrates mechanisms through which synergistic effects resulting from large structural variation can contribute to human disease.  相似文献   

7.
Age-related macular degeneration (AMD) is a complex genetic disease, with many loci demonstrating appreciable attributable disease risk. Despite significant progress toward understanding the genetic and environmental etiology of AMD, identification of additional risk factors is necessary to fully appreciate and treat AMD pathology. In this study, we investigated copy number variants (CNVs) as potential AMD risk variants in a cohort of 400 AMD patients and 500 AMD-free controls ascertained at the University of Iowa. We used three publicly available copy number programs to analyze signal intensity data from Affymetrix GeneChip SNP Microarrays. CNVs were ranked based on prevalence in the disease cohort and absence from the control group; high interest CNVs were subsequently confirmed by qPCR. While we did not observe a single-locus "risk CNV" that could account for a major fraction of AMD, we identified several rare and overlapping CNVs containing or flanking compelling candidate genes such as NPHP1 and EFEMP1. These and other candidate genes highlighted by this study deserve further scrutiny as sources of genetic risk for AMD.  相似文献   

8.

Background  

Both somatic copy number alterations (CNAs) and germline copy number variants (CNVs) that are prevalent in healthy individuals can appear as recurrent changes in comparative genomic hybridization (CGH) analyses of tumors. In order to identify important cancer genes CNAs and CNVs must be distinguished. Although the Database of Genomic Variants (DGV) contains a list of all known CNVs, there is no standard methodology to use the database effectively.  相似文献   

9.
10.
Autism spectrum disorder (ASD) is characterized by impairments in reciprocal social interaction and communication, and by restricted and repetitive behaviors. Family studies indicate a significant genetic basis for ASD susceptibility, and genomic scanning is beginning to elucidate the underlying genetic architecture. Some 5-15% of individuals with ASD have an identifiable genetic etiology corresponding to known chromosomal rearrangements or single gene disorders. Rare (<1% frequency) de novo or inherited copy number variations (CNVs) (especially those that affect genes with synaptic function) are observed in 5-10% of idiopathic ASD cases. These findings, coupled with genome sequencing data suggest the existence of hundreds of ASD risk genes. Common variants, yet unidentified, exert only small effects on risk. Identification of ASD risk genes with high penetrance will broaden the targets amenable to genetic testing; while the biological pathways revealed by the deeper list of ASD genes should narrow the targets for therapeutic intervention.  相似文献   

11.
Gene expression as an intermediate molecular phenotype has been a focus of research interest. In particular, studies of expression quantitative trait loci (eQTL) have offered promise for understanding gene regulation through the discovery of genetic variants that explain variation in gene expression levels. Existing eQTL methods are designed for assessing the effects of common variants, but not rare variants. Here, we address the problem by establishing a novel analytical framework for evaluating the effects of rare or private variants on gene expression. Our method starts from the identification of outlier individuals that show markedly different gene expression from the majority of a population, and then reveals the contributions of private SNPs to the aberrant gene expression in these outliers. Using population-scale mRNA sequencing data, we identify outlier individuals using a multivariate approach. We find that outlier individuals are more readily detected with respect to gene sets that include genes involved in cellular regulation and signal transduction, and less likely to be detected with respect to the gene sets with genes involved in metabolic pathways and other fundamental molecular functions. Analysis of polymorphic data suggests that private SNPs of outlier individuals are enriched in the enhancer and promoter regions of corresponding aberrantly-expressed genes, suggesting a specific regulatory role of private SNPs, while the commonly-occurring regulatory genetic variants (i.e., eQTL SNPs) show little evidence of involvement. Additional data suggest that non-genetic factors may also underlie aberrant gene expression. Taken together, our findings advance a novel viewpoint relevant to situations wherein common eQTLs fail to predict gene expression when heritable, rare inter-individual variation exists. The analytical framework we describe, taking into consideration the reality of differential phenotypic robustness, may be valuable for investigating complex traits and conditions.  相似文献   

12.
Structural genetic changes, especially copy number variants (CNVs), represent a major source of genetic variation contributing to human disease. Tetralogy of Fallot (TOF) is the most common form of cyanotic congenital heart disease, but to date little is known about the role of CNVs in the etiology of TOF. Using high-resolution genome-wide microarrays and stringent calling methods, we investigated rare CNVs in a prospectively recruited cohort of 433 unrelated adults with TOF and/or pulmonary atresia at a single centre. We excluded those with recognized syndromes, including 22q11.2 deletion syndrome. We identified candidate genes for TOF based on converging evidence between rare CNVs that overlapped the same gene in unrelated individuals and from pathway analyses comparing rare CNVs in TOF cases to those in epidemiologic controls. Even after excluding the 53 (10.7%) subjects with 22q11.2 deletions, we found that adults with TOF had a greater burden of large rare genic CNVs compared to controls (8.82% vs. 4.33%, p?=?0.0117). Six loci showed evidence for recurrence in TOF or related congenital heart disease, including typical 1q21.1 duplications in four (1.18%) of 340 Caucasian probands. The rare CNVs implicated novel candidate genes of interest for TOF, including PLXNA2, a gene involved in semaphorin signaling. Independent pathway analyses highlighted developmental processes as potential contributors to the pathogenesis of TOF. These results indicate that individually rare CNVs are collectively significant contributors to the genetic burden of TOF. Further, the data provide new evidence for dosage sensitive genes in PLXNA2-semaphorin signaling and related developmental processes in human cardiovascular development, consistent with previous animal models.  相似文献   

13.
14.
Polygenic diseases with a broad phenotypic spectrum, such as polycystic ovary syndrome (PCOS), present a particular challenge in terms of identifying the underlying genetic mechanisms, nevertheless genetic variants have impact on the individual phenotype. We aimed to determine if next to genetic variations like SNPs further mechanisms might play a role in the pathogenesis of PCOS. We examined the effect of copy-number variations (CNVs) on metabolic phenotypes in PCOS. The intragenic rs1244979, rs2815752 in NEGR1 gene, and rs780094 in GCKR gene were genotyped and CNVs were determined by droplet digital polymerase chain reaction (ddPCR) in PCOS patients (n?=?153) and controls without metabolic syndrome (n?=?142). The study indicated that SNPs are not associated with the pathogenesis of PCOS but affect metabolic phenotypes. The CNVs investigated show a lower variability in PCOS than in CON. Furthermore, we provided direct evidence that the copy number, but not the genotype of the CNV in the genomic regions of rs780094(GCKR) is associated with low level of high-density lipoprotein cholesterol in PCOS. This study supports the hypothesis that not only genetic variants, but also CNVs in metabolically relevant genes, have an effect on metabolic phenotypes in our group of PCOS patients.  相似文献   

15.
《PloS one》2014,9(8)
Asthma is a complex genetic disease caused by a combination of genetic and environmental risk factors. We sought to test classes of genetic variants largely missed by genome-wide association studies (GWAS), including copy number variants (CNVs) and low-frequency variants, by performing whole-genome sequencing (WGS) on 16 individuals from asthma-enriched and asthma-depleted families. The samples were obtained from an extended 13-generation Hutterite pedigree with reduced genetic heterogeneity due to a small founding gene pool and reduced environmental heterogeneity as a result of a communal lifestyle. We sequenced each individual to an average depth of 13-fold, generated a comprehensive catalog of genetic variants, and tested the most severe mutations for association with asthma. We identified and validated 1960 CNVs, 19 nonsense or splice-site single nucleotide variants (SNVs), and 18 insertions or deletions that were out of frame. As follow-up, we performed targeted sequencing of 16 genes in 837 cases and 540 controls of Puerto Rican ancestry and found that controls carry a significantly higher burden of mutations in IL27RA (2.0% of controls; 0.23% of cases; nominal p = 0.004; Bonferroni p = 0.21). We also genotyped 593 CNVs in 1199 Hutterite individuals. We identified a nominally significant association (p = 0.03; Odds ratio (OR) = 3.13) between a 6 kbp deletion in an intron of NEDD4L and increased risk of asthma. We genotyped this deletion in an additional 4787 non-Hutterite individuals (nominal p = 0.056; OR = 1.69). NEDD4L is expressed in bronchial epithelial cells, and conditional knockout of this gene in the lung in mice leads to severe inflammation and mucus accumulation. Our study represents one of the early instances of applying WGS to complex disease with a large environmental component and demonstrates how WGS can identify risk variants, including CNVs and low-frequency variants, largely untested in GWAS.  相似文献   

16.
The discovery of genomic structural variants (SVs), such as copy number variants (CNVs), is essential to understand genetic variation of human populations and complex diseases. Over recent years, the advent of new high-throughput sequencing (HTS) platforms has opened many opportunities for SVs discovery, and a very promising approach consists in measuring the depth of coverage (DOC) of reads aligned to the human reference genome. At present, few computational methods have been developed for the analysis of DOC data and all of these methods allow to analyse only one sample at time. For these reasons, we developed a novel algorithm (JointSLM) that allows to detect common CNVs among individuals by analysing DOC data from multiple samples simultaneously. We test JointSLM performance on synthetic and real data and we show its unprecedented resolution that enables the detection of recurrent CNV regions as small as 500 bp in size. When we apply JointSLM to analyse chromosome one of eight genomes with different ancestry, we identify 3000 regions with recurrent CNVs of different frequency and size: hierarchical clustering on these regions segregates the eight individuals in two groups that reflect their ancestry, demonstrating the potential utility of JointSLM for population genetics studies.  相似文献   

17.
Copy-number variations (CNVs) are widespread in the human genome, but comprehensive assignments of integer locus copy-numbers (i.e., copy-number genotypes) that, for example, enable discrimination of homozygous from heterozygous CNVs, have remained challenging. Here we present CopySeq, a novel computational approach with an underlying statistical framework that analyzes the depth-of-coverage of high-throughput DNA sequencing reads, and can incorporate paired-end and breakpoint junction analysis based CNV-analysis approaches, to infer locus copy-number genotypes. We benchmarked CopySeq by genotyping 500 chromosome 1 CNV regions in 150 personal genomes sequenced at low-coverage. The assessed copy-number genotypes were highly concordant with our performed qPCR experiments (Pearson correlation coefficient 0.94), and with the published results of two microarray platforms (95-99% concordance). We further demonstrated the utility of CopySeq for analyzing gene regions enriched for segmental duplications by comprehensively inferring copy-number genotypes in the CNV-enriched >800 olfactory receptor (OR) human gene and pseudogene loci. CopySeq revealed that OR loci display an extensive range of locus copy-numbers across individuals, with zero to two copies in some OR loci, and two to nine copies in others. Among genetic variants affecting OR loci we identified deleterious variants including CNVs and SNPs affecting ~15% and ~20% of the human OR gene repertoire, respectively, implying that genetic variants with a possible impact on smell perception are widespread. Finally, we found that for several OR loci the reference genome appears to represent a minor-frequency variant, implying a necessary revision of the OR repertoire for future functional studies. CopySeq can ascertain genomic structural variation in specific gene families as well as at a genome-wide scale, where it may enable the quantitative evaluation of CNVs in genome-wide association studies involving high-throughput sequencing.  相似文献   

18.
Copy number variations (CNVs) constitute an important class of variation in the human genome and the interpretation of their pathogenicity considering different frequencies across populations is still a challenge for geneticists. Since the CNV databases are predominantly composed of European and non-admixed individuals, and Brazilian genetic constitution is admixed and ethnically diverse, diagnostic screenings on Brazilian variants are greatly difficulted by the lack of populational references. We analyzed a clinical sample of 268 Brazilian individuals, including patients with neurodevelopment disorders and/or congenital malformations. The pathogenicity of CNVs was classified according to their gene content and overlap with known benign and pathogenic variants. A total of 1,504 autosomal CNVs (1,207 gains and 297 losses) were classified as benign (92.9%), likely benign (1.6%), VUS (2.6%), likely pathogenic (0.2%) and pathogenic (2.7%). Some of the CNVs were recurrent and with frequency increased in our sample, when compared to populational open resources of structural variants: 14q32.33, 22q11.22, 1q21.1, and 1p36.32 gains. Thus, these highly recurrent CNVs classified as likely benign or VUS were considered non-pathogenic in our Brazilian sample. This study shows the relevance of introducing CNV data from diverse cohorts to improve on the interpretation of clinical impact of genomic variations.  相似文献   

19.
Genomic copy number variants (CNVs) have been implicated in multiple psychiatric disorders, but not much is known about their influence on anxiety disorders specifically. Using next-generation sequencing (NGS) and two additional array-based genotyping approaches, we detected CNVs in a mouse model consisting of two inbred mouse lines showing high (HAB) and low (LAB) anxiety-related behavior, respectively. An influence of CNVs on gene expression in the central (CeA) and basolateral (BLA) amygdala, paraventricular nucleus (PVN), and cingulate cortex (Cg) was shown by a two-proportion Z-test (p = 1.6 x 10-31), with a positive correlation in the CeA (p = 0.0062), PVN (p = 0.0046) and Cg (p = 0.0114), indicating a contribution of CNVs to the genetic predisposition to trait anxiety in the specific context of HAB/LAB mice. In order to confirm anxiety-relevant CNVs and corresponding genes in a second mouse model, we further examined CD-1 outbred mice. We revealed the distribution of CNVs by genotyping 64 CD 1 individuals using a high-density genotyping array (Jackson Laboratory). 78 genes within those CNVs were identified to show nominally significant association (48 genes), or a statistical trend in their association (30 genes) with the time animals spent on the open arms of the elevated plus-maze (EPM). Fifteen of them were considered promising candidate genes of anxiety-related behavior as we could show a significant overlap (permutation test, p = 0.0051) with genes within HAB/LAB CNVs. Thus, here we provide what is to our knowledge the first extensive catalogue of CNVs in CD-1 mice and potential corresponding candidate genes linked to anxiety-related behavior in mice.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号