期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

High resolution discovery and confirmation of copy number variants in 90 Yoruba Nigerians

Hajime Matsuzaki Pei-Hua Wang Jing Hu Rich Rava Glenn K Fu 《Genome biology》2009,10(11):R125-18

Background

Copy number variants (CNVs) account for a large proportion of genetic variation in the genome. The initial discoveries of long (> 100 kb) CNVs in normal healthy individuals were made on BAC arrays and low resolution oligonucleotide arrays. Subsequent studies that used higher resolution microarrays and SNP genotyping arrays detected the presence of large numbers of CNVs that are < 100 kb, with median lengths of approximately 10 kb. More recently, whole genome sequencing of individuals has revealed an abundance of shorter CNVs with lengths < 1 kb. 相似文献

2.

Detection of copy number variation from array intensity and sequencing read depth using a stepwise Bayesian model

Zhengdong D Zhang Mark B Gerstein 《BMC bioinformatics》2010,11(1):539

Background

Copy number variants (CNVs) have been demonstrated to occur at a high frequency and are now widely believed to make a significant contribution to the phenotypic variation in human populations. Array-based comparative genomic hybridization (array-CGH) and newly developed read-depth approach through ultrahigh throughput genomic sequencing both provide rapid, robust, and comprehensive methods to identify CNVs on a whole-genome scale. 相似文献

3.

Rare De Novo Copy Number Variants in Patients with Congenital Pulmonary Atresia

Li Xie Jin-Lan Chen Wei-Zhi Zhang Shou-Zheng Wang Tian-Li Zhao Can Huang Jian Wang Jin-Fu Yang Yi-Feng Yang Zhi-Ping Tan 《PloS one》2014,9(5)

Background

Ongoing studies using genomic microarrays and next-generation sequencing have demonstrated that the genetic contributions to cardiovascular diseases have been significantly ignored in the past. The aim of this study was to identify rare copy number variants in individuals with congenital pulmonary atresia (PA).

Methods and Results

Based on the hypothesis that rare structural variants encompassing key genes play an important role in heart development in PA patients, we performed high-resolution genome-wide microarrays for copy number variations (CNVs) in 82 PA patient-parent trios and 189 controls with an Illumina SNP array platform. CNVs were identified in 17/82 patients (20.7%), and eight of these CNVs (9.8%) are considered potentially pathogenic. Five de novo CNVs occurred at two known congenital heart disease (CHD) loci (16p13.1 and 22q11.2). Two de novo CNVs that may affect folate and vitamin B₁₂ metabolism were identified for the first time. A de novo 1-Mb deletion at 17p13.2 may represent a rare genomic disorder that involves mild intellectual disability and associated facial features.

Conclusions

Rare CNVs contribute to the pathogenesis of PA (9.8%), suggesting that the causes of PA are heterogeneous and pleiotropic. Together with previous data from animal models, our results might help identify a link between CHD and folate-mediated one-carbon metabolism (FOCM). With the accumulation of high-resolution SNP array data, these previously undescribed rare CNVs may help reveal critical gene(s) in CHD and may provide novel insights about CHD pathogenesis. 相似文献

4.

The effect of algorithms on copy number variant detection

Tsuang DW Millard SP Ely B Chi P Wang K Raskind WH Kim S Brkanac Z Yu CE 《PloS one》2010,5(12):e14456

Background

The detection of copy number variants (CNVs) and the results of CNV-disease association studies rely on how CNVs are defined, and because array-based technologies can only infer CNVs, CNV-calling algorithms can produce vastly different findings. Several authors have noted the large-scale variability between CNV-detection methods, as well as the substantial false positive and false negative rates associated with those methods. In this study, we use variations of four common algorithms for CNV detection (PennCNV, QuantiSNP, HMMSeg, and cnvPartition) and two definitions of overlap (any overlap and an overlap of at least 40% of the smaller CNV) to illustrate the effects of varying algorithms and definitions of overlap on CNV discovery.

Methodology and Principal Findings

We used a 56 K Illumina genotyping array enriched for CNV regions to generate hybridization intensities and allele frequencies for 48 Caucasian schizophrenia cases and 48 age-, ethnicity-, and gender-matched control subjects. No algorithm found a difference in CNV burden between the two groups. However, the total number of CNVs called ranged from 102 to 3,765 across algorithms. The mean CNV size ranged from 46 kb to 787 kb, and the average number of CNVs per subject ranged from 1 to 39. The number of novel CNVs not previously reported in normal subjects ranged from 0 to 212.

Conclusions and Significance

Motivated by the availability of multiple publicly available genome-wide SNP arrays, investigators are conducting numerous analyses to identify putative additional CNVs in complex genetic disorders. However, the number of CNVs identified in array-based studies, and whether these CNVs are novel or valid, will depend on the algorithm(s) used. Thus, given the variety of methods used, there will be many false positives and false negatives. Both guidelines for the identification of CNVs inferred from high-density arrays and the establishment of a gold standard for validation of CNVs are needed. 相似文献

5.

CNV Workshop: an integrated platform for high-throughput copy number variation discovery and clinical diagnostics

Xiaowu Gai Juan C Perin Kevin Murphy Ryan O'Hara Monica D'arcy Adam Wenocur Hongbo M Xie Eric F Rappaport Tamim H Shaikh Peter S White 《BMC bioinformatics》2010,11(1):74

Background

Recent studies have shown that copy number variations (CNVs) are frequent in higher eukaryotes and associated with a substantial portion of inherited and acquired risk for various human diseases. The increasing availability of high-resolution genome surveillance platforms provides opportunity for rapidly assessing research and clinical samples for CNV content, as well as for determining the potential pathogenicity of identified variants. However, few informatics tools for accurate and efficient CNV detection and assessment currently exist. 相似文献

6.

A classification model for distinguishing copy number variants from cancer-related alterations

Irina Ostrovnaya Gouri Nanjangud Adam B Olshen 《BMC bioinformatics》2010,11(1):297

Background

Both somatic copy number alterations (CNAs) and germline copy number variants (CNVs) that are prevalent in healthy individuals can appear as recurrent changes in comparative genomic hybridization (CGH) analyses of tumors. In order to identify important cancer genes CNAs and CNVs must be distinguished. Although the Database of Genomic Variants (DGV) contains a list of all known CNVs, there is no standard methodology to use the database effectively. 相似文献

7.

R-Gada: a fast and flexible pipeline for copy number analysis in association studies

Roger Pique-Regi Alejandro Cáceres Juan R González 《BMC bioinformatics》2010,11(1):380

Background

Genome-wide association studies (GWAS) using Copy Number Variation (CNV) are becoming a central focus of genetic research. CNVs have successfully provided target genome regions for some disease conditions where simple genetic variation (i.e., SNPs) has previously failed to provide a clear association. 相似文献

8.

Refinement of primate copy number variation hotspots identifies candidate genomic regions evolving under positive selection

Gokcumen O Babb PL Iskow RC Zhu Q Shi X Mills RE Ionita-Laza I Vallender EJ Clark AG Johnson WE Lee C 《Genome biology》2011,12(5):R52-11

Background

Copy number variants (CNVs), defined as losses and gains of segments of genomic DNA, are a major source of genomic variation.

Results

In this study, we identified over 2,000 human CNVs that overlap with orthologous chimpanzee or orthologous macaque CNVs. Of these, 170 CNVs overlap with both chimpanzee and macaque CNVs, and these were collapsed into 34 hotspot regions of CNV formation. Many of these hotspot regions of CNV formation are functionally relevant, with a bias toward genes involved in immune function, some of which were previously shown to evolve under balancing selection in humans. The genes in these primate CNV formation hotspots have significant differential expression levels between species and show evidence for positive selection, indicating that they have evolved under species-specific, directional selection.

Conclusions

These hotspots of primate CNV formation provide a novel perspective on divergence and selective pressures acting on these genomic regions. 相似文献

9.

A panel of microsatellites to individually identify leopards and its application to leopard monitoring in human dominated landscapes

Samrat Mondol Navya R Vidya Athreya Kartik Sunagar Velu Mani Selvaraj Uma Ramakrishnan 《BMC genetics》2009,10(1):1-7

Background

Copy number variants (CNVs) have been identified in several studies to be associated with complex diseases. It is important, therefore, to understand the distribution of CNVs within and among populations. This study is the first report of a CNV map in African Americans.

Results

Employing a SNP platform with greater than 500,000 SNPs, a first-generation CNV map of the African American genome was generated using DNA from 385 healthy African American individuals, and compared to a sample of 435 healthy White individuals. A total of 1362 CNVs were identified within African Americans, which included two CNV regions that were significantly different in frequency between African Americans and Whites (17q21 and 15q11). In addition, a duplication was identified in 74% of DNAs derived from cell lines that was not present in any of the whole blood derived DNAs.

Conclusion

The Affymetrix 500 K array provides reliable CNV mapping information. However, using cell lines as a source of DNA may introduce artifacts. The duplication identified in high frequency in Whites and low frequency in African Americans on chromosome 17q21 reflects haplotype specific frequency differences between ancestral groups. The generation of the CNV map will be a valuable tool for identifying disease associated CNVs in African Americans. 相似文献

10.

Elusive Copy Number Variation in the Mouse Genome

Avigail Agam Binnaz Yalcin Amarjit Bhomra Matthew Cubin Caleb Webber Christopher Holmes Jonathan Flint Richard Mott 《PloS one》2010,5(9)

Background

Array comparative genomic hybridization (aCGH) to detect copy number variants (CNVs) in mammalian genomes has led to a growing awareness of the potential importance of this category of sequence variation as a cause of phenotypic variation. Yet there are large discrepancies between studies, so that the extent of the genome affected by CNVs is unknown. We combined molecular and aCGH analyses of CNVs in inbred mouse strains to investigate this question.

Principal Findings

Using a 2.1 million probe array we identified 1,477 deletions and 499 gains in 7 inbred mouse strains. Molecular characterization indicated that approximately one third of the CNVs detected by the array were false positives and we estimate the false negative rate to be more than 50%. We show that low concordance between studies is largely due to the molecular nature of CNVs, many of which consist of a series of smaller deletions and gains interspersed by regions where the DNA copy number is normal.

Conclusions

Our results indicate that CNVs detected by arrays may be the coincidental co-localization of smaller CNVs, whose presence is more likely to perturb an aCGH hybridization profile than the effect of an isolated, small, copy number alteration. Our findings help explain the hitherto unexplored discrepancies between array-based studies of copy number variation in the mouse genome. 相似文献

11.

Accounting for uncertainty when assessing association between copy number and disease: a latent class model

Juan R González Isaac Subirana Geòrgia Escaramís Solymar Peraza Alejandro Cáceres Xavier Estivill Lluís Armengol 《BMC bioinformatics》2009,10(1):172

Background

Copy number variations (CNVs) may play an important role in disease risk by altering dosage of genes and other regulatory elements, which may have functional and, ultimately, phenotypic consequences. Therefore, determining whether a CNV is associated or not with a given disease might be relevant in understanding the genesis and progression of human diseases. Current stage technology give CNV probe signal from which copy number status is inferred. Incorporating uncertainty of CNV calling in the statistical analysis is therefore a highly important aspect. In this paper, we present a framework for assessing association between CNVs and disease in case-control studies where uncertainty is taken into account. We also indicate how to use the model to analyze continuous traits and adjust for confounding covariates. 相似文献

12.

De novo mutations in ARID1B associated with both syndromic and non-syndromic short stature

Yongguo Yu RuEn Yao Lili Wang Yanjie Fan Xiaodong Huang Joel Hirschhorn Andrew Dauber Yiping Shen 《BMC genomics》2015,16(1)

Background

Human height is a complex trait with a strong genetic basis. Recently, a significant association between rare copy number variations (CNVs) and short stature has been identified, and candidate genes in these rare CNVs are being explored. This study aims to evaluate the association between mutations in ARID1B gene and short stature, both the syndromic and non-syndromic form.

Results

Based on a case-control study of whole genome chromosome microarray analysis (CMA), three overlapping CNVs were identified in patients with developmental disorders who exhibited short stature. ARID1B, a causal gene for Coffin Siris syndrome, is the only gene encompassed by all three CNVs. A following retrospective genotype-phenotype analysis based on a literature review confirmed that short stature is a frequent feature in those Coffin-Siris syndrome patients with ARID1B mutations. Mutation screening of ARID1B coding regions was further conducted in a cohort of 48 non-syndromic short stature patients,andfour novel missense variants including two de novo mutations were found.

Conclusion

These results suggest that haploinsufficient mutations of ARID1B are associated with syndromic short stature including Coffin-Siris syndrome and intellectual disability, while rare missense variants in ARID1B are associated with non-syndromic short stature. This study supports the notion that mutations in genes related to syndromic short stature may exert milder effect and contribute to short stature in the general population.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1898-1) contains supplementary material, which is available to authorized users. 相似文献

13.

Novel Association Strategy with Copy Number Variation for Identifying New Risk Loci of Human Diseases

Xianfeng Chen Xinlei Li Ping Wang Yang Liu Zhenguo Zhang Guoping Zhao Haiming Xu Jun Zhu Xueying Qin Suchao Chen Landian Hu Xiangyin Kong 《PloS one》2010,5(8)

Background

Copy number variations (CNV) are important causal genetic variations for human disease; however, the lack of a statistical model has impeded the systematic testing of CNVs associated with disease in large-scale cohort.

Methodology/Principal Findings

Here, we developed a novel integrated strategy to test CNV-association in genome-wide case-control studies. We converted the single-nucleotide polymorphism (SNP) signal to copy number states using a well-trained hidden Markov model. We mapped the susceptible CNV-loci through SNP site-specific testing to cope with the physiological complexity of CNVs. We also ensured the credibility of the associated CNVs through further window-based CNV-pattern clustering. Genome-wide data with seven diseases were used to test our strategy and, in total, we identified 36 new susceptible loci that are associated with CNVs for the seven diseases: 5 with bipolar disorder, 4 with coronary artery disease, 1 with Crohn''s disease, 7 with hypertension, 9 with rheumatoid arthritis, 7 with type 1 diabetes and 3 with type 2 diabetes. Fifteen of these identified loci were validated through genotype-association and physiological function from previous studies, which provide further confidence for our results. Notably, the genes associated with bipolar disorder converged in the phosphoinositide/calcium signaling, a well-known affected pathway in bipolar disorder, which further supports that CNVs have impact on bipolar disorder.

Conclusions/Significance

Our results demonstrated the effectiveness and robustness of our CNV-association analysis and provided an alternative avenue for discovering new associated loci of human diseases. 相似文献

14.

CONAN: copy number variation analysis software for genome-wide association studies

Lukas Forer Sebastian Schönherr Hansi Weissensteiner Florian Haider Thomas Kluckner Christian Gieger Heinz-Erich Wichmann Günther Specht Florian Kronenberg Anita Kloss-Brandstätter 《BMC bioinformatics》2010,11(1):318

Background

Genome-wide association studies (GWAS) based on single nucleotide polymorphisms (SNPs) revolutionized our perception of the genetic regulation of complex traits and diseases. Copy number variations (CNVs) promise to shed additional light on the genetic basis of monogenic as well as complex diseases and phenotypes. Indeed, the number of detected associations between CNVs and certain phenotypes are constantly increasing. However, while several software packages support the determination of CNVs from SNP chip data, the downstream statistical inference of CNV-phenotype associations is still subject to complicated and inefficient in-house solutions, thus strongly limiting the performance of GWAS based on CNVs. 相似文献

15.

Partitioning of copy-number genotypes in pedigrees

Louis-Philippe Lemieux Perreault Gregor U Andelfinger Géraldine Asselin Marie-Pierre Dubé 《BMC bioinformatics》2010,11(1):226

Background

Copy number variations (CNVs) and polymorphisms (CNPs) have only recently gained the genetic community's attention. Conservative estimates have shown that CNVs and CNPs might affect more than 10% of the genome and that they may be at least as important as single nucleotide polymorphisms in assessing human variability. Widely used tools for CNP analysis have been implemented in Birdsuite and PLINK for the purpose of conducting genetic association studies based on the unpartitioned total number of CNP copies provided by the intensities from Affymetrix's Genome-Wide Human SNP Array. Here, we are interested in partitioning copy number variations and polymorphisms in extended pedigrees for the purpose of linkage analysis on familial data. 相似文献

16.

Precision-mapping and statistical validation of quantitative trait loci by machine learning

Justin Bedo Peter Wenzl Adam Kowalczyk Andrzej Kilian 《BMC genetics》2008,9(1):1-18

Background

DNA sequence diversity within the human genome may be more greatly affected by copy number variations (CNVs) than single nucleotide polymorphisms (SNPs). Although the importance of CNVs in genome wide association studies (GWAS) is becoming widely accepted, the optimal methods for identifying these variants are still under evaluation. We have previously reported a comprehensive view of CNVs in the HapMap DNA collection using high density 500 K EA (Early Access) SNP genotyping arrays which revealed greater than 1,000 CNVs ranging in size from 1 kb to over 3 Mb. Although the arrays used most commonly for GWAS predominantly interrogate SNPs, CNV identification and detection does not necessarily require the use of DNA probes centered on polymorphic nucleotides and may even be hindered by the dependence on a successful SNP genotyping assay.

Results

In this study, we have designed and evaluated a high density array predicated on the use of non-polymorphic oligonucleotide probes for CNV detection. This approach effectively uncouples copy number detection from SNP genotyping and thus has the potential to significantly improve probe coverage for genome-wide CNV identification. This array, in conjunction with PCR-based, complexity-reduced DNA target, queries over 1.3 M independent NspI restriction enzyme fragments in the 200 bp to 1100 bp size range, which is a several fold increase in marker density as compared to the 500 K EA array. In addition, a novel algorithm was developed and validated to extract CNV regions and boundaries.

Conclusion

Using a well-characterized pair of DNA samples, close to 200 CNVs were identified, of which nearly 50% appear novel yet were independently validated using quantitative PCR. The results indicate that non-polymorphic probes provide a robust approach for CNV identification, and the increasing precision of CNV boundary delineation should allow a more complete analysis of their genomic organization. 相似文献

17.

Genomic copy number variation in Mus musculus

M Elizabeth O Locke Maja Milojevic Susan T Eitutis Nisha Patel Andrea E Wishart Mark Daley Kathleen A Hill 《BMC genomics》2015,16(1)

Background

Copy number variation is an important dimension of genetic diversity and has implications in development and disease. As an important model organism, the mouse is a prime candidate for copy number variant (CNV) characterization, but this has yet to be completed for a large sample size. Here we report CNV analysis of publicly available, high-density microarray data files for 351 mouse tail samples, including 290 mice that had not been characterized for CNVs previously.

Results

We found 9634 putative autosomal CNVs across the samples affecting 6.87 % of the mouse reference genome. We find significant differences in the degree of CNV uniqueness (single sample occurrence) and the nature of CNV-gene overlap between wild-caught mice and classical laboratory strains. CNV-gene overlap was associated with lipid metabolism, pheromone response and olfaction compared to immunity, carbohydrate metabolism and amino-acid metabolism for wild-caught mice and classical laboratory strains, respectively. Using two subspecies of wild-caught Mus musculus, we identified putative CNVs unique to those subspecies and show this diversity is better captured by wild-derived laboratory strains than by the classical laboratory strains. A total of 9 genic copy number variable regions (CNVRs) were selected for experimental confirmation by droplet digital PCR (ddPCR).

Conclusion

The analysis we present is a comprehensive, genome-wide analysis of CNVs in Mus musculus, which increases the number of known variants in the species and will accelerate the identification of novel variants in future studies.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1713-z) contains supplementary material, which is available to authorized users. 相似文献

18.

A method of predicting changes in human gene splicing induced by genetic variants in context of cis-acting elements

Alexander Churbanov Igor Vořechovský Chindo Hicks 《BMC bioinformatics》2010,11(1):22

Background

Polymorphic variants and mutations disrupting canonical splicing isoforms are among the leading causes of human hereditary disorders. While there is a substantial evidence of aberrant splicing causing Mendelian diseases, the implication of such events in multi-genic disorders is yet to be well understood. We have developed a new tool (SpliceScan II) for predicting the effects of genetic variants on splicing and cis-regulatory elements. The novel Bayesian non-canonical 5'GC splice site (SS) sensor used in our tool allows inference on non-canonical exons. 相似文献

19.

Identification of copy number variants from exome sequence data

Pubudu Saneth Samarakoon Hanne S?rmo Sorte Bj?rn Evert Kristiansen Tove Skodje Ying Sheng Geir E Tj?nnfjord Barbro Stadheim Asbj?rg Stray-Pedersen Olaug Kristin R?dningen Robert Lyle 《BMC genomics》2014,15(1)

Background

With advances in next generation sequencing technologies and genomic capture techniques, exome sequencing has become a cost-effective approach for mutation detection in genetic diseases. However, computational prediction of copy number variants (CNVs) from exome sequence data is a challenging task. Whilst numerous programs are available, they have different sensitivities, and have low sensitivity to detect smaller CNVs (1–4 exons). Additionally, exonic CNV discovery using standard aCGH has limitations due to the low probe density over exonic regions. The goal of our study was to develop a protocol to detect exonic CNVs (including shorter CNVs that cover 1–4 exons), combining computational prediction algorithms and a high-resolution custom CGH array.

Results

We used six published CNV prediction programs (ExomeCNV, CONTRA, ExomeCopy, ExomeDepth, CoNIFER, XHMM) and an in-house modification to ExomeCopy and ExomeDepth (ExCopyDepth) for computational CNV prediction on 30 exomes from the 1000 genomes project and 9 exomes from primary immunodeficiency patients. CNV predictions were tested using a custom CGH array designed to capture all exons (exaCGH). After this validation, we next evaluated the computational prediction of shorter CNVs. ExomeCopy and the in-house modified algorithm, ExCopyDepth, showed the highest capability in detecting shorter CNVs. Finally, the performance of each computational program was assessed by calculating the sensitivity and false positive rate.

Conclusions

In this paper, we assessed the ability of 6 computational programs to predict CNVs, focussing on short (1–4 exon) CNVs. We also tested these predictions using a custom array targeting exons. Based on these results, we propose a protocol to identify and confirm shorter exonic CNVs combining computational prediction algorithms and custom aCGH experiments.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-661) contains supplementary material, which is available to authorized users. 相似文献

20.

iHAP – integrated haplotype analysis pipeline for characterizing the haplotype structure of genes

Chun Meng Song Boon Huat Yeo Erwin Tantoso Yuchen Yang Yun Ping Lim Kuo-Bin Li Gunaretnam Rajagopal 《BMC bioinformatics》2006,7(1):525

Background

The advent of genotype data from large-scale efforts that catalog the genetic variants of different populations have given rise to new avenues for multifactorial disease association studies. Recent work shows that genotype data from the International HapMap Project have a high degree of transferability to the wider population. This implies that the design of genotyping studies on local populations may be facilitated through inferences drawn from information contained in HapMap populations. 相似文献