共查询到20条相似文献,搜索用时 68 毫秒
1.
Timour Baslan Sam Kovaka Fritz J Sedlazeck Yanming Zhang Robert Wappel Sha Tian Scott
W Lowe Sara Goodwin Michael
C Schatz 《Nucleic acids research》2021,49(21):e124
Genome copy number is an important source of genetic variation in health and disease. In cancer, Copy Number Alterations (CNAs) can be inferred from short-read sequencing data, enabling genomics-based precision oncology. Emerging Nanopore sequencing technologies offer the potential for broader clinical utility, for example in smaller hospitals, due to lower instrument cost, higher portability, and ease of use. Nonetheless, Nanopore sequencing devices are limited in the number of retrievable sequencing reads/molecules compared to short-read sequencing platforms, limiting CNA inference accuracy. To address this limitation, we targeted the sequencing of short-length DNA molecules loaded at optimized concentration in an effort to increase sequence read/molecule yield from a single nanopore run. We show that short-molecule nanopore sequencing reproducibly returns high read counts and allows high quality CNA inference. We demonstrate the clinical relevance of this approach by accurately inferring CNAs in acute myeloid leukemia samples. The data shows that, compared to traditional approaches such as chromosome analysis/cytogenetics, short molecule nanopore sequencing returns more sensitive, accurate copy number information in a cost effective and expeditious manner, including for multiplex samples. Our results provide a framework for short-molecule nanopore sequencing with applications in research and medicine, which includes but is not limited to, CNAs. 相似文献
2.
3.
Ionita-Laza I Laird NM Raby BA Weiss ST Lange C 《Bioinformatics (Oxford, England)》2008,24(20):2350-2355
MOTIVATION: Estimating the frequency distribution of copy number variants (CNVs) is an important aspect of the effort to characterize this new type of genetic variation. Currently, most studies report a strong skew toward low-frequency CNVs. In this article, our goal is to investigate the frequencies of CNVs. We employ a two-step procedure for the CNV frequency estimation process. We use family information a posteriori to select only the most reliable CNV regions, i.e. those showing high rates of Mendelian transmission. RESULTS: Our results suggest that the current skew toward low-frequency CNVs may not be representative of the true frequency distribution, but may be due, among other reasons, to the non-negligible false negative rates that characterize CNV detection methods. Moreover, false positives are also likely, as low-frequency CNVs are hard to detect with small sample sizes and technologies that are not ideally suited for their detection. Without appropriate validation methods, such as incorporation of biologically relevant information (for example, in our case, the transmission of heritable CNVs from parents to offspring), it is difficult to assess the validity of specific CNVs, and even harder to obtain reliable frequency estimates. 相似文献
4.
Copy number variation (CNV) is likely to be an important component of heritable variation in livestock. To characterise CNVs in cattle, we performed a genome wide survey to determine the number, location and gene content of these genomic features. A tiling oligonucleotide array with ~385,000 probes was used for comparative genomic hybridisation of both taurine and zebu cattle. Using a conservative set of calling criteria, a total of 51 CNV were detected that collectively spanned approximately half of one percent of the bovine genome. The size of the average CNV within each animal ranged from 213 kb up to 335 kb. Half of the CNV were detected in a single animal only, whilst the remainder was independently identified in multiple individuals. Analysis was performed to determine the gene content for each CNV region. This revealed that the majority of CNV (82%) spanned at least one gene, with a number of CNV containing genes which are known to control aspects of phenotypic variation in cattle. Whilst additional studies are required to determine the impact of individual CNV, this study confirmed them as an important class of genomic variation in cattle. 相似文献
5.
We develop a coalescent-based simulation tool to generate patterns of single nucleotide polymorphisms (SNPs) in a wide region encompassing both the original and duplicated genes. Selection on the new duplicated copy and interlocus gene conversion between the two copies are incorporated. This simulation enables us to explore how selection on duplicated copies affects the pattern of SNPs. The fixation of an advantageous duplicated copy causes a strong reduction in polymorphism not only in the duplicated copy but also in its flanking regions, which is a typical signature of a selective sweep by positive selection. After fixation, polymorphism gradually increases by accumulating neutral mutations and eventually reaches the equilibrium value if there is no gene conversion. When gene conversion is active, the number of SNPs in the duplicated copy quickly increases by transferring SNPs from the original copy; therefore, the time when we can recognize the signature of selection is decreased. Because this effect of gene conversion is restricted only to the duplicated region, more power to detect selection is expected if a flanking region to the duplicated copy is used. 相似文献
6.
Beata Nowakowska 《Journal of applied genetics》2017,58(4):449-457
Molecular methods, by which copy number variants (CNVs) detection is available, have been gradually introduced into routine diagnostics over the last 15 years. Despite this, some CNVs continue to be a huge challenge when it comes to clinical interpretation. CNVs are an important source of normal and pathogenic variants, but, in many cases, their impact on human health depends on factors that are not yet known. Therefore, perception of their clinical consequences can change over time, as our knowledge grows. This review summarises guidelines that facilitate correct classification of identified changes and discusses difficulties with the interpretation of rare, small CNVs. 相似文献
7.
Over the past decade, the ubiquity of copy number variants (CNVs, the gain or loss of genomic material) in the genomes of healthy humans has become apparent. Although some of these variants are associated with disorders, a handful of studies documented an adaptive advantage conferred by CNVs. In this review, we propose that CNVs are substrates for human evolution and adaptation. We discuss the possible mechanisms and evolutionary processes in which CNVs are selected, outline the current challenges in identifying these loci, and highlight that copy number variable regions allow for the creation of novel genes that may diversify the repertoire of such genes in response to rapidly changing environments. We expect that many more adaptive CNVs will be discovered in the coming years, and we believe that these new findings will contribute to our understanding of human-specific phenotypes. 相似文献
8.
Meyer KJ Davis LK Schindler EI Beck JS Rudd DS Grundstad AJ Scheetz TE Braun TA Fingert JH Alward WL Kwon YH Folk JC Russell SR Wassink TH Stone EM Sheffield VC 《Human genetics》2011,129(1):91-100
Age-related macular degeneration (AMD) is a complex genetic disease, with many loci demonstrating appreciable attributable disease risk. Despite significant progress toward understanding the genetic and environmental etiology of AMD, identification of additional risk factors is necessary to fully appreciate and treat AMD pathology. In this study, we investigated copy number variants (CNVs) as potential AMD risk variants in a cohort of 400 AMD patients and 500 AMD-free controls ascertained at the University of Iowa. We used three publicly available copy number programs to analyze signal intensity data from Affymetrix GeneChip SNP Microarrays. CNVs were ranked based on prevalence in the disease cohort and absence from the control group; high interest CNVs were subsequently confirmed by qPCR. While we did not observe a single-locus "risk CNV" that could account for a major fraction of AMD, we identified several rare and overlapping CNVs containing or flanking compelling candidate genes such as NPHP1 and EFEMP1. These and other candidate genes highlighted by this study deserve further scrutiny as sources of genetic risk for AMD. 相似文献
9.
Pubudu Saneth Samarakoon Hanne S?rmo Sorte Bj?rn Evert Kristiansen Tove Skodje Ying Sheng Geir E Tj?nnfjord Barbro Stadheim Asbj?rg Stray-Pedersen Olaug Kristin R?dningen Robert Lyle 《BMC genomics》2014,15(1)
Background
With advances in next generation sequencing technologies and genomic capture techniques, exome sequencing has become a cost-effective approach for mutation detection in genetic diseases. However, computational prediction of copy number variants (CNVs) from exome sequence data is a challenging task. Whilst numerous programs are available, they have different sensitivities, and have low sensitivity to detect smaller CNVs (1–4 exons). Additionally, exonic CNV discovery using standard aCGH has limitations due to the low probe density over exonic regions. The goal of our study was to develop a protocol to detect exonic CNVs (including shorter CNVs that cover 1–4 exons), combining computational prediction algorithms and a high-resolution custom CGH array.Results
We used six published CNV prediction programs (ExomeCNV, CONTRA, ExomeCopy, ExomeDepth, CoNIFER, XHMM) and an in-house modification to ExomeCopy and ExomeDepth (ExCopyDepth) for computational CNV prediction on 30 exomes from the 1000 genomes project and 9 exomes from primary immunodeficiency patients. CNV predictions were tested using a custom CGH array designed to capture all exons (exaCGH). After this validation, we next evaluated the computational prediction of shorter CNVs. ExomeCopy and the in-house modified algorithm, ExCopyDepth, showed the highest capability in detecting shorter CNVs. Finally, the performance of each computational program was assessed by calculating the sensitivity and false positive rate.Conclusions
In this paper, we assessed the ability of 6 computational programs to predict CNVs, focussing on short (1–4 exon) CNVs. We also tested these predictions using a custom array targeting exons. Based on these results, we propose a protocol to identify and confirm shorter exonic CNVs combining computational prediction algorithms and custom aCGH experiments.Electronic supplementary material
The online version of this article (doi:10.1186/1471-2164-15-661) contains supplementary material, which is available to authorized users. 相似文献10.
11.
12.
13.
Magi A Tattini L Pippucci T Torricelli F Benelli M 《Bioinformatics (Oxford, England)》2012,28(4):470-478
MOTIVATION: The advent of high-throughput sequencing technologies is revolutionizing our ability in discovering and genotyping DNA copy number variants (CNVs). Read count-based approaches are able to detect CNV regions with an unprecedented resolution. Although this computational strategy has been recently introduced in literature, much work has been already done for the preparation, normalization and analysis of this kind of data. RESULTS: Here we face the many aspects that cover the detection of CNVs by using read count approach. We first study the characteristics and systematic biases of read count distributions, focusing on the normalization methods designed for removing these biases. Subsequently, we compare the algorithms designed to detect the boundaries of CNVs and we investigate the ability of read count data to predict the exact number of DNA copy. Finally, we review the tools publicly available for analysing read count data. To better understand the state of the art of read count approaches, we compare the performance of the three most widely used sequencing technologies (Illumina Genome Analyzer, Roche 454 and Life Technologies SOLiD) in all the analyses that we perform. 相似文献
14.
Chia-Huei Lee Chih-Min Liu Chun-Chiang Wen Shun-Min Chang Hai-Gwo Hwu 《Journal of biomedical science》2010,17(1):2
Background
Schizophrenia is a complex disorder with involvement of multiple genes. 相似文献15.
Alberto Magi Lorenzo Tattini Ingrid Cifola Romina D’Aurizio Matteo Benelli Eleonora Mangano Cristina Battaglia Elena Bonora Ants Kurg Marco Seri Pamela Magini Betti Giusti Giovanni Romeo Tommaso Pippucci Gianluca De Bellis Rosanna Abbate Gian Franco Gensini 《Genome biology》2013,14(10):R120
We developed a novel software tool, EXCAVATOR, for the detection of copy number variants (CNVs) from whole-exome sequencing data. EXCAVATOR combines a three-step normalization procedure with a novel heterogeneous hidden Markov model algorithm and a calling method that classifies genomic regions into five copy number states. We validate EXCAVATOR on three datasets and compare the results with three other methods. These analyses show that EXCAVATOR outperforms the other methods and is therefore a valuable tool for the investigation of CNVs in largescale projects, as well as in clinical research and diagnostics. EXCAVATOR is freely available at http://sourceforge.net/projects/excavatortool/. 相似文献
16.
MP Hitz LP Lemieux-Perreault C Marshall Y Feroz-Zada R Davies SW Yang AC Lionel G D'Amours E Lemyre R Cullum JL Bigras M Thibeault P Chetaille A Montpetit P Khairy B Overduin S Klaassen P Hoodless M Nemer AF Stewart C Boerkoel SW Scherer A Richter MP Dubé G Andelfinger 《PLoS genetics》2012,8(9):e1002903
17.
Pugh TJ Delaney AD Farnoud N Flibotte S Griffith M Li HI Qian H Farinha P Gascoyne RD Marra MA 《Nucleic acids research》2008,36(13):e80
Large-scale copy number variants (CNVs) have recently been recognized to play a role in human genome variation and disease. Approaches for analysis of CNVs in small samples such as microdissected tissues can be confounded by limited amounts of material. To facilitate analyses of such samples, whole genome amplification (WGA) techniques were developed. In this study, we explored the impact of Phi29 multiple-strand displacement amplification on detection of CNVs using oligonucleotide arrays. We extracted DNA from fresh frozen lymph node samples and used this for amplification and analysis on the Affymetrix Mapping 500k SNP array platform. We demonstrated that the WGA procedure introduces hundreds of potentially confounding CNV artifacts that can obscure detection of bona fide variants. Our analysis indicates that many artifacts are reproducible, and may correlate with proximity to chromosome ends and GC content. Pair-wise comparison of amplified products considerably reduced the number of apparent artifacts and partially restored the ability to detect real CNVs. Our results suggest WGA material may be appropriate for copy number analysis when amplified samples are compared to similarly amplified samples and that only the CNVs with the greatest significance values detected by such comparisons are likely to be representative of the unamplified samples. 相似文献
18.
Egle Preiksaitiene Alma Molytė Jurate Kasnauskiene Zivile Ciuladaite Algirdas Utkus Philippos C. Patsalis Vaidutis Kučinskas 《Journal of applied genetics》2014,55(2):189-196
Since the introduction of high-resolution microarray technologies, it has become apparent that structural chromosomal rearrangements can lead to a wide variety of clinical manifestations, including developmental delay/intellectual disability (DD/ID). It has been shown previously that the diagnostic yield of genome-wide array-based identification of submicroscopic alterations in patients with ID varies widely and depends on the patient selection criteria. More attempts have recently been made to define the phenotypic clues of pathogenic copy number variants (CNVs). The aim of this study was to investigate a well-phenotyped cohort of patients with DD/ID and determine whether certain clinical features may serve as indicators for pathogenic CNVs. A retrospective analysis was conducted for patients with DD/ID (n?=?211) who were tested using genome-wide chromosomal microarray technologies and a review of the clinical data was performed. Pathogenic CNVs were detected in 29 patients. In comparison with individuals who had normal molecular karyotyping results (n?=?182), malformations of the musculoskeletal system; congenital malformations of the CNS (particularly hydrocephalus and congenital malformations of the corpus callosum); minor anomalies of the eye, face, and neck subgroup (particularly downward-slanting palpebral fissures, minor anomalies of the ear, and micrognathia); brachydactyly; and umbilical hernia were more common in patients with chromosomal alterations. A multivariate logistic regression analysis allowed the identification of three independent pathogenic CNV predictors: congenital malformations of the corpus callosum, minor anomalies of the ear, and brachydactyly. Insights into the chromosomal phenotype may help to increase the diagnostic yield of microarray technologies and sharpen the distinction between chromosomal alterations and other conditions. 相似文献
19.
While complex diseases, such as inflammatory bowel disease, do not follow distinctive Mendelian inheritance patterns, there is now considerable evidence from twin and pedigree studies to show that there are significant genetic influences in the development of many such diseases. In times past, this type of information was considered to be interesting, and was used mainly to alert other members of the families that they may also be at increased risk of developing the disease. However, with the ability to evaluate the genetic basis of common disease, this information will have important consequences for the diagnosis, prevention and treatment of the disorder. The genetic basis for common disease is likely to be more complicated than we had previously anticipated, since we now recognise epigenetic causes of disease, and other subtle gene regulatory mechanisms. Copy number variants have been highlighted in this review, as being a phenomenon that we have known about for a long time, but that has not previously been clearly associated with human disease. As complex disease is related to changes in gene expression, any variation in the human genome that alters gene expression is now a candidate for being involved in the disease process. 相似文献
20.
S. Bovo A. Ribani M. Muñoz E. Alves J. P. Araujo R. Bozzi R. Charneca F. Di Palma G. Etherington A. I. Fernandez F. García J. García-Casco D. Karolyi M. Gallo K. Gvozdanović J. M. Martins M. J. Mercat Y. Núñez R. Quintanilla Č. Radović V. Razmaite J. Riquet R. Savić G. Schiavo M. Škrlep G. Usai V. J. Utzeri C. Zimmer C. Ovilo L. Fontanesi 《Animal genetics》2020,51(4):541-556
In this study, we identified copy number variants (CNVs) in 19 European autochthonous pig breeds and in two commercial breeds (Italian Large White and Italian Duroc) that represent important genetic resources for this species. The genome of 725 pigs was sequenced using a breed-specific DNA pooling approach (30–35 animals per pool) obtaining an average depth per pool of 42×. This approach maximised CNV discovery as well as the related copy number states characterising, on average, the analysed breeds. By mining more than 17.5 billion reads, we identified a total of 9592 CNVs (~683 CNVs per breed) and 3710 CNV regions (CNVRs; 1.15% of the reference pig genome), with an average of 77 CNVRs per breed that were considered as private. A few CNVRs were analysed in more detail, together with other information derived from sequencing data. For example, the CNVR encompassing the KIT gene was associated with coat colour phenotypes in the analysed breeds, confirming the role of the multiple copies in determining breed-specific coat colours. The CNVR covering the MSRB3 gene was associated with ear size in most breeds. The CNVRs affecting the ELOVL6 and ZNF622 genes were private features observed in the Lithuanian Indigenous Wattle and in the Turopolje pig breeds respectively. Overall, the genome variability unravelled here can explain part of the genetic diversity among breeds and might contribute to explain their origin, history and adaptation to a variety of production systems. 相似文献