共查询到20条相似文献,搜索用时 31 毫秒
1.
Andre Loboda Michael V Nebozhyn James W Watters Carolyne A Buser Peter Martin Shaw Pearl S Huang Laura Van’t Veer Rob AEM Tollenaar David B Jackson Deepak Agrawal Hongyue Dai Timothy J Yeatman 《BMC medical genomics》2011,4(1):1-10
Background
Copy number alterations (CNA) play a key role in cancer development and progression. Since more than one CNA can be detected in most tumors, frequently co-occurring genetic CNA may point to cooperating cancer related genes. Existing methods for co-occurrence evaluation so far have not considered the overall heterogeneity of CNA per tumor, resulting in a preferential detection of frequent changes with limited specificity for each association due to the high genetic instability of many samples.Method
We hypothesize that in cancer some linkage-independent CNA may display a non-random co-occurrence, and that these CNA could be of pathogenetic relevance for the respective cancer. We also hypothesize that the statistical relevance of co-occurring CNA may depend on the sample specific CNA complexity. We verify our hypotheses with a simulation based algorithm CDCOCA (complexity dependence of co-occurring chromosomal aberrations).Results
Application of CDCOCA to example data sets identified co-occurring CNA from low complex background which otherwise went unnoticed. Identification of cancer associated genes in these co-occurring changes can provide insights of cooperative genes involved in oncogenesis.Conclusions
We have developed a method to detect associations of regional copy number abnormalities in cancer data. Along with finding statistically relevant CNA co-occurrences, our algorithm points towards a generally low specificity for co-occurrence of regional imbalances in CNA rich samples, which may have negative impact on pathway modeling approaches relying on frequent CNA events. 相似文献2.
Panagiotis Moulos Olga Papadodima Aristotelis Chatziioannou Heleni Loutrari Charis Roussos Fragiskos N Kolisis 《BMC medical genomics》2009,2(1):1-15
Background
DNA copy number alterations are frequently observed in ovarian cancer, but it remains a challenge to identify the most relevant alterations and the specific causal genes in those regions.Methods
We obtained high-resolution 500K SNP array data for 52 ovarian tumors and identified the most statistically significant minimal genomic regions with the most prevalent and highest-level copy number alterations (recurrent CNAs). Within a region of recurrent CNA, comparison of expression levels in tumors with a given CNA to tumors lacking that CNA and to whole normal ovary samples was used to select genes with CNA-specific expression patterns. A public expression array data set of laser capture micro-dissected (LCM) non-malignant fallopian tube epithelia and LCM ovarian serous adenocarcinoma was used to evaluate the effect of cell-type mixture biases.Results
Fourteen recurrent deletions were detected on chromosomes 4, 6, 9, 12, 13, 15, 16, 17, 18, 22 and most prevalently on X and 8. Copy number and expression data suggest several apoptosis mediators as candidate drivers of the 8p deletions. Sixteen recurrent gains were identified on chromosomes 1, 2, 3, 5, 8, 10, 12, 15, 17, 19, and 20, with the most prevalent gains localized to 8q and 3q. Within the 8q amplicon, PVT1, but not MYC, was strongly over-expressed relative to tumors lacking this CNA and showed over-expression relative to normal ovary. Likewise, the cell polarity regulators PRKCI and ECT2 were identified as putative drivers of two distinct amplicons on 3q. Co-occurrence analyses suggested potential synergistic or antagonistic relationships between recurrent CNAs. Genes within regions of recurrent CNA showed an enrichment of Cancer Census genes, particularly when filtered for CNA-specific expression.Conclusion
These analyses provide detailed views of ovarian cancer genomic changes and highlight the benefits of using multiple reference sample types for the evaluation of CNA-specific expression changes. 相似文献3.
Beibei Guo Alejandro Villagran Marina Vannucci Jian Wang Caleb Davis Tsz-Kwong Man Ching Lau Rudy Guerra 《BMC research notes》2010,3(1):1-18
Background
The identification of copy number aberration in the human genome is an important area in cancer research. We develop a model for determining genomic copy numbers using high-density single nucleotide polymorphism genotyping microarrays. The method is based on a Bayesian spatial normal mixture model with an unknown number of components corresponding to true copy numbers. A reversible jump Markov chain Monte Carlo algorithm is used to implement the model and perform posterior inference.Results
The performance of the algorithm is examined on both simulated and real cancer data, and it is compared with the popular CNAG algorithm for copy number detection.Conclusions
We demonstrate that our Bayesian mixture model performs at least as well as the hidden Markov model based CNAG algorithm and in certain cases does better. One of the added advantages of our method is the flexibility of modeling normal cell contamination in tumor samples. 相似文献4.
Background
DNA copy number alterations are one of the main characteristics of the cancer cell karyotype and can contribute to the complex phenotype of these cells. These alterations can lead to gains in cellular oncogenes as well as losses in tumor suppressor genes and can span small intervals as well as involve entire chromosomes. The ability to accurately detect these changes is central to understanding how they impact the biology of the cell.Results
We describe a novel algorithm called CARAT (Copy Number Analysis with Regression And Tree) that uses probe intensity information to infer copy number in an allele-specific manner from high density DNA oligonuceotide arrays designed to genotype over 100, 000 SNPs. Total and allele-specific copy number estimations using CARAT are independently evaluated for a subset of SNPs using quantitative PCR and allelic TaqMan reactions with several human breast cancer cell lines. The sensitivity and specificity of the algorithm are characterized using DNA samples containing differing numbers of X chromosomes as well as a test set of normal individuals. Results from the algorithm show a high degree of agreement with results from independent verification methods.Conclusion
Overall, CARAT automatically detects regions with copy number variations and assigns a significance score to each alteration as well as generating allele-specific output. When coupled with SNP genotype calls from the same array, CARAT provides additional detail into the structure of genome wide alterations that can contribute to allelic imbalance. 相似文献5.
Yuri Kotliarov Serdar Bozdag Hangjiong Cheng Stefan Wuchty Jean-Claude Zenklusen Howard A Fine 《BMC medical genomics》2010,3(1):1-5
Background
Genomic copy number alterations are widely associated with a broad range of human tumors and offer the potential to be used as a diagnostic tool. Especially in the emerging era of personalized medicine medical informatics tools that allow the fast visualization and analysis of genomic alterations of a patient's genomic profile for diagnostic and potential treatment purposes increasingly gain importance.Results
We developed CNAReporter, a software tool that allows users to visualize SNP-specific data obtained from Affymetrix arrays and generate PDF-reports as output. We combined standard algorithms for the analysis of chromosomal alterations, utilizing the widely applied GenePattern framework. As an example, we show genome analyses of two patients with distinctly different CNA profiles using the tool.Conclusions
Glioma subtypes, characterized by different genomic alterations, are often treated differently but can be difficult to differentiate pathologically. CNAReporter offers a user-friendly way to visualize and analyse genomic changes of any given tumor genomic profile, thereby leading to an accurate diagnosis and patient-specific treatment. 相似文献6.
Background
Matched sequencing of both tumor and normal tissue is routinely used to classify variants of uncertain significance (VUS) into somatic vs. germline. However, assays used in molecular diagnostics focus on known somatic alterations in cancer genes and often only sequence tumors. Therefore, an algorithm that reliably classifies variants would be helpful for retrospective exploratory analyses. Contamination of tumor samples with normal cells results in differences in expected allelic fractions of germline and somatic variants, which can be exploited to accurately infer genotypes after adjusting for local copy number. However, existing algorithms for determining tumor purity, ploidy and copy number are not designed for unmatched short read sequencing data.Results
We describe a methodology and corresponding open source software for estimating tumor purity, copy number, loss of heterozygosity (LOH), and contamination, and for classification of single nucleotide variants (SNVs) by somatic status and clonality. This R package, PureCN, is optimized for targeted short read sequencing data, integrates well with standard somatic variant detection pipelines, and has support for matched and unmatched tumor samples. Accuracy is demonstrated on simulated data and on real whole exome sequencing data.Conclusions
Our algorithm provides accurate estimates of tumor purity and ploidy, even if matched normal samples are not available. This in turn allows accurate classification of SNVs. The software is provided as open source (Artistic License 2.0) R/Bioconductor package PureCN (http://bioconductor.org/packages/PureCN/).7.
Fatemeh Kaveh Hege Edvardsen Anne-Lise Børresen-Dale Vessela N Kristensen Hiroko K Solvang 《BMC medical genomics》2011,4(1):1-13
Background
Molecular alterations critical to development of cancer include mutations, copy number alterations (amplifications and deletions) as well as genomic rearrangements resulting in gene fusions. Massively parallel next generation sequencing, which enables the discovery of such changes, uses considerable quantities of genomic DNA (> 5 ug), a serious limitation in ever smaller clinical samples. However, a commonly available microarray platforms such as array comparative genomic hybridization (array CGH) allows the characterization of gene copy number at a single gene resolution using much smaller amounts of genomic DNA. In this study we evaluate the sensitivity of ultra-dense array CGH platforms developed by Agilent, especially that of the 1 million probe array (1 M array), and their application when whole genome amplification is required because of limited sample quantities.Methods
We performed array CGH on whole genome amplified and not amplified genomic DNA from MCF-7 breast cancer cells, using 244 K and 1 M Agilent arrays. The ADM-2 algorithm was used to identify micro-copy number alterations that measured less than 1 Mb in genomic length.Results
DNA from MCF-7 breast cancer cells was analyzed for micro-copy number alterations, defined as measuring less than 1 Mb in genomic length. The 4-fold extra resolution of the 1 M array platform relative to the less dense 244 K array platform, led to the improved detection of copy number variations (CNVs) and micro-CNAs. The identification of intra-genic breakpoints in areas of DNA copy number gain signaled the possible presence of gene fusion events. However, the ultra-dense platforms, especially the densest 1 M array, detect artifacts inherent to whole genome amplification and should be used only with non-amplified DNA samples.Conclusions
This is a first report using 1 M array CGH for the discovery of cancer genes and biomarkers. We show the remarkable capacity of this technology to discover CNVs, micro-copy number alterations and even gene fusions. However, these platforms require excellent genomic DNA quality and do not tolerate relatively small imperfections related to the whole genome amplification. 相似文献8.
Background
Array-based comparative genomic hybridization (aCGH) is a high-throughput method for measuring genome-wide DNA copy number changes. Current aCGH methods have limited resolution, sensitivity and reproducibility. Microarrays for aCGH are available only for a few organisms and combination of aCGH data with expression data is cumbersome.Results
We present a novel method of using commercial oligonucleotide expression microarrays for aCGH, enabling DNA copy number measurements and expression profiles to be combined using the same platform. This method yields aCGH data from genomic DNA without complexity reduction at a median resolution of approximately 17,500 base pairs. Due to the well-defined nature of oligonucleotide probes, DNA amplification and deletion can be defined at the level of individual genes and can easily be combined with gene expression data.Conclusion
A novel method of gene resolution analysis of copy number variation (graCNV) yields high-resolution maps of DNA copy number changes and is applicable to a broad range of organisms for which commercial oligonucleotide expression microarrays are available. Due to the standardization of oligonucleotide microarrays, graCNV results can reliably be compared between laboratories and can easily be combined with gene expression data using the same platform. 相似文献9.
Background
Regional genomic copy number alterations (CNA) are observed in the vast majority of cancers. Besides specifically targeting well-known, canonical oncogenes, CNAs may also play more subtle roles in terms of modulating genetic potential and broad gene expression patterns of developing tumors. Any significant differences in the overall CNA patterns between different cancer types may thus point towards specific biological mechanisms acting in those cancers. In addition, differences among CNA profiles may prove valuable for cancer classifications beyond existing annotation systems.Principal Findings
We have analyzed molecular-cytogenetic data from 25579 tumors samples, which were classified into 160 cancer types according to the International Classification of Disease (ICD) coding system. When correcting for differences in the overall CNA frequencies between cancer types, related cancers were often found to cluster together according to similarities in their CNA profiles. Based on a randomization approach, distance measures from the cluster dendrograms were used to identify those specific genomic regions that contributed significantly to this signal. This approach identified 43 non-neutral genomic regions whose propensity for the occurrence of copy number alterations varied with the type of cancer at hand. Only a subset of these identified loci overlapped with previously implied, highly recurrent (hot-spot) cytogenetic imbalance regions.Conclusions
Thus, for many genomic regions, a simple null-hypothesis of independence between cancer type and relative copy number alteration frequency can be rejected. Since a subset of these regions display relatively low overall CNA frequencies, they may point towards second-tier genomic targets that are adaptively relevant but not necessarily essential for cancer development. 相似文献10.
Hanna G?ransson Karolina Edlund Maria Ryd?ker Markus Rasmussen Johan Winquist Simon Ekman Michael Bergqvist Andrew Thomas Mats Lambe Richard Rosenquist Lars Holmberg Patrick Micke Johan Botling Anders Isaksson 《PloS one》2009,4(6)
Background
Technologies based on DNA microarrays have the potential to provide detailed information on genomic aberrations in tumor cells. In practice a major obstacle for quantitative detection of aberrations is the heterogeneity of clinical tumor tissue. Since tumor tissue invariably contains genetically normal stromal cells, this may lead to a failure to detect aberrations in the tumor cells.Principal Finding
Using SNP array data from 44 non-small cell lung cancer samples we have developed a bioinformatic algorithm that accurately models the fractions of normal and tumor cells in clinical tumor samples. The proportion of normal cells in combination with SNP array data can be used to detect and quantify copy number neutral loss-of-heterozygosity (CNNLOH) in the tumor cells both in crude tumor tissue and in samples enriched for tumor cells by laser capture microdissection.Conclusion
Genome-wide quantitative analysis of CNNLOH using the CNNLOH Quantifier method can help to identify recurrent aberrations contributing to tumor development in clinical tumor samples. In addition, SNP-array based analysis of CNNLOH may become important for detection of aberrations that can be used for diagnostic and prognostic purposes. 相似文献11.
Jarkko Soronen Pirkka-Pekka Laurila Jussi Naukkarinen Ida Surakka Samuli Ripatti Matti Jauhiainen Vesa M Olkkonen Hannele Yki-Järvinen 《BMC medical genomics》2012,5(1):1-13
Background
To elucidate gene expression associated with copy number changes, we performed a genome-wide copy number and expression microarray analysis of 25 pairs of gastric tissues.Methods
We applied laser capture microdissection (LCM) to obtain samples for microarray experiments and profiled DNA copy number and gene expression using 244K CGH Microarray and Human Exon 1.0 ST Microarray.Results
Obviously, gain at 8q was detected at the highest frequency (70%) and 20q at the second (63%). We also identified molecular genetic divergences for different TNM-stages or histological subtypes of gastric cancers. Interestingly, the C20orf11 amplification and gain at 20q13.33 almost separated moderately differentiated (MD) gastric cancers from poorly differentiated (PD) type. A set of 163 genes showing the correlations between gene copy number and expression was selected and the identified genes were able to discriminate matched adjacent noncancerous samples from gastric cancer samples in an unsupervised two-way hierarchical clustering. Quantitative RT-PCR analysis for 4 genes (C20orf11, XPO5, PUF60, and PLOD3) of the 163 genes validated the microarray results. Notably, some candidate genes (MCM4 and YWHAZ) and its adjacent genes such as PRKDC, UBE2V2, ANKRD46, ZNF706, and GRHL2, were concordantly deregulated by genomic aberrations.Conclusions
Taken together, our results reveal diverse chromosomal region alterations for different TNM-stages or histological subtypes of gastric cancers, which is helpful in researching clinicopathological classification, and highlight several interesting genes as potential biomarkers for gastric cancer. 相似文献12.
Lennart Opitz Gabriela Salinas-Riester Marian Grade Klaus Jung Peter Jo Georg Emons B Michael Ghadimi Tim Beißbarth Jochen Gaedcke 《BMC medical genomics》2010,3(1):1-14
Background
Cancer is a heterogeneous disease caused by genomic aberrations and characterized by significant variability in clinical outcomes and response to therapies. Several subtypes of common cancers have been identified based on alterations of individual cancer genes, such as HER2, EGFR, and others. However, cancer is a complex disease driven by the interaction of multiple genes, so the copy number status of individual genes is not sufficient to define cancer subtypes and predict responses to treatments. A classification based on genome-wide copy number patterns would be better suited for this purpose.Method
To develop a more comprehensive cancer taxonomy based on genome-wide patterns of copy number abnormalities, we designed an unsupervised classification algorithm that identifies genomic subgroups of tumors. This algorithm is based on a modified genomic Non-negative Matrix Factorization (gNMF) algorithm and includes several additional components, namely a pilot hierarchical clustering procedure to determine the number of clusters, a multiple random initiation scheme, a new stop criterion for the core gNMF, as well as a 10-fold cross-validation stability test for quality assessment.Result
We applied our algorithm to identify genomic subgroups of three major cancer types: non-small cell lung carcinoma (NSCLC), colorectal cancer (CRC), and malignant melanoma. High-density SNP array datasets for patient tumors and established cell lines were used to define genomic subclasses of the diseases and identify cell lines representative of each genomic subtype. The algorithm was compared with several traditional clustering methods and showed improved performance. To validate our genomic taxonomy of NSCLC, we correlated the genomic classification with disease outcomes. Overall survival time and time to recurrence were shown to differ significantly between the genomic subtypes.Conclusions
We developed an algorithm for cancer classification based on genome-wide patterns of copy number aberrations and demonstrated its superiority to existing clustering methods. The algorithm was applied to define genomic subgroups of three cancer types and identify cell lines representative of these subgroups. Our data enabled the assembly of representative cell line panels for testing drug candidates. 相似文献13.
Background
Tumor single nucleotide polymorphism (SNP) array is a common platform for investigating the cancer genomic aberration and the functionally important altered genes. Original SNP array signals are usually corrupted by noise, and need to be de-convoluted into absolute copy number profile by analytical methods. Unfortunately, in contrast with the popularity of tumor Affymetrix SNP array, the methods that are specifically designed for this platform are still limited. The complicated characteristics of noise in signals is one of the difficulties for dissecting tumor Affymetrix SNP array data, as they inevitably blur the distinction between aberrations and create an obstacle for the copy number aberration (CNA) identification.Results
We propose a tool named TAFFYS for comprehensive analysis of tumor Affymetrix SNP array data. TAFFYS introduce a wavelet-based de-noising approach and copy number-specific signal variance model for suppressing and modelling the noise in signals. Then a hidden Markov model is employed for copy number inference. Finally, by using the absolute copy number profile, statistical significance of each aberration region is calculated in term of different aberration types, including amplification, deletion and loss of heterozygosity (LOH). The result shows that copy number specific-variance model and wavelet de-noising algorithm fits well with the Affymetrix SNP array signals, leading to more accurate estimation for diluted tumor sample (even with only 30% of cancer cells) than other existed methods. Results of examinations also demonstrate a good compatibility and extensibility for different Affymetrix SNP array platforms. Application on the 35 breast tumor samples shows that TAFFYS can automatically dissect the tumor samples and reveal statistically significant aberration regions where cancer-related genes locate.Conclusions
TAFFYS provide an efficient and convenient tool for identifying the copy number alteration and allelic imbalance and assessing the recurrent aberrations for the tumor Affymetrix SNP array data. 相似文献14.
Fan B Dachrut S Coral H Yuen ST Chu KM Law S Zhang L Ji J Leung SY Chen X 《PloS one》2012,7(4):e29824
Background
Genomic instability with frequent DNA copy number alterations is one of the key hallmarks of carcinogenesis. The chromosomal regions with frequent DNA copy number gain and loss in human gastric cancer are still poorly defined. It remains unknown how the DNA copy number variations contributes to the changes of gene expression profiles, especially on the global level.Principal Findings
We analyzed DNA copy number alterations in 64 human gastric cancer samples and 8 gastric cancer cell lines using bacterial artificial chromosome (BAC) arrays based comparative genomic hybridization (aCGH). Statistical analysis was applied to correlate previously published gene expression data obtained from cDNA microarrays with corresponding DNA copy number variation data to identify candidate oncogenes and tumor suppressor genes. We found that gastric cancer samples showed recurrent DNA copy number variations, including gains at 5p, 8q, 20p, 20q, and losses at 4q, 9p, 18q, 21q. The most frequent regions of amplification were 20q12 (7/72), 20q12–20q13.1 (12/72), 20q13.1–20q13.2 (11/72) and 20q13.2–20q13.3 (6/72). The most frequent deleted region was 9p21 (8/72). Correlating gene expression array data with aCGH identified 321 candidate oncogenes, which were overexpressed and showed frequent DNA copy number gains; and 12 candidate tumor suppressor genes which were down-regulated and showed frequent DNA copy number losses in human gastric cancers. Three networks of significantly expressed genes in gastric cancer samples were identified by ingenuity pathway analysis.Conclusions
This study provides insight into DNA copy number variations and their contribution to altered gene expression profiles during human gastric cancer development. It provides novel candidate driver oncogenes or tumor suppressor genes for human gastric cancer, useful pathway maps for the future understanding of the molecular pathogenesis of this malignancy, and the construction of new therapeutic targets. 相似文献15.
Breaking the waves: improved detection of copy number variation from microarray-based comparative genomic hybridization
下载免费PDF全文
![点击此处可从《Genome biology》网站下载免费的PDF全文](/ch/ext_images/free.gif)
Marioni JC Thorne NP Valsesia A Fitzgerald T Redon R Fiegler H Andrews TD Stranger BE Lynch AG Dermitzakis ET Carter NP Tavaré S Hurles ME 《Genome biology》2007,8(10):R228-14
Background
Large-scale high throughput studies using microarray technology have established that copy number variation (CNV) throughout the genome is more frequent than previously thought. Such variation is known to play an important role in the presence and development of phenotypes such as HIV-1 infection and Alzheimer's disease. However, methods for analyzing the complex data produced and identifying regions of CNV are still being refined.Results
We describe the presence of a genome-wide technical artifact, spatial autocorrelation or 'wave', which occurs in a large dataset used to determine the location of CNV across the genome. By removing this artifact we are able to obtain both a more biologically meaningful clustering of the data and an increase in the number of CNVs identified by current calling methods without a major increase in the number of false positives detected. Moreover, removing this artifact is critical for the development of a novel model-based CNV calling algorithm - CNVmix - that uses cross-sample information to identify regions of the genome where CNVs occur. For regions of CNV that are identified by both CNVmix and current methods, we demonstrate that CNVmix is better able to categorize samples into groups that represent copy number gains or losses.Conclusion
Removing artifactual 'waves' (which appear to be a general feature of array comparative genomic hybridization (aCGH) datasets) and using cross-sample information when identifying CNVs enables more biological information to be extracted from aCGH experiments designed to investigate copy number variation in normal individuals. 相似文献16.
Sriram KB Larsen JE Savarimuthu Francis SM Wright CM Clarke BE Duhig EE Brown KM Hayward NK Yang IA Bowman RV Fong KM 《PloS one》2012,7(2):e30398
Background
Primary tumor recurrence commonly occurs after surgical resection of lung squamous cell carcinoma (SCC). Little is known about the genes driving SCC recurrence.Methods
We used array comparative genomic hybridization (aCGH) to identify genes affected by copy number alterations that may be involved in SCC recurrence. Training and test sets of resected primary lung SCC were assembled. aCGH was used to determine genomic copy number in a training set of 62 primary lung SCCs (28 with recurrence and 34 with no evidence of recurrence) and the altered copy number of candidate genes was confirmed by quantitative PCR (qPCR). An independent test set of 72 primary lung SCCs (20 with recurrence and 52 with no evidence of recurrence) was used for biological validation. mRNA expression of candidate genes was studied using qRT-PCR. Candidate gene promoter methylation was evaluated using methylation microarrays and Sequenom EpiTYPER analysis.Results
18q22.3 loss was identified by aCGH as being significantly associated with recurrence (p = 0.038). Seven genes within 18q22.3 had aCGH copy number loss associated with recurrence but only SOCS6 copy number was both technically replicated by qPCR and biologically validated in the test set. SOCS6 copy number loss correlated with reduced mRNA expression in the study samples and in the samples with copy number loss, there was a trend for increased methylation, albeit non-significant. Overall survival was significantly poorer in patients with SOCS6 loss compared to patients without SOCS6 loss in both the training (30 vs. 43 months, p = 0.023) and test set (27 vs. 43 months, p = 0.010).Conclusion
Reduced copy number and mRNA expression of SOCS6 are associated with disease recurrence in primary lung SCC and may be useful prognostic biomarkers. 相似文献17.
Ágnes Baross Allen D Delaney H Irene Li Tarun Nayar Stephane Flibotte Hong Qian Susanna Y Chan Jennifer Asano Adrian Ally Manqiu Cao Patricia Birch Mabel Brown-John Nicole Fernandes Anne Go Giulia Kennedy Sylvie Langlois Patrice Eydoux JM Friedman Marco A Marra 《BMC bioinformatics》2007,8(1):1-18
Background
Genomic deletions and duplications are important in the pathogenesis of diseases, such as cancer and mental retardation, and have recently been shown to occur frequently in unaffected individuals as polymorphisms. Affymetrix GeneChip whole genome sampling analysis (WGSA) combined with 100 K single nucleotide polymorphism (SNP) genotyping arrays is one of several microarray-based approaches that are now being used to detect such structural genomic changes. The popularity of this technology and its associated open source data format have resulted in the development of an increasing number of software packages for the analysis of copy number changes using these SNP arrays.Results
We evaluated four publicly available software packages for high throughput copy number analysis using synthetic and empirical 100 K SNP array data sets, the latter obtained from 107 mental retardation (MR) patients and their unaffected parents and siblings. We evaluated the software with regards to overall suitability for high-throughput 100 K SNP array data analysis, as well as effectiveness of normalization, scaling with various reference sets and feature extraction, as well as true and false positive rates of genomic copy number variant (CNV) detection.Conclusion
We observed considerable variation among the numbers and types of candidate CNVs detected by different analysis approaches, and found that multiple programs were needed to find all real aberrations in our test set. The frequency of false positive deletions was substantial, but could be greatly reduced by using the SNP genotype information to confirm loss of heterozygosity. 相似文献18.
Andrey Loboda Walter K Kraft Bernard Fine Jeffrey Joseph Michael Nebozhyn Chunsheng Zhang Yudong He Xia Yang Christopher Wright Mark Morris Ira Chalikonda Mark Ferguson Valur Emilsson Amy Leonardson John Lamb Hongyue Dai Eric Schadt Howard E Greenberg Pek Yee Lum 《BMC medical genomics》2009,2(1):1-16
19.
20.
Rebecca F. Halperin John D. Carpten Zarko Manojlovic Jessica Aldrich Jonathan Keats Sara Byron Winnie S. Liang Megan Russell Daniel Enriquez Ana Claasen Irene Cherni Baffour Awuah Joseph Oppong Max S. Wicha Lisa A. Newman Evelyn Jaigge Seungchan Kim David W. Craig 《BMC medical genomics》2017,10(1):61