首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 906 毫秒
1.
Major efforts are underway to systematically define the somatic and germline genetic variations causally associated with disease. Genome-wide genetic analysis of actual clinical samples is, however, limited by the paucity of genomic DNA available. Here we have tested the fidelity and genome representation of phi29 polymerase-based genome amplification (phi29MDA) using direct sequencing and high density oligonucleotide arrays probing >10,000 SNP alleles. Genome representation was comprehensive and estimated to be 99.82% complete, although six regions encompassing a maximum of 5.62 Mb failed to amplify. There was no degradation in the accuracy of SNP genotyping and, in direct sequencing experiments sampling 500,000 bp, the estimated error rate (9.5 x 10(-6)) was the same as in paired unamplified samples. The detection of cancer-associated loss of heterozygosity and copy number changes, including homozygous deletion and gene amplification, were similarly robust. These results suggest that phi29MDA yields high fidelity, near-complete genome representation suitable for high resolution genetic analysis.  相似文献   

2.
We present a protocol for reliably detecting DNA copy number aberrations in a single human cell. Multiple displacement-amplified DNAs of a cell are hybridized to a 3,000-bacterial artificial chromosome (BAC) array and to an Affymetrix 250,000 (250K)-SNP array. Subsequent copy number calling is based on the integration of BAC probe-specific copy number probabilities that are estimated by comparing probe intensities with a single-cell whole-genome amplification (WGA) reference model for diploid chromosomes, as well as SNP copy number and loss-of-heterozygosity states estimated by hidden Markov models (HMM). All methods for detecting DNA copy number aberrations in single human cells have difficulty in confidently discriminating WGA artifacts from true genetic variants. Furthermore, some methods lack thorough validation for segmental DNA imbalance detection. Our protocol minimizes false-positive variant calling and enables uniparental isodisomy detection in single cells. Additionally, it provides quality assessment, allowing the exclusion of uninterpretable single-cell WGA samples. The protocol takes 5-7 d.  相似文献   

3.

Background

Tumor single nucleotide polymorphism (SNP) array is a common platform for investigating the cancer genomic aberration and the functionally important altered genes. Original SNP array signals are usually corrupted by noise, and need to be de-convoluted into absolute copy number profile by analytical methods. Unfortunately, in contrast with the popularity of tumor Affymetrix SNP array, the methods that are specifically designed for this platform are still limited. The complicated characteristics of noise in signals is one of the difficulties for dissecting tumor Affymetrix SNP array data, as they inevitably blur the distinction between aberrations and create an obstacle for the copy number aberration (CNA) identification.

Results

We propose a tool named TAFFYS for comprehensive analysis of tumor Affymetrix SNP array data. TAFFYS introduce a wavelet-based de-noising approach and copy number-specific signal variance model for suppressing and modelling the noise in signals. Then a hidden Markov model is employed for copy number inference. Finally, by using the absolute copy number profile, statistical significance of each aberration region is calculated in term of different aberration types, including amplification, deletion and loss of heterozygosity (LOH). The result shows that copy number specific-variance model and wavelet de-noising algorithm fits well with the Affymetrix SNP array signals, leading to more accurate estimation for diluted tumor sample (even with only 30% of cancer cells) than other existed methods. Results of examinations also demonstrate a good compatibility and extensibility for different Affymetrix SNP array platforms. Application on the 35 breast tumor samples shows that TAFFYS can automatically dissect the tumor samples and reveal statistically significant aberration regions where cancer-related genes locate.

Conclusions

TAFFYS provide an efficient and convenient tool for identifying the copy number alteration and allelic imbalance and assessing the recurrent aberrations for the tumor Affymetrix SNP array data.  相似文献   

4.

Background

Molecular alterations critical to development of cancer include mutations, copy number alterations (amplifications and deletions) as well as genomic rearrangements resulting in gene fusions. Massively parallel next generation sequencing, which enables the discovery of such changes, uses considerable quantities of genomic DNA (> 5 ug), a serious limitation in ever smaller clinical samples. However, a commonly available microarray platforms such as array comparative genomic hybridization (array CGH) allows the characterization of gene copy number at a single gene resolution using much smaller amounts of genomic DNA. In this study we evaluate the sensitivity of ultra-dense array CGH platforms developed by Agilent, especially that of the 1 million probe array (1 M array), and their application when whole genome amplification is required because of limited sample quantities.

Methods

We performed array CGH on whole genome amplified and not amplified genomic DNA from MCF-7 breast cancer cells, using 244 K and 1 M Agilent arrays. The ADM-2 algorithm was used to identify micro-copy number alterations that measured less than 1 Mb in genomic length.

Results

DNA from MCF-7 breast cancer cells was analyzed for micro-copy number alterations, defined as measuring less than 1 Mb in genomic length. The 4-fold extra resolution of the 1 M array platform relative to the less dense 244 K array platform, led to the improved detection of copy number variations (CNVs) and micro-CNAs. The identification of intra-genic breakpoints in areas of DNA copy number gain signaled the possible presence of gene fusion events. However, the ultra-dense platforms, especially the densest 1 M array, detect artifacts inherent to whole genome amplification and should be used only with non-amplified DNA samples.

Conclusions

This is a first report using 1 M array CGH for the discovery of cancer genes and biomarkers. We show the remarkable capacity of this technology to discover CNVs, micro-copy number alterations and even gene fusions. However, these platforms require excellent genomic DNA quality and do not tolerate relatively small imperfections related to the whole genome amplification.  相似文献   

5.

Background

Technologies based on DNA microarrays have the potential to provide detailed information on genomic aberrations in tumor cells. In practice a major obstacle for quantitative detection of aberrations is the heterogeneity of clinical tumor tissue. Since tumor tissue invariably contains genetically normal stromal cells, this may lead to a failure to detect aberrations in the tumor cells.

Principal Finding

Using SNP array data from 44 non-small cell lung cancer samples we have developed a bioinformatic algorithm that accurately models the fractions of normal and tumor cells in clinical tumor samples. The proportion of normal cells in combination with SNP array data can be used to detect and quantify copy number neutral loss-of-heterozygosity (CNNLOH) in the tumor cells both in crude tumor tissue and in samples enriched for tumor cells by laser capture microdissection.

Conclusion

Genome-wide quantitative analysis of CNNLOH using the CNNLOH Quantifier method can help to identify recurrent aberrations contributing to tumor development in clinical tumor samples. In addition, SNP-array based analysis of CNNLOH may become important for detection of aberrations that can be used for diagnostic and prognostic purposes.  相似文献   

6.
Major efforts are underway to systematically define the somatic and germline genetic variations causally associated with disease. Genome-wide genetic analysis of actual clinical samples is, however, limited by the paucity of genomic DNA available. Here we have tested the fidelity and genome representation of 29 polymerase-based genome amplification (29MDA) using direct sequencing and high density oligonucleotide arrays probing >10000 SNP alleles. Genome representation was comprehensive and estimated to be 99.82% complete, although six regions encompassing a maximum of 5.62 Mb failed to amplify. There was no degradation in the accuracy of SNP genotyping and, in direct sequencing experiments sampling 500 000 bp, the estimated error rate (9.5 × 10–6) was the same as in paired unamplified samples. The detection of cancer-associated loss of heterozygosity and copy number changes, including homozygous deletion and gene amplification, were similarly robust. These results suggest that 29MDA yields high fidelity, near-complete genome representation suitable for high resolution genetic analysis.  相似文献   

7.
Copy number variations (CNVs) in the human genome are conventionally detected using high-throughput scanning technologies, such as comparative genomic hybridization and high-density single nucleotide polymorphism (SNP) microarrays, or relatively low-throughput techniques, such as quantitative polymerase chain reaction (PCR). All these approaches are limited in resolution and can at best distinguish a twofold (or 50%) difference in copy number. We have developed a new technology to study copy numbers using a platform known as the digital array, a nanofluidic biochip capable of accurately quantitating genes of interest in DNA samples. We have evaluated the digital array's performance using a model system, to show that this technology is exquisitely sensitive, capable of differentiating as little as a 15% difference in gene copy number (or between 6 and 7 copies of a target gene). We have also analyzed commercial DNA samples for their CYP2D6 copy numbers and confirmed that our results were consistent with those obtained independently using conventional techniques. In a screening experiment with breast cancer and normal DNA samples, the ERBB2 gene was found to be amplified in about 35% of breast cancer samples. The use of the digital array enables accurate measurement of gene copy numbers and is of significant value in CNV studies.  相似文献   

8.

Background

DNA copy number alterations are one of the main characteristics of the cancer cell karyotype and can contribute to the complex phenotype of these cells. These alterations can lead to gains in cellular oncogenes as well as losses in tumor suppressor genes and can span small intervals as well as involve entire chromosomes. The ability to accurately detect these changes is central to understanding how they impact the biology of the cell.

Results

We describe a novel algorithm called CARAT (Copy Number Analysis with Regression And Tree) that uses probe intensity information to infer copy number in an allele-specific manner from high density DNA oligonuceotide arrays designed to genotype over 100, 000 SNPs. Total and allele-specific copy number estimations using CARAT are independently evaluated for a subset of SNPs using quantitative PCR and allelic TaqMan reactions with several human breast cancer cell lines. The sensitivity and specificity of the algorithm are characterized using DNA samples containing differing numbers of X chromosomes as well as a test set of normal individuals. Results from the algorithm show a high degree of agreement with results from independent verification methods.

Conclusion

Overall, CARAT automatically detects regions with copy number variations and assigns a significance score to each alteration as well as generating allele-specific output. When coupled with SNP genotype calls from the same array, CARAT provides additional detail into the structure of genome wide alterations that can contribute to allelic imbalance.  相似文献   

9.
10.
Affymetrix SNP arrays have been widely used for single-nucleotide polymorphism (SNP) genotype calling and DNA copy number variation inference. Although numerous methods have achieved high accuracy in these fields, most studies have paid little attention to the modeling of hybridization of probes to off-target allele sequences, which can affect the accuracy greatly. In this study, we address this issue and demonstrate that hybridization with mismatch nucleotides (HWMMN) occurs in all SNP probe-sets and has a critical effect on the estimation of allelic concentrations (ACs). We study sequence binding through binding free energy and then binding affinity, and develop a probe intensity composite representation (PICR) model. The PICR model allows the estimation of ACs at a given SNP through statistical regression. Furthermore, we demonstrate with cell-line data of known true copy numbers that the PICR model can achieve reasonable accuracy in copy number estimation at a single SNP locus, by using the ratio of the estimated AC of each sample to that of the reference sample, and can reveal subtle genotype structure of SNPs at abnormal loci. We also demonstrate with HapMap data that the PICR model yields accurate SNP genotype calls consistently across samples, laboratories and even across array platforms.  相似文献   

11.

Background

Genomic deletions and duplications are important in the pathogenesis of diseases, such as cancer and mental retardation, and have recently been shown to occur frequently in unaffected individuals as polymorphisms. Affymetrix GeneChip whole genome sampling analysis (WGSA) combined with 100 K single nucleotide polymorphism (SNP) genotyping arrays is one of several microarray-based approaches that are now being used to detect such structural genomic changes. The popularity of this technology and its associated open source data format have resulted in the development of an increasing number of software packages for the analysis of copy number changes using these SNP arrays.

Results

We evaluated four publicly available software packages for high throughput copy number analysis using synthetic and empirical 100 K SNP array data sets, the latter obtained from 107 mental retardation (MR) patients and their unaffected parents and siblings. We evaluated the software with regards to overall suitability for high-throughput 100 K SNP array data analysis, as well as effectiveness of normalization, scaling with various reference sets and feature extraction, as well as true and false positive rates of genomic copy number variant (CNV) detection.

Conclusion

We observed considerable variation among the numbers and types of candidate CNVs detected by different analysis approaches, and found that multiple programs were needed to find all real aberrations in our test set. The frequency of false positive deletions was substantial, but could be greatly reduced by using the SNP genotype information to confirm loss of heterozygosity.  相似文献   

12.
Breast cancer is a widespread disease in Japan and across the world. Breast cancer cells, as well as most other types of cancer cells, have diverse chromosomal aberrations. Clarifying the character of these chromosomal aberrations should contribute to the development of more suitable therapies, along with the predictions of metastasis and prognosis. Twenty-four breast cancer cell lines were analyzed by bacterial artificial chromosome (BAC) array comparative genomic hybridization (CGH). The array slide contained duplicate spots of 4030 BAC clone DNAs covering the entire human genome with 1 Mbp resolution. In all 24 breast cancer cell lines, frequent and significant amplifications as well as deletions were detected by BAC array CGH. Common DNA copy number gains, detected in 60% (above 15 cell lines) of the 24 breast cancer cell lines were found in 76 BAC clones, located at 1q, 5p, 8q, 9p, 16p, 17q, and 20q. Moreover, common DNA copy number loss was detected in 136 BAC clones, located at 1q, 2q, 3p, 4p, 6q, 8p, 9p, 11p, 13q, 17p, 18q, 19p, Xp, and Xq. The DNA copy number abnormalities found included abnormality of the well-known oncogene cMYC (8q24.21); however, most of them were not reported to relate to breast cancer. BAC array CGH has great potential to detect DNA copy number abnormalities, and has revealed that breast cancer cell lines have substantial heterogeneity.  相似文献   

13.
Neuroblastomas (NBs) are tumours of the sympathetic nervous system accounting for 8–10% of paediatric cancers. NBs exhibit extensive intertumour genetic heterogeneity, but their extent of intratumour genetic diversity has remained unexplored. We aimed to assess intratumour genetic variation in NBs with a focus on whole chromosome changes and their underlying mechanism. Allelic ratios obtained by SNP-array data from 30 aneuploid primary NBs and NB cell lines were used to quantify the size of clones harbouring specific genomic imbalances. In 13 cases, this was supplemented by fluorescence in situ hybridisation to assess copy number diversity in detail. Computer simulations of different mitotic segregation errors, single cell cloning, analysis of mitotic figures, and time lapse imaging of dividing NB cells were used to infer the most likely mechanism behind intratumour variation in chromosome number. Combined SNP array and FISH analyses showed that all cases exhibited higher inter-cellular copy number variation than non-neoplastic control tissue, with up to 75% of tumour cells showing non-modal chromosome copy numbers. Comparisons of copy number profiles, resulting from simulations of different segregation errors to genomic profiles of 120 NBs indicated that loss of chromosomes from a tetraploid state was more likely than other mechanisms to explain numerical aberrations in NB. This was supported by a high frequency of lagging chromosomes at anaphase and polyploidisation events in growing NB cells. The dynamic nature of numerical aberrations was corroborated further by detecting substantial copy number diversity in cell populations grown from single NB cells. We conclude that aneuploid NBs typically show extensive intratumour chromosome copy number diversity, and that this phenomenon is most likely explained by continuous loss of chromosomes from a polyploid state.  相似文献   

14.

Background

DNA sequence diversity within the human genome may be more greatly affected by copy number variations (CNVs) than single nucleotide polymorphisms (SNPs). Although the importance of CNVs in genome wide association studies (GWAS) is becoming widely accepted, the optimal methods for identifying these variants are still under evaluation. We have previously reported a comprehensive view of CNVs in the HapMap DNA collection using high density 500 K EA (Early Access) SNP genotyping arrays which revealed greater than 1,000 CNVs ranging in size from 1 kb to over 3 Mb. Although the arrays used most commonly for GWAS predominantly interrogate SNPs, CNV identification and detection does not necessarily require the use of DNA probes centered on polymorphic nucleotides and may even be hindered by the dependence on a successful SNP genotyping assay.

Results

In this study, we have designed and evaluated a high density array predicated on the use of non-polymorphic oligonucleotide probes for CNV detection. This approach effectively uncouples copy number detection from SNP genotyping and thus has the potential to significantly improve probe coverage for genome-wide CNV identification. This array, in conjunction with PCR-based, complexity-reduced DNA target, queries over 1.3 M independent NspI restriction enzyme fragments in the 200 bp to 1100 bp size range, which is a several fold increase in marker density as compared to the 500 K EA array. In addition, a novel algorithm was developed and validated to extract CNV regions and boundaries.

Conclusion

Using a well-characterized pair of DNA samples, close to 200 CNVs were identified, of which nearly 50% appear novel yet were independently validated using quantitative PCR. The results indicate that non-polymorphic probes provide a robust approach for CNV identification, and the increasing precision of CNV boundary delineation should allow a more complete analysis of their genomic organization.  相似文献   

15.
利用SNP数据检测肿瘤细胞染色体拷贝数变异是癌症相关研究的一个热点,目前已有多种方法可以通过分析SNP array数据检测染色体拷贝数。然而在某些情况下,这些检测方法检测结果与真实拷贝数具有一定错误率。目前并没有方法研究预测结果发生错误的规律。本文分别分析了GPHMM,ASCAT两种检测方法结果信息熵与检测正确率的关系,发现检测正确率与信息熵存在很强的相关性。通过对比不同肿瘤细胞比例下信息熵与正确率关系,本文发现随着肿瘤细胞比例的增大,检测结果信息熵平均值增大,方差减小;同时平均检测正确率也越来越大,方差显著减小。这些结果显示信息熵的大小可以反映出检测结果正确率的高低。最后,本文以高肿瘤细胞比例下拷贝数检测结果为例,研究了在变异类型单一,信息熵小的情况下,染色体倍性检测的正确率。结果表明信息熵可以作为衡量检测结果可信度的指标:即信息熵越高,检测结果越可信。  相似文献   

16.
Array comparative genomic hybridization (array CGH) allows the genome-wide analysis of copy number changes at a high resolution. In the last decade, such copy number aberrations have been found frequently and in large quantities in tumor genomes. Alterations in the array CGH profile of tumor DNA indicate the location of tumor suppressor or proto-oncogenes, thereby enabling identification of cancer-relevant genes. In addition, patterns of aberrations have been detected that allow the molecular subclassification of certain tumor types with diagnostic significance. Array CGH analyses have also been instrumental in identifying new prognostic markers. In the future, data evaluation by integrated approaches, including other molecular levels and the selective use of chromosome and tumor-specific microarrays, will be of particular importance.  相似文献   

17.
There is an increasing interest in using single nucleotide polymorphism (SNP) genotyping arrays for profiling chromosomal rearrangements in tumors, as they allow simultaneous detection of copy number and loss of heterozygosity with high resolution. Critical issues such as signal baseline shift due to aneuploidy, normal cell contamination, and the presence of GC content bias have been reported to dramatically alter SNP array signals and complicate accurate identification of aberrations in cancer genomes. To address these issues, we propose a novel Global Parameter Hidden Markov Model (GPHMM) to unravel tangled genotyping data generated from tumor samples. In contrast to other HMM methods, a distinct feature of GPHMM is that the issues mentioned above are quantitatively modeled by global parameters and integrated within the statistical framework. We developed an efficient EM algorithm for parameter estimation. We evaluated performance on three data sets and show that GPHMM can correctly identify chromosomal aberrations in tumor samples containing as few as 10% cancer cells. Furthermore, we demonstrated that the estimation of global parameters in GPHMM provides information about the biological characteristics of tumor samples and the quality of genotyping signal from SNP array experiments, which is helpful for data quality control and outlier detection in cohort studies.  相似文献   

18.

Background

Numerous efforts have been made to elucidate the etiology and improve the treatment of lung cancer, but the overall five-year survival rate is still only 15%. Although cigarette smoking is the primary risk factor for lung cancer, only 7% of female lung cancer patients in Taiwan have a history of smoking. Since cancer results from progressive accumulation of genetic aberrations, genomic rearrangements may be early events in carcinogenesis.

Results

In order to identify biomarkers of early-stage adenocarcinoma, the genome-wide DNA aberrations of 60 pairs of lung adenocarcinoma and adjacent normal lung tissue in non-smoking women were examined using Affymetrix Genome-Wide Human SNP 6.0 arrays. Common copy number variation (CNV) regions were identified by ≥30% of patients with copy number beyond 2 ± 0.5 of copy numbers for each single nucleotide polymorphism (SNP) and at least 100 continuous SNP variant loci. SNPs associated with lung adenocarcinoma were identified by McNemar’s test. Loss of heterozygosity (LOH) SNPs were identified in ≥18% of patients with LOH in the locus. Aberration of SNP rs10248565 at HDAC9 in chromosome 7p21.1 was identified from concurrent analyses of CNVs, SNPs, and LOH.

Conclusion

The results elucidate the genetic etiology of lung adenocarcinoma by demonstrating that SNP rs10248565 may be a potential biomarker of cancer susceptibility.  相似文献   

19.

Background

In single-cell human genome analysis using whole-genome amplified product, a strong amplification bias involving allele dropout and preferential amplification hampers the quality of results. Using an oligonucleotide single nucleotide polymorphism (SNP) array, we systematically examined the nature of this amplification bias, including frequency, degree, and preference for genomic location, and we assessed the effects of this amplification bias on subsequent genotype and chromosomal copy number analyses.

Methodology/Principal Findings

We found a large variability in amplification bias among the amplified products obtained by multiple displacement amplification (MDA), and this bias had a severe effect on the genotype and chromosomal copy number analyses. We established optimal experimental conditions for pre-screening for high-quality amplified products, processing array data, and analyzing chromosomal structural alterations. Using this optimized protocol, we successfully detected previously unidentified chromosomal structural alterations in single cells from a lymphoblastoid cell line. These alterations were subsequently confirmed by karyotype analysis. In addition, we successfully obtained reproducible chromosomal copy number profiles of single cells from the cell line with a complex karyotype, indicating the applicability and potential of our optimized workflow.

Conclusions/Significance

Our results suggest that the quality of amplification products should be critically assessed before using them for genomic analyses. The method of MDA-based whole-genome amplification followed by SNP array analysis described here will be useful for exploring chromosomal alterations in single cells.  相似文献   

20.
Array comparative genomic hybridization (CGH) enables, without the need for cell culture, the detection of changes in copy numbers with high accuracy below the resolution of standard chromosome analysis. The implementation of array?CGH in prenatal diagnosis has been hesitant in spite of these obvious advantages. This has been predominantly due to the likelihood of finding copy number variations (CNVs) of uncertain clinical significance. In prenatal diagnosis array?CGH should not be offered as a first tier but as an adjunct to standard diagnostic procedures in order to minimize uncertainty. Indications for the use of array?CGH will be defined and substantiated in the present article. Guidelines should be established at each laboratory regarding the minimum size of CNVs to be assessed and the genomic regions considered clinically significant.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号