首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Multiple regions of the genome are often amplified during breast cancer development and progression, as evidenced in a number of published studies by comparative genomic hybridization (CGH). However, only relatively few target genes for such amplifications have been identified. Here, we indicate how small-scale commercially available cDNA and CGH microarray formats combined with the tissue microarray technology enable rapid identification of putative amplification target genes as well as analysis of their clinical significance. According to CGH, the SUM-52 breast cancer cell line harbors several high-level DNA amplification sites, including the 10q26 chromosomal region where the fibroblast growth factor receptor 2 (FGFR2) gene has been localized. High level amplification of FGFR2 in SUM-52 was identified using CGH analysis on a microarray of BAC clones. A cDNA microarray survey of 588 genes showed >40-fold overexpression of FGFR2. Finally, a tissue microarray based FISH analysis of 750 uncultured primary breast cancers demonstrated in vivo amplification of the FGFR2 gene in about 1% of the tumors. In conclusion, three consecutive microarray (CGH, cDNA and tissue) experiments revealed high-level amplification and overexpression of the FGFR2 in a breast cancer cell line, but only a low frequency of involvement in primary breast tumors. Applied to a genomic scale with larger arrays, this strategy should facilitate identification of the most important target genes for cytogenetic rearrangements, such as DNA amplification sites detected by conventional CGH. Figures on http://www.esacp.org/acp/2001/22-4/heiskanen.htm  相似文献   

2.
Chromosomal amplifications and deletions are critical components of tumorigenesis and DNA copy-number variations also correlate with changes in mRNA expression levels. Genome-wide microarray comparative genomic hybridization (CGH) has become an important method for detecting and mapping chromosomal changes in tumors. Thus, the ability to detect twofold differences in fluorescent intensity between samples on microarrays depends on the generation of high-quality labeled probes. To enhance array-based CGH analysis, a random prime genomic DNA labeling method optimized for improved sensitivity, signal-to-noise ratios, and reproducibility has been developed. The labeling system comprises formulated random primers, nucleotide mixtures, and notably a high concentration of the double mutant exo-large fragment of DNA polymerase I (exo-Klenow). Microarray analyses indicate that the genomic DNA-labeled templates yield hybridization signals with higher fluorescent intensities and greater signal-to-noise ratios and detect more positive features than the standard random prime and conventional nick translation methods. Also, templates generated by this system have detected twofold differences in gene copy number between male and female genomic DNA and identified amplification and deletions from the BT474 breast cancer cell line in microarray hybridizations. Moreover, alterations in gene copy number were routinely detected with 0.5 microg of genomic DNA starting sample. The method is flexible and performs efficiently with different fluorescently labeled nucleotides. Application of the optimized CGH labeling system may enhance the resolution and sensitivity of array-based CGH analysis in cancer and medical genetic studies.  相似文献   

3.
MOTIVATION: To understand cancer etiology, it is important to explore molecular changes in cellular processes from normal state to cancerous state. Because genes interact with each other during cellular processes, carcinogenesis related genes may form differential co-expression patterns with other genes in different cell states. In this study, we develop a statistical method for identifying differential gene-gene co-expression patterns in different cell states. RESULTS: For efficient pattern recognition, we extend the traditional F-statistic and obtain an Expected Conditional F-statistic (ECF-statistic), which incorporates statistical information of location and correlation. We also propose a statistical method for data transformation. Our approach is applied to a microarray gene expression dataset for prostate cancer study. For a gene of interest, our method can select other genes that have differential gene-gene co-expression patterns with this gene in different cell states. The 10 most frequently selected genes, include hepsin, GSTP1 and AMACR, which have recently been proposed to be associated with prostate carcinogenesis. However, genes GSTP1 and AMACR cannot be identified by studying differential gene expression alone. By using tumor suppressor genes TP53, PTEN and RB1, we identify seven genes that also include hepsin, GSTP1 and AMACR. We show that genes associated with cancer may have differential gene-gene expression patterns with many other genes in different cell states. By discovering such patterns, we may be able to identify carcinogenesis related genes.  相似文献   

4.
The interplay between copy number variation (CNV) and differential gene expression may be able to shed light on molecular process underlying breast cancer and lead to the discovery of cancer-related genes. In the current study, genes concurrently identified in array comparative genomic hybridization (CGH) and gene expression microarrays were used to derive gene signatures for Han Chinese breast cancers.We performed 23 array CGHs and 81 gene expression microarrays in breast cancer samples from Taiwanese women. Genes with coherent patterns of both CNV and differential gene expression were identified from the 21 samples assayed using both platforms. We used these genes to derive signatures associated with clinical ER and HER2 status and disease-free survival.Distributions of signature genes were strongly associated with chromosomal location: chromosome 16 for ER and 17 for HER2. A breast cancer risk predictive model was built based on the first supervised principal component from 16 genes (RCAN3, MCOLN2, DENND2D, RWDD3, ZMYM6, CAPZA1, GPR18, WARS2, TRIM45, SCRN1, CSNK1E, HBXIP, CSDE1, MRPL20, IKZF1, and COL20A1), and distinct survival patterns were observed between the high- and low-risk groups from the combined dataset of 408 microarrays. The risk score was significantly higher in breast cancer patients with recurrence, metastasis, or mortality than in relapse-free individuals (0.241 versus 0, P<0.001). The concurrent gene risk predictive model remained discriminative across distinct clinical ER and HER2 statuses in subgroup analysis. Prognostic comparisons with published gene expression signatures showed a better discerning ability of concurrent genes, many of which were rarely identifiable if expression data were pre-selected by phenotype correlations or variability of individual genes.We conclude that parallel analysis of CGH and microarray data, in conjunction with known gene expression patterns, can be used to identify biomarkers with prognostic values in breast cancer.  相似文献   

5.
MOTIVATION: The analysis of gene expression data in its chromosomal context has been a recent development in cancer research. However, currently available methods fail to account for variation in the distance between genes, gene density and genomic features (e.g. GC content) in identifying increased or decreased chromosomal regions of gene expression. RESULTS: We have developed a model-based scan statistic that accounts for these aspects of the complex landscape of the human genome in the identification of extreme chromosomal regions of gene expression. This method may be applied to gene expression data regardless of the microarray platform used to generate it. To demonstrate the accuracy and utility of this method, we applied it to a breast cancer gene expression dataset and tested its ability to predict regions containing medium-to-high level DNA amplification (DNA ratio values >2). A classifier was developed from the scan statistic results that had a 10-fold cross-validated classification rate of 93% and a positive predictive value of 88%. This result strongly suggests that the model-based scan statistic and the expression characteristics of an increased chromosomal region of gene expression can be used to accurately predict chromosomal regions containing amplified genes. AVAILABILITY: Functions in the R-language are available from the author upon request. CONTACT: fcouples@umich.edu.  相似文献   

6.
Summary .  The central dogma of molecular biology relates DNA with mRNA. Array CGH measures DNA copy number and gene expression microarrays measure the amount of mRNA. Methods that integrate data from these two platforms may uncover meaningful biological relationships that further our understanding of cancer. We develop nonparametric tests for the detection of copy number induced differential gene expression. The tests incorporate the uncertainty of the calling of genomic aberrations. The test is preceded by a "tuning algorithm" that discards certain genes to improve the overall power of the false discovery rate selection procedure. Moreover, the test statistics are "shrunken" to borrow information across neighboring genes that share the same array CGH signature. For each gene we also estimate its effect, its amount of differential expression due to copy number changes, and calculate the coefficient of determination. The method is illustrated on breast cancer data, in which it confirms previously reported findings, now with a more profound statistical underpinning.  相似文献   

7.

Background

Chromosomal breakage followed by faulty DNA repair leads to gene amplifications and deletions in cancers. However, the mere assessment of the extent of genomic changes, amplifications and deletions may reduce the complexity of genomic data observed by array comparative genomic hybridization (array CGH). We present here a novel approach to array CGH data analysis, which focuses on putative breakpoints responsible for rearrangements within the genome.

Results

We performed array comparative genomic hybridization in 29 primary tumors from high risk patients with breast cancer. The specimens were flow sorted according to ploidy to increase tumor cell purity prior to array CGH. We describe the number of chromosomal breaks as well as the patterns of breaks on individual chromosomes in each tumor. There were differences in chromosomal breakage patterns between the 3 clinical subtypes of breast cancers, although the highest density of breaks occurred at chromosome 17 in all subtypes, suggesting a particular proclivity of this chromosome for breaks. We also observed chromothripsis affecting various chromosomes in 41% of high risk breast cancers.

Conclusions

Our results provide a new insight into the genomic complexity of breast cancer. Genomic instability dependent on chromosomal breakage events is not stochastic, targeting some chromosomes clearly more than others. We report a much higher percentage of chromothripsis than described previously in other cancers and this suggests that massive genomic rearrangements occurring in a single catastrophic event may shape many breast cancer genomes.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-579) contains supplementary material, which is available to authorized users.  相似文献   

8.
DNA microarray gene expression and microarray-based comparative genomic hybridization (aCGH) have been widely used for biomedical discovery. Because of the large number of genes and the complex nature of biological networks, various analysis methods have been proposed. One such method is "gene shaving," a procedure which identifies subsets of the genes with coherent expression patterns and large variation across samples. Since combining genomic information from multiple sources can improve classification and prediction of diseases, in this paper we proposed a new method, "ICA gene shaving" (ICA, independent component analysis), for jointly analyzing gene expression and copy number data. First we used ICA to analyze joint measurements, gene expression and copy number, of a biological system and project the data onto statistically independent biological processes. Next, we used these results to identify patterns of variation in the data and then applied an iterative shaving method. We investigated the properties of our proposed method by analyzing both simulated and real data. We demonstrated that the robustness of our method to noise using simulated data. Using breast cancer data, we showed that our method is superior to the Generalized Singular Value Decomposition (GSVD) gene shaving method for identifying genes associated with breast cancer.  相似文献   

9.
MOTIVATION: Chromosomal copy number changes (aneuploidies) are common in cell populations that undergo multiple cell divisions including yeast strains, cell lines and tumor cells. Identification of aneuploidies is critical in evolutionary studies, where changes in copy number serve an adaptive purpose, as well as in cancer studies, where amplifications and deletions of chromosomal regions have been identified as a major pathogenetic mechanism. Aneuploidies can be studied on whole-genome level using array CGH (a microarray-based method that measures the DNA content), but their presence also affects gene expression. In gene expression microarray analysis, identification of copy number changes is especially important in preventing aberrant biological conclusions based on spurious gene expression correlation or masked phenotypes that arise due to aneuploidies. Previously suggested approaches for aneuploidy detection from microarray data mostly focus on array CGH, address only whole-chromosome or whole-arm copy number changes, and rely on thresholds or other heuristics, making them unsuitable for fully automated general application to gene expression datasets. There is a need for a general and robust method for identification of aneuploidies of any size from both array CGH and gene expression microarray data. RESULTS: We present ChARM (Chromosomal Aberration Region Miner), a robust and accurate expectation-maximization based method for identification of segmental aneuploidies (partial chromosome changes) from gene expression and array CGH microarray data. Systematic evaluation of the algorithm on synthetic and biological data shows that the method is robust to noise, aneuploidal segment size and P-value cutoff. Using our approach, we identify known chromosomal changes and predict novel potential segmental aneuploidies in commonly used yeast deletion strains and in breast cancer. ChARM can be routinely used to identify aneuploidies in array CGH datasets and to screen gene expression data for aneuploidies or array biases. Our methodology is sensitive enough to detect statistically significant and biologically relevant aneuploidies even when expression or DNA content changes are subtle as in mixed populations of cells. AVAILABILITY: Code available by request from the authors and on Web supplement at http://function.cs.princeton.edu/ChARM/  相似文献   

10.
Microarray techniques using cDNA array and comparative genomic hybridization (CGH) have been developed for several discovery applications. They are frequently applied for the prediction and diagnosis of cancer in recent years. Many studies have shown that integrating genomic data from different sources may increase the reliability of gene expression analysis results in understanding cancer progression. Therefore, developing a good prognostic model dealing simultaneously with different types of dataset is important. The challenge with these types of data is high background noise. We describe an analytical two-stage framework with a multi-parallel data analysis method named wavelet-based generalized singular value decomposition and shaving method (WGSVD-shaving). This method is proposed for de-noising and dimension-reduction during early stage prognosis modeling. We also applied a supervised gene clustering technique with penalized logistic regression with Cox-model on an integrated data. We show the accuracy of the method using a simulated dataset with a case study on Hepatocelluar Carcinoma (HCC) cDNA and CGH data. The method shows improved results from GSVD-shaving and has application in the discovery of candidate genes associated with cancer.  相似文献   

11.
12.
13.

Background

Molecular alterations critical to development of cancer include mutations, copy number alterations (amplifications and deletions) as well as genomic rearrangements resulting in gene fusions. Massively parallel next generation sequencing, which enables the discovery of such changes, uses considerable quantities of genomic DNA (> 5 ug), a serious limitation in ever smaller clinical samples. However, a commonly available microarray platforms such as array comparative genomic hybridization (array CGH) allows the characterization of gene copy number at a single gene resolution using much smaller amounts of genomic DNA. In this study we evaluate the sensitivity of ultra-dense array CGH platforms developed by Agilent, especially that of the 1 million probe array (1 M array), and their application when whole genome amplification is required because of limited sample quantities.

Methods

We performed array CGH on whole genome amplified and not amplified genomic DNA from MCF-7 breast cancer cells, using 244 K and 1 M Agilent arrays. The ADM-2 algorithm was used to identify micro-copy number alterations that measured less than 1 Mb in genomic length.

Results

DNA from MCF-7 breast cancer cells was analyzed for micro-copy number alterations, defined as measuring less than 1 Mb in genomic length. The 4-fold extra resolution of the 1 M array platform relative to the less dense 244 K array platform, led to the improved detection of copy number variations (CNVs) and micro-CNAs. The identification of intra-genic breakpoints in areas of DNA copy number gain signaled the possible presence of gene fusion events. However, the ultra-dense platforms, especially the densest 1 M array, detect artifacts inherent to whole genome amplification and should be used only with non-amplified DNA samples.

Conclusions

This is a first report using 1 M array CGH for the discovery of cancer genes and biomarkers. We show the remarkable capacity of this technology to discover CNVs, micro-copy number alterations and even gene fusions. However, these platforms require excellent genomic DNA quality and do not tolerate relatively small imperfections related to the whole genome amplification.  相似文献   

14.
15.
Necrotic enteritis (NE) is an economically important disease of poultry caused by certain Clostridium perfringens type A strains. NE pathogenesis involves the NetB toxin, which is encoded on a large conjugative plasmid within a 42-kb pathogenicity locus. Recent multilocus sequence type (MLST) studies have identified two predominant NE-associated clonal groups, suggesting that host genes are also involved in NE pathogenesis. We used microarray comparative genomic hybridization (CGH) to assess the gene content of 54 poultry isolates from birds that were healthy or that suffered from NE. A total of 400 genes were variably present among the poultry isolates and nine nonpoultry strains, many of which had putative functions related to nutrient uptake and metabolism and cell wall and capsule biosynthesis. The variable genes were organized into 142 genomic regions, 49 of which contained genes significantly associated with netB-positive isolates. These regions included three previously identified NE-associated loci as well as several apparent fitness-related loci, such as a carbohydrate ABC transporter, a ferric-iron siderophore uptake system, and an adhesion locus. Additional loci were related to plasmid maintenance. Cluster analysis of the CGH data grouped all of the netB-positive poultry isolates into two major groups, separated according to two prevalent clonal groups based on MLST analysis. This study identifies chromosomal loci associated with netB-positive poultry strains, suggesting that the chromosomal background can confer a selective advantage to NE-causing strains, possibly through mechanisms involving iron acquisition, carbohydrate metabolism, and plasmid maintenance.  相似文献   

16.
Escherichia coli, including the closely related genus Shigella, is a highly diverse species in terms of genome structure. Comparative genomic hybridization (CGH) microarray analysis was used to compare the gene content of E. coli K-12 with the gene contents of pathogenic strains. Missing genes in a pathogen were detected on a microarray slide spotted with 4,071 open reading frames (ORFs) of W3110, a commonly used wild-type K-12 strain. For 22 strains subjected to the CGH microarray analyses 1,424 ORFs were found to be absent in at least one strain. The common backbone of the E. coli genome was estimated to contain about 2,800 ORFs. The mosaic distribution of absent regions indicated that the genomes of pathogenic strains were highly diversified because of insertions and deletions. Prophages, cell envelope genes, transporter genes, and regulator genes in the K-12 genome often were not present in pathogens. The gene contents of the strains tested were recognized as a matrix for a neighbor-joining analysis. The phylogenic tree obtained was consistent with the results of previous studies. However, unique relationships between enteroinvasive strains and Shigella, uropathogenic, and some enteropathogenic strains were suggested by the results of this study. The data demonstrated that the CGH microarray technique is useful not only for genomic comparisons but also for phylogenic analysis of E. coli at the strain level.  相似文献   

17.
Genome-wide profiling of gene amplification and deletion in cancer   总被引:3,自引:0,他引:3  
Kashiwagi H  Uchida K 《Human cell》2000,13(3):135-141
Accumulations of genetic changes in somatic cells induce phenotypic transformations leading to cancer. Among these genetic changes, gene amplification and deletion are most frequently observed in several kinds of cancers. Amplification of oncogene and/or deletion of tumor suppressor gene, together with dysfunction of the gene by point mutation, are the main causes of cancer. Genome-wide analysis of amplification and deletion of genes in cancers is basic to resolving the mechanisms of carcinogenesis. Comparative genomic hybridization (CGH) developed in 1992 has been utilized to identify DNA copy number abnormalities in various kind of cancers and several reports have shown its usefulness in screening of the genes involved in carcinogenesis, and also in the identification of prognostic factors in cancer. We have shown that 1q23 gain is associated with neuroblastomas that are resistant to aggressive treatment, and have poor prognosis, and 1q and 13q gains are possibly related to drug resistance in ovarian cancers. Recently, the "rough draft" of the human genome was reported and we are ready to utilize the vast information on genomic sequences in cancer research. Moreover, microarray technology enables us to analyze more than ten thousand genes at a time and revealed genetic abnormalities in cancers at a genome-wide level. By combination of microarray and CGH, a powerful screening method for oncogenes and tumor suppressor genes in cancers, called array-CGH, has been developed by several groups. In this article, we overview these genome-wide analytical methods, CGH and array-CGH, and discuss their potential in molecular characterization of cancers.  相似文献   

18.
19.
Fuzzy J-Means and VNS methods for clustering genes from microarray data   总被引:4,自引:0,他引:4  
MOTIVATION: In the interpretation of gene expression data from a group of microarray experiments that include samples from either different patients or conditions, special consideration must be given to the pleiotropic and epistatic roles of genes, as observed in the variation of gene coexpression patterns. Crisp clustering methods assign each gene to one cluster, thereby omitting information about the multiple roles of genes. RESULTS: Here, we present the application of a local search heuristic, Fuzzy J-Means, embedded into the variable neighborhood search metaheuristic for the clustering of microarray gene expression data. We show that for all the datasets studied this algorithm outperforms the standard Fuzzy C-Means heuristic. Different methods for the utilization of cluster membership information in determining gene coregulation are presented. The clustering and data analyses were performed on simulated datasets as well as experimental cDNA microarray data for breast cancer and human blood from the Stanford Microarray Database. AVAILABILITY: The source code of the clustering software (C programming language) is freely available from Nabil.Belacel@nrc-cnrc.gc.ca  相似文献   

20.
One important problem in genomic research is to identify genomic features such as gene expression data or DNA single nucleotide polymorphisms (SNPs) that are related to clinical phenotypes. Often these genomic data can be naturally divided into biologically meaningful groups such as genes belonging to the same pathways or SNPs within genes. In this paper, we propose group additive regression models and a group gradient descent boosting procedure for identifying groups of genomic features that are related to clinical phenotypes. Our simulation results show that by dividing the variables into appropriate groups, we can obtain better identification of the group features that are related to the phenotypes. In addition, the prediction mean square errors are also smaller than the component-wise boosting procedure. We demonstrate the application of the methods to pathway-based analysis of microarray gene expression data of breast cancer. Results from analysis of a breast cancer microarray gene expression data set indicate that the pathways of metalloendopeptidases (MMPs) and MMP inhibitors, as well as cell proliferation, cell growth, and maintenance are important to breast cancer-specific survival.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号