首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Due to versatile diagnostic and prognostic fidelity molecular signatures or fingerprints are anticipated as the most powerful tools for cancer management in the near future. Notwithstanding the experimental advancements in microarray technology, methods for analyzing either whole arrays or gene signatures have not been firmly established. Recently, an algorithm, ArraySolver has been reported by Khan for two-group comparison of microarray gene expression data using two-tailed Wilcoxon signed-rank test. Most of the molecular signatures are composed of two sets of genes (hybrid signatures) wherein up-regulation of one set and down-regulation of the other set collectively define the purpose of a gene signature. Since the direction of a selected gene's expression (positive or negative) with respect to a particular disease condition is known, application of one-tailed statistics could be a more relevant choice. A novel method, ArrayVigil, is described for comparing hybrid signatures using segregated-one-tailed (SOT) Wilcoxon signed-rank test and the results compared with integrated-two-tailed (ITT) procedures (SPSS and ArraySolver). ArrayVigil resulted in lower P values than those obtained from ITT statistics while comparing real data from four signatures.  相似文献   

2.

Background  

Genome-wide expression signatures are emerging as potential marker for overall survival and disease recurrence risk as evidenced by recent commercialization of gene expression based biomarkers in breast cancer. Similar predictions have recently been carried out using genome-wide copy number alterations and microRNAs. Existing software packages for microarray data analysis provide functions to define expression-based survival gene signatures. However, there is no software that can perform survival analysis using SNP array data or draw survival curves interactively for expression-based sample clusters.  相似文献   

3.
Microarray gene expression signatures hold great promise to improve diagnosis and prognosis of disease. However, current documentation standards of such signatures do not allow for an unambiguous application to study-external patients. This hinders independent evaluation, effectively delaying the use of signatures in clinical practice. Data from eight publicly available clinical microarray studies were analyzed and the consistency of study-internal with study-external diagnoses was evaluated. Study-external classifications were based on documented information only. Documenting a signature is conceptually different from reporting a list of genes. We show that even the exact quantitative specification of a classification rule alone does not define a signature unambiguously. We found that discrepancy between study-internal and study-external diagnoses can be as frequent as 30% (worst case) and 18% (median). By using the proposed documentation by value strategy, which documents quantitative preprocessing information, the median discrepancy was reduced to 1%. The process of evaluating microarray gene expression diagnostic signatures and bringing them to clinical practice can be substantially improved and made more reliable by better documentation of the signatures.  相似文献   

4.
Fröhlich H 《PloS one》2011,6(10):e25364
Diagnostic and prognostic biomarkers for cancer based on gene expression profiles are viewed as a major step towards a better personalized medicine. Many studies using various computational approaches have been published in this direction during the last decade. However, when comparing different gene signatures for related clinical questions often only a small overlap is observed. This can have various reasons, such as technical differences of platforms, differences in biological samples or their treatment in lab, or statistical reasons because of the high dimensionality of the data combined with small sample size, leading to unstable selection of genes. In conclusion retrieved gene signatures are often hard to interpret from a biological point of view. We here demonstrate that it is possible to construct a consensus signature from a set of seemingly different gene signatures by mapping them on a protein interaction network. Common upstream proteins of close gene products, which we identified via our developed algorithm, show a very clear and significant functional interpretation in terms of overrepresented KEGG pathways, disease associated genes and known drug targets. Moreover, we show that such a consensus signature can serve as prior knowledge for predictive biomarker discovery in breast cancer. Evaluation on different datasets shows that signatures derived from the consensus signature reveal a much higher stability than signatures learned from all probesets on a microarray, while at the same time being at least as predictive. Furthermore, they are clearly interpretable in terms of enriched pathways, disease associated genes and known drug targets. In summary we thus believe that network based consensus signatures are not only a way to relate seemingly different gene signatures to each other in a functional manner, but also to establish prior knowledge for highly stable and interpretable predictive biomarkers.  相似文献   

5.
Biomarkers derived from gene expression profiling data may have a high false-positive rate and must be rigorously validated using independent clinical data sets, which are not always available. Although animal model systems could provide alternative data sets to formulate hypotheses and limit the number of signatures to be tested in clinical samples, the predictive power of such an approach is not yet proven. The present study aims to analyze the molecular signatures of liver cancer in a c-MET-transgenic mouse model and investigate its prognostic relevance to human hepatocellular carcinoma (HCC). Tissue samples were obtained from tumor (TU), adjacent non-tumor (AN) and distant normal (DN) liver in Tet-operator regulated (TRE) human c-MET transgenic mice (n = 21) as well as from a Chinese cohort of 272 HBV- and 9 HCV-associated HCC patients. Whole genome microarray expression profiling was conducted in Affymetrix gene expression chips, and prognostic significances of gene expression signatures were evaluated across the two species. Our data revealed parallels between mouse and human liver tumors, including down-regulation of metabolic pathways and up-regulation of cell cycle processes. The mouse tumors were most similar to a subset of patient samples characterized by activation of the Wnt pathway, but distinctive in the p53 pathway signals. Of potential clinical utility, we identified a set of genes that were down regulated in both mouse tumors and human HCC having significant predictive power on overall and disease-free survival, which were highly enriched for metabolic functions. In conclusions, this study provides evidence that a disease model can serve as a possible platform for generating hypotheses to be tested in human tissues and highlights an efficient method for generating biomarker signatures before extensive clinical trials have been initiated.  相似文献   

6.
Global gene expression profiles of thousands of cancer samples have been completed, giving rise to hundreds of gene expression signatures (GES). Although many expression signatures show promise in predicting patient prognosis or response to therapies, the usefulness of the signatures in understanding the underlying mechanisms of cancer has not been fully exploited. While “reverse genomic” methods can test specific hypotheses of gene regulation, they fare less well in deciphering novel or combinatorial mechanisms of gene regulation. Recently we described SLAMS (stepwise linkage analysis of microarray signatures), a novel method that can prospectively identify genetic regulators of gene expression signatures in cancer. Applying SLAMS on a poor-prognosis wound signature in human breast cancer, we identified CSN5-mediated ubiquitination of MYC as a novel mechanism to activate a biological program favoring metastasis.  相似文献   

7.

Background  

Multiple gene expression signatures derived from microarray experiments have been published in the field of leukemia research. A comparison of these signatures with results from new experiments is useful for verification as well as for interpretation of the results obtained. Currently, the percentage of overlapping genes is frequently used to compare published gene signatures against a signature derived from a new experiment. However, it has been shown that the percentage of overlapping genes is of limited use for comparing two experiments due to the variability of gene signatures caused by different array platforms or assay-specific influencing parameters. Here, we present a robust approach for a systematic and quantitative comparison of published gene expression signatures with an exemplary query dataset.  相似文献   

8.
9.
The massive surge in the production of microarray data poses a great challenge for proper analysis and interpretation. In recent years numerous computational tools have been developed to extract meaningful interpretation of microarray gene expression data. However, a convenient tool for two-groups comparison of microarray data is still lacking and users have to rely on commercial statistical packages that might be costly and require special skills, in addition to extra time and effort for transferring data from one platform to other. Various statistical methods, including the t-test, analysis of variance, Pearson test and Mann-Whitney U test, have been reported for comparing microarray data, whereas the utilization of the Wilcoxon signed-rank test, which is an appropriate test for two-groups comparison of gene expression data, has largely been neglected in microarray studies. The aim of this investigation was to build an integrated tool, ArraySolver, for colour-coded graphical display and comparison of gene expression data using the Wilcoxon signed-rank test. The results of software validation showed similar outputs with ArraySolver and SPSS for large datasets. Whereas the former program appeared to be more accurate for 25 or fewer pairs (n 相似文献   

10.
We have used a supervised classification approach to systematically mine a large microarray database derived from livers of compound‐treated rats. Thirty‐four distinct signatures (classifiers) for pharmacological and toxicological end points can be identified. Just 200 genes are sufficient to classify these end points. Signatures were enriched in xenobiotic and immune response genes and contain un‐annotated genes, indicating that not all key genes in the liver xenobiotic responses have been characterized. Many signatures with equal classification capabilities but with no gene in common can be derived for the same phenotypic end point. The analysis of the union of all genes present in these signatures can reveal the underlying biology of that end point as illustrated here using liver fibrosis signatures. Our approach using the whole genome and a diverse set of compounds allows a comprehensive view of most pharmacological and toxicological questions and is applicable to other situations such as disease and development.  相似文献   

11.
limma is an R/Bioconductor software package that provides an integrated solution for analysing data from gene expression experiments. It contains rich features for handling complex experimental designs and for information borrowing to overcome the problem of small sample sizes. Over the past decade, limma has been a popular choice for gene discovery through differential expression analyses of microarray and high-throughput PCR data. The package contains particularly strong facilities for reading, normalizing and exploring such data. Recently, the capabilities of limma have been significantly expanded in two important directions. First, the package can now perform both differential expression and differential splicing analyses of RNA sequencing (RNA-seq) data. All the downstream analysis tools previously restricted to microarray data are now available for RNA-seq as well. These capabilities allow users to analyse both RNA-seq and microarray data with very similar pipelines. Second, the package is now able to go past the traditional gene-wise expression analyses in a variety of ways, analysing expression profiles in terms of co-regulated sets of genes or in terms of higher-order expression signatures. This provides enhanced possibilities for biological interpretation of gene expression differences. This article reviews the philosophy and design of the limma package, summarizing both new and historical features, with an emphasis on recent enhancements and features that have not been previously described.  相似文献   

12.
13.
Biological maintenance of cells under variable conditions should affect gene expression of only certain genes while leaving the rest unchanged. The latter, termed "housekeeping genes," by definition must reflect no change in their expression levels during cell development, treatment, or disease state anomalies. However, deviations from this rule have been observed. Using DNA microarray technology, we report here variations in expression levels of certain housekeeping genes in prostate cancer and a colorectal cancer gene therapy model system. To highlight, differential expression was observed for ribosomal protein genes in the prostate cancer cells and beta-actin in treated colorectal cells. High-throughput differential gene expression analysis via microarray technology and quantitative PCR has become a common platform for classifying variations in similar types of cancers, response to chemotherapy, identifying disease markers, etc. Therefore, normalization of the system based on housekeeping genes, such as those reported here in cancer, must be approached with caution.  相似文献   

14.

Background  

Cancer diagnosis and clinical outcome prediction are among the most important emerging applications of gene expression microarray technology with several molecular signatures on their way toward clinical deployment. Use of the most accurate classification algorithms available for microarray gene expression data is a critical ingredient in order to develop the best possible molecular signatures for patient care. As suggested by a large body of literature to date, support vector machines can be considered "best of class" algorithms for classification of such data. Recent work, however, suggests that random forest classifiers may outperform support vector machines in this domain.  相似文献   

15.
Gene expression analysis is generally performed on heterogeneous tissue samples consisting of multiple cell types. Current methods developed to separate heterogeneous gene expression rely on prior knowledge of the cell-type composition and/or signatures - these are not available in most public datasets. We present a novel method to identify the cell-type composition, signatures and proportions per sample without need for a-priori information. The method was successfully tested on controlled and semi-controlled datasets and performed as accurately as current methods that do require additional information. As such, this method enables the analysis of cell-type specific gene expression using existing large pools of publically available microarray datasets.  相似文献   

16.
Affymetrix GeneChip microarrays are the most widely used high-throughput technology to measure gene expression, and a wide variety of preprocessing methods have been developed to transform probe intensities reported by a microarray scanner into gene expression estimates. There have been numerous comparisons of these preprocessing methods, focusing on the most common analyses-detection of differential expression and gene or sample clustering. Recently, more complex multivariate analyses, such as gene co-expression, differential co-expression, gene set analysis and network modeling, are becoming more common; however, the same preprocessing methods are typically applied. In this article, we examine the effect of preprocessing methods on some of these multivariate analyses and provide guidance to the user as to which methods are most appropriate.  相似文献   

17.

Background

Many common diseases arise from an interaction between environmental and genetic factors. Our knowledge regarding environment and gene interactions is growing, but frameworks to build an association between gene-environment interactions and disease using preexisting, publicly available data has been lacking. Integrating freely-available environment-gene interaction and disease phenotype data would allow hypothesis generation for potential environmental associations to disease.

Methods

We integrated publicly available disease-specific gene expression microarray data and curated chemical-gene interaction data to systematically predict environmental chemicals associated with disease. We derived chemical-gene signatures for 1,338 chemical/environmental chemicals from the Comparative Toxicogenomics Database (CTD). We associated these chemical-gene signatures with differentially expressed genes from datasets found in the Gene Expression Omnibus (GEO) through an enrichment test.

Results

We were able to verify our analytic method by accurately identifying chemicals applied to samples and cell lines. Furthermore, we were able to predict known and novel environmental associations with prostate, lung, and breast cancers, such as estradiol and bisphenol A.

Conclusions

We have developed a scalable and statistical method to identify possible environmental associations with disease using publicly available data and have validated some of the associations in the literature.  相似文献   

18.
19.
Web Tools for Rice Transcriptome Analyses   总被引:1,自引:0,他引:1  
Gene expression databases provide profiling data for the expression of thousands of genes to researchers worldwide. Oligonucleotide microarray technology is a useful tool that has been employed to produce gene expression profiles in most species. In rice, there are five genome-wide DNA microarray platforms: NSF 45K, BGI/Yale 60K, Affymetrix, Agilent Rice 44K, and NimbleGen 390K. Presently, more than 1,700 hybridizations of microarray gene expression data are available from public microarray depositing databases such as NCBI gene expression omnibus and Arrayexpress at EBI. More processing or reformatting of public gene expression data is required for further applications or analyses. Web-based databases for expression meta-analyses are useful for guiding researchers in designing relevant research schemes. In this review, we summarize various databases for expression meta-analyses of rice genes and web tools for further applications, such as the development of co-expression network or functional gene network.  相似文献   

20.
Despite significant advances in the treatment of primary cancer, the ability to predict the metastatic behavior of a patient's cancer, as well as to detect and eradicate such recurrences, remain major clinical challenges in oncology. While many potential molecular biomarkers have been identified and tested previously, none have greatly improved the accuracy of specimen evaluation over routine histopathological criteria and they predict individual outcomes poorly. However, the recent introduction of high-throughput microarray technology has opened new avenues in genomic investigation of cancer, and through application in tissue-based studies and appropriate animal models, has facilitated the identification of gene expression signatures that are associated with the lethal progression of breast cancer. The use of these approaches has the potential to greatly impact our knowledge of tumor biology, to provide efficient biomarkers, and enable development towards customized prognostication and therapies for the individual.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号