Similar literature
 Found 20 similar articles (search time: 62 ms)
1.
An important problem in the analysis of large-scale gene expression data is the validation of gene expression clusters. By examining the temporal expression patterns of 74 genes expressed in rat spinal cord under three different experimental conditions, we have found evidence that some genes cluster together under multiple conditions. Using RT-PCR data from spinal cord development and two sets of microarray data from spinal injury, we applied Spearman correlation to identify clusters and to assign P values to pairs of genes with highly similar temporal expression patterns. We found that 15% of genes occurred in statistically significant pairs in all three experimental conditions, providing both statistical and experimental support for the idea that genes that cluster together are co-regulated. In addition, we demonstrated that DNA microarray and RT-PCR data are comparable, and can be combined to confirm gene expression relationships.
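
The pairing step described above can be illustrated with a minimal sketch. It computes a Spearman rank correlation for two temporal expression profiles and estimates a P value by permutation. The permutation test, the function names, and the sample profiles are assumptions made for illustration; the abstract does not say how its P values were obtained.

```python
import math
import random


def ranks(values):
    """Return 1-based ranks; tied values share their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


def permutation_p_value(x, y, n_perm=2000, seed=0):
    """Two-sided permutation P value for the Spearman correlation (assumed test)."""
    rng = random.Random(seed)
    observed = abs(spearman(x, y))
    shuffled = list(y)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if abs(spearman(x, shuffled)) >= observed - 1e-12:
            hits += 1
    return (hits + 1) / (n_perm + 1)


# Hypothetical temporal profiles for two genes over eight time points.
gene_a = [0.1, 0.4, 0.9, 1.6, 2.0, 2.1, 2.5, 3.0]
gene_b = [1.0, 1.2, 2.0, 2.9, 3.1, 3.0, 3.8, 4.5]

rho = spearman(gene_a, gene_b)
p_value = permutation_p_value(gene_a, gene_b)
print(f"rho={rho:.3f}, p={p_value:.4f}")
```

Working on ranks means only the ordering of the time points matters, so the same routine can be applied to RT-PCR and microarray measurements that sit on different scales.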

2.

Background  

A common observation in the analysis of gene expression data is that many genes display similarity in their expression patterns and therefore appear to be co-regulated. However, the variation associated with microarray data and the complexity of the experimental designs make the acquisition of co-expressed genes a challenge. We developed a novel method for Extracting microarray gene expression Patterns and Identifying co-expressed Genes, designated as EPIG. The approach utilizes the underlying structure of gene expression data to extract patterns and identify co-expressed genes that are responsive to experimental conditions.

3.
4.
5.
Clustering techniques have been widely used in the analysis of microarray data to group genes with similar expression profiles. The similarity of expression profiles and hence the results of clustering greatly depend on how the data has been transformed. We present a method that uses the relative expression changes between pairs of conditions and an angular transformation to define the similarity of gene expression patterns. The pairwise comparisons of experimental conditions can be chosen to reflect the purpose of clustering, allowing control over the definition of similarity between genes. A variational Bayes mixture modeling approach is then used to find clusters within the transformed data. The purpose of microarray data analysis is often to locate groups of genes showing particular patterns of expression change and, within these groups, to locate specific target genes that may warrant further experimental investigation. We show that the angular transformation maps data to a representation from which information, in terms of relative regulation changes, can be automatically mined. This information can then be used to understand the "features" of expression change important to different clusters, allowing potentially interesting clusters to be easily located. Finally, we show how the genes within a cluster can be visualized in terms of their expression pattern and intensity change, allowing potential target genes to be highlighted within the clusters of interest.

6.

Background  

Microarray technology has made it possible to simultaneously measure the expression levels of large numbers of genes in a short time. Gene expression data is information rich; however, extensive data mining is required to identify the patterns that characterize the underlying mechanisms of action. Clustering is an important tool for finding groups of genes with similar expression patterns in microarray data analysis. However, hard clustering methods, which assign each gene exactly to one cluster, are poorly suited to the analysis of microarray datasets because in such datasets the clusters of genes frequently overlap.
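
To make the contrast with hard clustering concrete, the sketch below computes soft memberships for one gene profile using the standard fuzzy c-means membership formula. Fuzzy c-means is swapped in here purely as an illustration of overlapping clusters; the abstract does not name the method it goes on to use. The centroids and the profile are hypothetical.

```python
import math


def fuzzy_memberships(profile, centroids, fuzzifier=2.0):
    """Fuzzy c-means membership of one profile in each cluster.

    u_k = 1 / sum_j (d_k / d_j) ** (2 / (fuzzifier - 1)),
    where d_k is the Euclidean distance to centroid k.
    """
    distances = [math.dist(profile, c) for c in centroids]
    if any(d == 0.0 for d in distances):
        # The profile coincides with one or more centroids: share the
        # full membership among those centroids only.
        zeros = [1.0 if d == 0.0 else 0.0 for d in distances]
        total = sum(zeros)
        return [z / total for z in zeros]
    exponent = 2.0 / (fuzzifier - 1.0)
    return [
        1.0 / sum((d_k / d_j) ** exponent for d_j in distances)
        for d_k in distances
    ]


# Two hypothetical cluster centroids in a two-condition expression space.
centroids = [[0.0, 0.0], [2.0, 0.0]]
memberships = fuzzy_memberships([0.5, 0.0], centroids)
print(memberships)
```

A hard method would assign this gene to the first cluster only, whereas the memberships keep a smaller weight on the second cluster, which is the kind of overlap the background paragraph refers to.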

7.
MOTIVATION: Association pattern discovery (APD) methods have been successfully applied to gene expression data. They find groups of co-regulated genes in which the genes are either up- or down-regulated throughout the identified conditions. These methods, however, fail to identify similarly expressed genes whose expressions change between up- and down-regulation from one condition to another. In order to discover these hidden patterns, we propose the concept of mining co-regulated gene profiles. Co-regulated gene profiles contain two gene sets such that genes within the same set behave identically (up or down) while genes from different sets display contrary behavior. To reduce and group the large number of similar resulting patterns, we propose a new similarity measure that can be applied together with hierarchical clustering methods. RESULTS: We tested our proposed method on two well-known yeast microarray data sets. Our implementation mined the data effectively and discovered patterns of co-regulated genes that are hidden to traditional APD methods. The high content of biologically relevant information in these patterns is demonstrated by the significant enrichment of co-regulated genes with similar functions. Our experimental results show that the Mining Attribute Profile (MAP) method is an efficient tool for the analysis of gene expression data and competitive with bi-clustering techniques.
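
The definition of a co-regulated gene profile (two gene sets, identical behavior within a set, contrary behavior between sets) can be sketched as follows. This is not the MAP algorithm itself: it only checks the definition against one reference gene, and the threshold, function names, gene names, and log-ratios are all hypothetical.

```python
def discretize(values, threshold=1.0):
    """Map log-ratios to +1 (up), -1 (down) or 0 (unchanged)."""
    return [1 if v >= threshold else -1 if v <= -threshold else 0 for v in values]


def split_by_reference(states, reference, conditions):
    """Split genes into those matching and those opposing a reference gene.

    Returns (same, opposite): genes whose discretized state equals the
    reference in every selected condition, and genes whose state is the
    exact opposite in every selected condition.
    """
    ref = [states[reference][c] for c in conditions]
    if 0 in ref:
        raise ValueError("reference gene must be up or down in every condition")
    inverted = [-s for s in ref]
    same, opposite = [reference], []
    for gene, row in states.items():
        if gene == reference:
            continue
        selected = [row[c] for c in conditions]
        if selected == ref:
            same.append(gene)
        elif selected == inverted:
            opposite.append(gene)
    return sorted(same), sorted(opposite)


# Hypothetical log-ratios for four genes under three conditions.
log_ratios = {
    "g1": [2.0, -1.5, 1.2],
    "g2": [1.1, -2.0, 1.8],
    "g3": [-1.3, 1.7, -2.2],
    "g4": [1.5, 1.4, 0.2],
}
states = {gene: discretize(values) for gene, values in log_ratios.items()}
same, opposite = split_by_reference(states, "g1", [0, 1, 2])
print(same, opposite)  # ['g1', 'g2'] ['g3']
```

Note that g1 and g2 switch between up- and down-regulation across conditions, so a method that requires a constant direction would not group them, while g3 lands in the contrary set.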

8.

Background  

The underlying goal of microarray experiments is to identify gene expression patterns across different experimental conditions. Genes that are contained in a particular pathway or that respond similarly to experimental conditions could be co-expressed and show similar patterns of expression on a microarray. Using any of a variety of clustering methods or gene network analyses we can partition genes of interest into groups, clusters, or modules based on measures of similarity. Typically, Pearson correlation is used to measure distance (or similarity) before implementing a clustering algorithm. Pearson correlation is quite susceptible to outliers, however, which is an unfortunate characteristic when dealing with microarray data (well known to be typically quite noisy).
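
A small numeric illustration of the outlier problem: in the sketch below, seven of eight points are perfectly anti-correlated, yet a single extreme point drives the Pearson correlation close to +1, while a rank-based (Spearman) correlation stays negative. The data are made up, and Spearman is used only as one familiar robust alternative; the abstract does not say which measure its authors propose.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def simple_ranks(values):
    """1-based ranks; assumes all values are distinct (no tie handling)."""
    position = {v: i + 1 for i, v in enumerate(sorted(values))}
    return [position[v] for v in values]


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(simple_ranks(x), simple_ranks(y))


# Seven anti-correlated measurements plus one outlier at (50, 50).
x = [1, 2, 3, 4, 5, 6, 7, 50]
y = [7, 6, 5, 4, 3, 2, 1, 50]

r_pearson = pearson(x, y)
r_spearman = spearman(x, y)
print(f"Pearson={r_pearson:.3f}, Spearman={r_spearman:.3f}")
```

With one noisy spot out of eight, a Pearson-based distance would place these two genes in the same cluster even though most of the measurements move in opposite directions.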

9.
The identification of the genes regulating neural progenitor cell (NPC) functions is of great importance to developmental neuroscience and neural repair. Previously, we combined genetic subtraction and microarray analysis to identify genes enriched in neural progenitor cultures. Here, we apply a strategy to further stratify the neural progenitor genes. In situ hybridization demonstrates expression in the central nervous system germinal zones of 54 clones so identified, making them highly relevant for study in brain and neural progenitor development. Using microarray analysis we find 73 genes enriched in three neural stem cell (NSC)-containing populations generated under different conditions. We use the custom microarray to identify 38 "stemness" genes, with enriched expression in the three NSC conditions and present in both embryonic stem cells and hematopoietic stem cells. However, comparison of expression profiles from these stem cell populations indicates that while there is shared gene expression, the amount of genetic overlap is no more than what would be expected by chance, indicating that different stem cells have largely different gene expression patterns. Taken together, these studies identify many genes not previously associated with neural progenitor cell biology and also provide a rational scheme for stratification of microarray data for functional analysis.

10.
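华琳  郑卫英  刘红  林慧  高磊 《生物工程学报》 (Chinese Journal of Biotechnology) 2008,24(9):1643-1648
Using a random forest pathway analysis method, characteristic metabolic pathways were selected according to the classification error rate on out-of-bag (OOB) samples. Gene expression correlation was then studied on the characteristic pathways, and the MAP (Mining Attribute Profile) algorithm was applied to the genes on these pathways to mine co-regulated expression patterns under different experimental conditions; the co-regulated expression patterns were subsequently clustered. The results show that genes on the same characteristic metabolic pathway tend to be expressed similarly, and that two characteristic metabolic pathways exhibit co-expression patterns. One of these pathways contains 108 expression patterns; when these patterns are clustered, the similarity coefficient of the lowest cluster is still as high as 0.623. This indicates that the co-expression patterns of genes on the same characteristic metabolic pathway remain highly similar under different experimental conditions. The approach offers a useful reference for studying complex diseases with pathways treated as gene modules.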

11.
DNA microarray analysis of Clostridium acetobutylicum was used to examine the genomic-scale gene expression changes during the shift from exponential-phase growth and acidogenesis to stationary phase and solventogenesis. Self-organizing maps were used to identify novel expression patterns of functional gene classes, including aromatic and branched-chain amino acid synthesis, ribosomal proteins, cobalt and iron transporters, cobalamin biosynthesis, and lipid biosynthesis. The majority of pSOL1 megaplasmid genes (in addition to the solventogenic genes aad-ctfA-ctfB and adc) had increased expression at the onset of solventogenesis, suggesting that other megaplasmid genes may play a role in stationary-phase phenomena. Analysis of sporulation genes and comparison with published Bacillus subtilis results indicated conserved expression patterns of early sporulation genes, including spo0A, the sigF operon, and putative canonical genes of the sigma(H) and sigma(F) regulons. However, sigE expression could not be detected within 7.5 h of initial spo0A expression, consistent with the observed extended time between the appearance of clostridial forms and endospore formation. The results were compared with microarray comparisons of the wild-type strain and the nonsolventogenic, asporogenous M5 strain, which lacks the pSOL1 megaplasmid. While some results were similar, the expression of primary metabolism genes and heat shock proteins was higher in M5, suggesting a difference in metabolic regulation or a butyrate stress response in M5. The results of this microarray platform and analysis were further validated by comparing gene expression patterns to previously published Northern analyses, reporter assays, and two-dimensional protein electrophoresis data of metabolic genes (including all major solventogenesis genes), sporulation genes, heat shock proteins, and other solventogenesis-induced gene expression.

12.
The detection of genes that show similar profiles under different experimental conditions is often an initial step in inferring the biological significance of such genes. Visualization tools are used to identify genes with similar profiles in microarray studies. Given the large number of genes recorded in microarray experiments, gene expression data are generally displayed on a low dimensional plot, based on linear methods. However, microarray data show nonlinearity, due to high-order terms of interaction between genes, so alternative approaches, such as kernel methods, may be more appropriate. We introduce a technique that combines kernel principal component analysis (KPCA) and Biplot to visualize gene expression profiles. Our approach relies on the singular value decomposition of the input matrix and incorporates an additional step that involves KPCA. The main properties of our method are the extraction of nonlinear features and the preservation of the input variables (genes) in the output display. We apply this algorithm to colon tumor, leukemia and lymphoma datasets. Our approach reveals the underlying structure of the gene expression profiles and provides a more intuitive understanding of the gene and sample association.

13.
14.
One of the essential issues in microarray data analysis is to identify differentially expressed genes (DEGs) under different experimental treatments. In this article, a statistical procedure was proposed to identify the DEGs for gene expression data with or without missing observations from microarray experiments with one- or two-treatment factors. An F statistic based on Henderson method III was constructed to test the significance of differential expression for each gene under different treatment levels. The cutoff P value was adjusted to control the experiment-wise false discovery rate. A human acute leukemia dataset collected from 38 leukemia patients was reanalyzed by the proposed method. In comparison to the results from significance analysis of microarrays (SAM) and microarray analysis of variance (MAANOVA), the proposed method was found to perform similarly to MAANOVA for data with one-treatment factor, but MAANOVA cannot directly handle missing data. In addition, a mouse brain dataset collected from six brain regions of two inbred strains (two-treatment factors) was reanalyzed to identify genes with distinct regional-specific expression patterns. The results showed that the proposed method could identify more distinct regional-specific expression patterns than the previous analysis of the same dataset. Moreover, a computer program was developed and incorporated in the software QTModel, which is freely available at .
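
The abstract states that the P value cutoff is adjusted to control the false discovery rate but does not name the procedure. As a hedged illustration, the sketch below applies the widely used Benjamini-Hochberg step-up adjustment to a list of per-gene P values; the choice of procedure and the P values are assumptions, not details taken from the paper.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted P values in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down so that each adjusted value is
    # the minimum of p * m / rank over all ranks at or above its own.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


# Hypothetical per-gene P values from an F test.
p_values = [0.001, 0.008, 0.039, 0.041, 0.6]
adjusted = benjamini_hochberg(p_values)
significant = [i for i, q in enumerate(adjusted) if q <= 0.05]
print(adjusted, significant)
```

In this example the third and fourth genes have raw P values below 0.05 but adjusted values just above it, so only the first two genes would be reported at a 5% false discovery rate.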

15.
Yi Y  Mirosevich J  Shyr Y  Matusik R  George AL 《Genomics》2005,85(3):401-412
Microarray technology can be used to assess simultaneously global changes in expression of mRNA or genomic DNA copy number among thousands of genes in different biological states. In many cases, it is desirable to determine if altered patterns of gene expression correlate with chromosomal abnormalities or assess expression of genes that are contiguous in the genome. We describe a method, differential gene locus mapping (DIGMAP), which aligns the known chromosomal location of a gene to its expression value deduced by microarray analysis. The method partitions microarray data into subsets by chromosomal location for each gene interrogated by an array. Microarray data in an individual subset can then be clustered by physical location of genes at a subchromosomal level based upon ordered alignment in genome sequence. A graphical display is generated by representing each genomic locus with a colored cell that quantitatively reflects its differential expression value. The clustered patterns can be viewed and compared based on their expression signatures as defined by differential values between control and experimental samples. In this study, DIGMAP was tested using previously published studies of breast cancer analyzed by comparative genomic hybridization (CGH) and prostate cancer gene expression profiles assessed by cDNA microarray experiments. Analysis of the breast cancer CGH data demonstrated the ability of DIGMAP to deduce gene amplifications and deletions. Application of the DIGMAP method to the prostate data revealed several carcinoma-related loci, including one at 16q13 with marked differential expression encompassing 19 known genes including 9 encoding metallothionein proteins. We conclude that DIGMAP is a powerful computational tool enabling the coupled analysis of microarray data with genome location.
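
The partition-and-order step described above can be sketched in a few lines. The record layout, the gene names, and the use of a simple difference of log-scale values as the "differential value" are assumptions for illustration; this is not the DIGMAP implementation.

```python
from collections import defaultdict


def partition_by_chromosome(records):
    """Group genes by chromosome and order each group by genomic position.

    records: iterable of (gene, chromosome, start, control, experimental),
    where control and experimental are assumed to be log-scale expression
    values. Returns {chromosome: [(start, gene, differential), ...]}.
    """
    by_chrom = defaultdict(list)
    for gene, chrom, start, control, experimental in records:
        by_chrom[chrom].append((start, gene, experimental - control))
    return {chrom: sorted(loci) for chrom, loci in by_chrom.items()}


# Hypothetical array records; names and coordinates are placeholders.
records = [
    ("geneC", "chr16", 300, 5.0, 8.0),
    ("geneA", "chr16", 100, 6.0, 6.5),
    ("geneB", "chr1", 50, 7.0, 4.0),
]
subsets = partition_by_chromosome(records)
for chrom, loci in sorted(subsets.items()):
    print(chrom, loci)
```

Once each subset is ordered by position, runs of neighboring loci with large differential values of the same sign become visible, which is what a colored-cell display of this ordering is meant to expose.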

16.
17.
Interactive semisupervised learning for microarray analysis   Total citations: 3, self-citations: 0, citations by others: 3
Microarray technology has generated vast amounts of gene expression data with distinct patterns. Based on the premise that genes of correlated functions tend to exhibit similar expression patterns, various machine learning methods have been applied to capture these specific patterns in microarray data. However, the discrepancy between the rich expression profiles and the limited knowledge of gene functions has been a major hurdle to the understanding of cellular networks. To bridge this gap so as to properly comprehend and interpret expression data, we introduce relevance feedback to microarray analysis and propose an interactive learning framework to incorporate the expert knowledge into the decision module. In order to find a good learning method and solve two intrinsic problems in microarray data, high dimensionality and small sample size, we also propose a semisupervised learning algorithm: kernel discriminant-EM (KDEM). This algorithm efficiently utilizes a large set of unlabeled data to compensate for the insufficiency of a small set of labeled data and it extends the linear algorithm in discriminant-EM (DEM) to a kernel algorithm to handle nonlinearly separable data in a lower dimensional space. The relevance feedback technique and KDEM together construct an efficient and effective interactive semisupervised learning framework for microarray analysis. Extensive experiments on the yeast cell cycle regulation data set and Plasmodium falciparum red blood cell cycle data set show the promise of this approach.

18.
19.
GENEVESTIGATOR. Arabidopsis microarray database and analysis toolbox   Total citations: 26, self-citations: 0, citations by others: 26
High-throughput gene expression analysis has become a frequent and powerful research tool in biology. At present, however, few software applications have been developed for biologists to query large microarray gene expression databases using a Web-browser interface. We present GENEVESTIGATOR, a database and Web-browser data mining interface for Affymetrix GeneChip data. Users can query the database to retrieve the expression patterns of individual genes throughout chosen environmental conditions, growth stages, or organs. Conversely, mining tools allow users to identify genes specifically expressed during selected stresses, growth stages, or in particular organs. Using GENEVESTIGATOR, the gene expression profiles of more than 22,000 Arabidopsis genes can be obtained, including those of 10,600 currently uncharacterized genes. The objective of this software application is to direct gene functional discovery and design of new experiments by providing plant biologists with contextual information on the expression of genes. The database and analysis toolbox is available as a community resource at https://www.genevestigator.ethz.ch.

20.
Time course microarray experiments designed to characterize the dynamic regulation of gene expression in biological systems are becoming increasingly important. One critical issue that arises when examining time course microarray data is the identification of genes that show different temporal expression patterns among biological conditions. Here we propose a Bayesian hierarchical model to incorporate important experimental factors and to account for correlated gene expression measurements over time and over different genes. A new gene selection algorithm is also presented with the model to simultaneously identify genes that show changes in expression among biological conditions, in response to time and other experimental factors of interest. The algorithm performs well in terms of the false positive and false negative rates in simulation studies. The methodology is applied to a mouse model time course experiment to correlate temporal changes in azoxymethane-induced gene expression profiles with colorectal cancer susceptibility.
