首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 203 毫秒
1.
Progress and challenges in studies of the evolution of development   总被引:4,自引:0,他引:4  
Plant evolutionary developmental genetics (EDG) has made considerable progress over the last decade. This is in part due to the accumulation of large amounts of sequence data that have provided robust organismal phylogenies and, increasingly, broad assessments of molecular evolution. Attempts to use primary sequence data to identify genes that have changed function in evolutionary time have not been as successful as initially hoped. The coding sequences of most genes, which are more amenable to statistical analysis than are regulatory sequences, are generally under purifying selection, as would be expected if much evolutionary change is the result of changes in cis-regulatory sequences. Sequence-based analysis of the regulatory sequences themselves remains difficult. Comparative studies of gene expression have been useful to identify genes whose developmental role may have changed in evolutionary time and will be critical to the future development of EDG. Such studies can be used to test hypotheses of gene function. Transformation experiments are often illuminating, but can be hard to interpret, particularly if genes from multiple species are all placed into a single heterologous system such as Arabidopsis. The ideal experiment would be a gene swap or promoter swap between two species, but this awaits development of good transformation systems. The immediate need for EDG is studies of gene expression on a massive scale, far broader than any studies undertaken to date.  相似文献   

2.
Microarray experiments have yielded massive amounts of expression information measured under various conditions for the model species Arabidopsis (Arabidopsis thaliana) and rice (Oryza sativa). Expression compendia grouping multiple experiments make it possible to define correlated gene expression patterns within one species and to study how expression has evolved between species. We developed a robust framework to measure expression context conservation (ECC) and found, by analyzing 4,630 pairs of orthologous Arabidopsis and rice genes, that 77% showed conserved coexpression. Examples of nonconserved ECC categories suggested a link between regulatory evolution and environmental adaptations and included genes involved in signal transduction, response to different abiotic stresses, and hormone stimuli. To identify genomic features that influence expression evolution, we analyzed the relationship between ECC, tissue specificity, and protein evolution. Tissue-specific genes showed higher expression conservation compared with broadly expressed genes but were fast evolving at the protein level. No significant correlation was found between protein and expression evolution, implying that both modes of gene evolution are not strongly coupled in plants. By integration of cis-regulatory elements, many ECC conserved genes were significantly enriched for shared DNA motifs, hinting at the conservation of ancestral regulatory interactions in both model species. Surprisingly, for several tissue-specific genes, patterns of concerted network evolution were observed, unveiling conserved coexpression in the absence of conservation of tissue specificity. These findings demonstrate that orthologs inferred through sequence similarity in many cases do not share similar biological functions and highlight the importance of incorporating expression information when comparing genes across species.  相似文献   

3.
Real-time PCR has become increasingly important in gene expression profiling research, and it is widely agreed that normalized data are required for accurate estimates of messenger RNA (mRNA) expression. With increased gene expression profiling in preclinical research and toxicogenomics, a need for reference genes in the rat has emerged, and the studies in this area have not yet been thoroughly evaluated. The purpose of our study was to evaluate a panel of rat reference genes for variation of gene expression in different tissue types. We selected 48 known target genes based on their putative invariability. The gene expression of all targets was examined in 11 types of rat tissues using TaqMan low density array (LDA) technology. The variability of each gene was assessed using a two-step statistical model. The analysis of mean expression using multiple reference genes was shown to provide accurate and reliable normalized expression data. The least five variable genes from each specific tissue were recommended for future tissue-specific studies. Finally, a subset of investigated rat reference genes showing the least variation is recommended for further evaluation using the LDA platform. Our work should considerably enhance a researcher's ability to simply and efficiently identify appropriate reference genes for given experiments.  相似文献   

4.
5.
6.
Cross-species research in drug development is novel and challenging. A bivariate mixture model utilizing information across two species was proposed to solve the fundamental problem of identifying differentially expressed genes in microarray experiments in order to potentially improve the understanding of translation between preclinical and clinical studies for drug development. The proposed approach models the joint distribution of treatment effects estimated from independent linear models. The mixture model posits up to nine components, four of which include groups in which genes are differentially expressed in both species. A comprehensive simulation to evaluate the model performance and one application on a real world data set, a mouse and human type II diabetes experiment, suggest that the proposed model, though highly structured, can handle various configurations of differential gene expression and is practically useful on identifying differentially expressed genes, especially when the magnitude of differential expression due to different treatment intervention is weak. In the mouse and human application, the proposed mixture model was able to eliminate unimportant genes and identify a list of genes that were differentially expressed in both species and could be potential gene targets for drug development.  相似文献   

7.
The ultimate goal of functional genomics is to define the function of all the genes in the genome of an organism. A large body of information of the biological roles of genes has been accumulated and aggregated in the past decades of research, both from traditional experiments detailing the role of individual genes and proteins, and from newer experimental strategies that aim to characterize gene function on a genomic scale.It is clear that the goal of functional genomics can only be achieved by integrating information and data sources from the variety of these different experiments. Integration of different data is thus an important challenge for bioinformatics.The integration of different data sources often helps to uncover non-obvious relationships between genes, but there are also two further benefits. First, it is likely that whenever information from multiple independent sources agrees, it should be more valid and reliable. Secondly, by looking at the union of multiple sources, one can cover larger parts of the genome. This is obvious for integrating results from multiple single gene or protein experiments, but also necessary for many of the results from genome-wide experiments since they are often confined to certain (although sizable) subsets of the genome.In this paper, we explore an example of such a data integration procedure. We focus on the prediction of membership in protein complexes for individual genes. For this, we recruit six different data sources that include expression profiles, interaction data, essentiality and localization information. Each of these data sources individually contains some weakly predictive information with respect to protein complexes, but we show how this prediction can be improved by combining all of them. Supplementary information is available at http://bioinfo.mbb.yale.edu/integrate/interactions/.Abbreviations: TP: true possitive; TN: true negative; FP: false positive; FN: false negative; Y2H: yeast two-hybrid.  相似文献   

8.
MOTIVATION: Microarray technology allows the monitoring of expression levels for thousands of genes simultaneously. In time-course experiments in which gene expression is monitored over time, we are interested in testing gene expression profiles for different experimental groups. However, no sophisticated analytic methods have yet been proposed to handle time-course experiment data. RESULTS: We propose a statistical test procedure based on the ANOVA model to identify genes that have different gene expression profiles among experimental groups in time-course experiments. Especially, we propose a permutation test which does not require the normality assumption. For this test, we use residuals from the ANOVA model only with time-effects. Using this test, we detect genes that have different gene expression profiles among experimental groups. The proposed model is illustrated using cDNA microarrays of 3840 genes obtained in an experiment to search for changes in gene expression profiles during neuronal differentiation of cortical stem cells.  相似文献   

9.
Fuzzy J-Means and VNS methods for clustering genes from microarray data   总被引:4,自引:0,他引:4  
MOTIVATION: In the interpretation of gene expression data from a group of microarray experiments that include samples from either different patients or conditions, special consideration must be given to the pleiotropic and epistatic roles of genes, as observed in the variation of gene coexpression patterns. Crisp clustering methods assign each gene to one cluster, thereby omitting information about the multiple roles of genes. RESULTS: Here, we present the application of a local search heuristic, Fuzzy J-Means, embedded into the variable neighborhood search metaheuristic for the clustering of microarray gene expression data. We show that for all the datasets studied this algorithm outperforms the standard Fuzzy C-Means heuristic. Different methods for the utilization of cluster membership information in determining gene coregulation are presented. The clustering and data analyses were performed on simulated datasets as well as experimental cDNA microarray data for breast cancer and human blood from the Stanford Microarray Database. AVAILABILITY: The source code of the clustering software (C programming language) is freely available from Nabil.Belacel@nrc-cnrc.gc.ca  相似文献   

10.
Microarrays measure the expression of large numbers of genes simultaneously and can be used to delve into interaction networks involving many genes at a time. However, it is often difficult to decide to what extent knowledge about the expression of genes gleaned in one model organism can be transferred to other species. This can be examined either by measuring the expression of genes of interest under comparable experimental conditions in other species, or by gathering the necessary data from comparable microarray experiments. However, it is essential to know which genes to compare between the organisms. To facilitate comparison of expression data across different species, we have implemented a Web-based software tool that provides information about sequence orthologs across a range of Affymetrix microarray chips. AffyTrees provides a quick and easy way of assigning which probe sets on different Affymetrix chips measure the expression of orthologous genes. Even in cases where gene or genome duplications have complicated the assignment, groups of comparable probe sets can be identified. The phylogenetic trees provide a resource that can be used to improve sequence annotation and detect biases in the sequence complement of Affymetrix chips. Being able to identify sequence orthologs and recognize biases in the sequence complement of chips is necessary for reliable cross-species microarray comparison. As the amount of work required to generate a single phylogeny in a nonautomated manner is considerable, AffyTrees can greatly reduce the workload for scientists interested in large-scale cross-species comparisons.  相似文献   

11.
MOTIVATION: Most computational methodologies for microRNA gene prediction utilize techniques based on sequence conservation and/or structural similarity. In this study we describe a new technique, which is applicable across several species, for predicting miRNA genes. This technique is based on machine learning, using the Naive Bayes classifier. It automatically generates a model from the training data, which consists of sequence and structure information of known miRNAs from a variety of species. RESULTS: Our study shows that the application of machine learning techniques, along with the integration of data from multiple species is a useful and general approach for miRNA gene prediction. Based on our experiments, we believe that this new technique is applicable to an extensive range of eukaryotes' genomes. Specific structure and sequence features are first used to identify miRNAs followed by a comparative analysis to decrease the number of false positives (FPs). The resulting algorithm exhibits higher specificity and similar sensitivity compared to currently used algorithms that rely on conserved genomic regions to decrease the rate of FPs.  相似文献   

12.
Porous species boundaries are characterized by differential gene flow, where some regions of the genome experience divergent evolution while others experience the homogenizing effects of gene flow. If species can arise or remain distinct despite gene flow between them, speciation can only be understood on a gene by gene level. To understand the genetics of speciation, we therefore must identify the targets of selection that cause divergent evolution and identify the genetic architecture underlying such “speciation phenotypes”. This will enable characterization of genomic regions that are “free to flow” between species, and those that diverge in the face of gene flow. We discuss this problem in the genus Laupala, a morphologically cryptic, flightless group of crickets that has radiated in Hawaii. Because songs are used in courtship and always distinguish close relatives of Laupala as well as species in sympatry, we argue that songs in Laupala are speciation phenotypes. Here, we present our approaches to identify the underlying genomic regions and song genes that differentiate closely related species. We discuss what is known about the genetic basis of this species difference derived from classic quantitative genetics and quantitative trait locus mapping experiments. We also present a model of the molecular expression of cricket song to assist in our goal to identify the genes involved in song variation. As most species are sympatric and exchange genes with congeners, we discuss the importance of understanding the genetic and genomic architecture of song as a speciation phenotype that must be characterized to identify differential patterns of gene flow at porous species boundaries.  相似文献   

13.
MOTIVATION: The expression of genes during the cell division process has now been studied in many different species. An important goal of these studies is to identify the set of cycling genes. To date, this was done independently for each of the species studied. Due to noise and other data analysis problems, accurately deriving a set of cycling genes from expression data is a hard problem. This is especially true for some of the multicellular organisms, including humans. RESULTS: Here we present the first algorithm that combines microarray expression data from multiple species for identifying cycling genes. Our algorithm represents genes from multiple species as nodes in a graph. Edges between genes represent sequence similarity. Starting with the measured expression values for each species we use Belief Propagation to determine a posterior score for genes. This posterior is used to determine a new set of cycling genes for each species. We applied our algorithm to improve the identification of the set of cell cycle genes in budding yeast and humans. As we show, by incorporating sequence similarity information we were able to obtain a more accurate set of genes compared to methods that rely on expression data alone. Our method was especially successful for the human dataset indicating that it can use a high quality dataset from one species to overcome noise problems in another. AVAILABILITY: C implementation is available from the supporting website: http://www.cs.cmu.edu/~lyongu/pub/cellcycle/.  相似文献   

14.
15.
Full genome data sets are currently being explored on a regular basis to infer phylogenetic trees, but there are often discordances among the trees produced by different genes. An important goal in phylogenomics is to identify which individual gene and species produce the same phylogenetic tree and are thus likely to share the same evolutionary history. On the other hand, it is also essential to identify which genes and species produce discordant topologies and therefore evolve in a different way or represent noise in the data. The latter are outlier genes or species and they can provide a wealth of information on potentially interesting biological processes, such as incomplete lineage sorting, hybridization, and horizontal gene transfers. Here, we propose a new method to explore the genomic tree space and detect outlier genes and species based on multiple co-inertia analysis (MCOA), which efficiently captures and compares the similarities in the phylogenetic topologies produced by individual genes. Our method allows the rapid identification of outlier genes and species by extracting the similarities and discrepancies, in terms of the pairwise distances, between all the species in all the trees, simultaneously. This is achieved by using MCOA, which finds successive decomposition axes from individual ordinations (i.e., derived from distance matrices) that maximize a covariance function. The method is freely available as a set of R functions. The source code and tutorial can be found online at http://phylomcoa.cgenomics.org.  相似文献   

16.
17.
GENEVESTIGATOR. Arabidopsis microarray database and analysis toolbox   总被引:26,自引:0,他引:26  
High-throughput gene expression analysis has become a frequent and powerful research tool in biology. At present, however, few software applications have been developed for biologists to query large microarray gene expression databases using a Web-browser interface. We present GENEVESTIGATOR, a database and Web-browser data mining interface for Affymetrix GeneChip data. Users can query the database to retrieve the expression patterns of individual genes throughout chosen environmental conditions, growth stages, or organs. Reversely, mining tools allow users to identify genes specifically expressed during selected stresses, growth stages, or in particular organs. Using GENEVESTIGATOR, the gene expression profiles of more than 22,000 Arabidopsis genes can be obtained, including those of 10,600 currently uncharacterized genes. The objective of this software application is to direct gene functional discovery and design of new experiments by providing plant biologists with contextual information on the expression of genes. The database and analysis toolbox is available as a community resource at https://www.genevestigator.ethz.ch.  相似文献   

18.
19.
20.
Microarray analysis of pathogens and their interaction with hosts   总被引:7,自引:2,他引:5  
Microarrays are a promising technique for elucidating and interpreting the mechanistic roles of genes in the pathogenesis of infectious disease. Microarrays have been used to analyse the genetic polymorphisms of specific loci associated with resistance to antimicrobial agents, to explore the distribution of genes among isolates from the same and similar species, to understand the evolutionary relationship between closely related species and to integrate the clinical and genomic data. This technique has also been used to study host–pathogen interactions, mainly by identifying genes from pathogens that may be involved in pathogenicity and by surveying the scope of the host response to infection. The RNA expression profile of pathogens has been used to identify regulatory mechanisms that ensure gene expression in the appropriate environment, to hypothesize functions of hundreds of uncharacterized genes and to identify virulence genes that promote colonization or tissue damage. This information also has the potential to identify targets for drug design. Furthermore, microarrays have been used to investigate the mechanism of drug action and to delineate and predict adverse effects of new drugs. In this paper, we review the use of spotted and high-density oligonucleotide arrays to study the genetic polymorphisms of pathogens, host–pathogen interactions and whole-genome expression profiles of pathogens, as well as their use for drug discovery.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号