Similar literature
Found 20 similar articles (search time: 62 ms)
1.
2.
Recent large-scale studies of evolutionary changes in gene expression among mammalian species have led to the proposal that gene expression divergence may be neutral with respect to organismic fitness. Here, we employ a comparative analysis of mammalian gene sequence divergence and gene expression divergence to test the hypothesis that the evolution of gene expression is predominantly neutral. Two models of neutral gene expression evolution are considered: (1) purely neutral evolution (i.e., no selective constraint) of gene expression levels and patterns and (2) neutral evolution accompanied by selective constraint. With respect to purely neutral evolution, levels of change in gene expression between human-mouse orthologs are correlated with levels of gene sequence divergence that are determined largely by purifying selection. In contrast, evolutionary changes of tissue-specific gene expression profiles do not show such a correlation with sequence divergence. However, divergence of both gene expression levels and profiles is significantly lower for orthologous human-mouse gene pairs than for pairs of randomly chosen human and mouse genes. These data clearly point to the action of selective constraint on gene expression divergence and are inconsistent with the purely neutral model; however, there is likely to be a neutral component in evolution of gene expression, particularly in tissues where the expression of a given gene is low and functionally irrelevant. The model of neutral evolution with selective constraint predicts a regular, clock-like accumulation of gene expression divergence. However, relative rate tests of the divergence among human-mouse-rat orthologous gene sets reveal clock-like evolution for gene sequence divergence, and to a lesser extent for gene expression level divergence, but not for the divergence of tissue-specific gene expression profiles. Taken together, these results indicate that gene expression divergence is subject to the effects of purifying selective constraint and suggest that it might also be substantially influenced by positive Darwinian selection.

3.
Although sequences containing regulatory elements located close to protein-coding genes are often only weakly conserved during evolution, comparisons of rodent genomes have implied that these sequences are subject to some selective constraints. Evolutionary conservation is particularly apparent upstream of coding sequences and in first introns, regions that are enriched for regulatory elements. By comparing the human and chimpanzee genomes, we show here that there is almost no evidence for conservation in these regions in hominids. Furthermore, we show that gene expression is diverging more rapidly in hominids than in murids per unit of neutral sequence divergence. By combining data on polymorphism levels in human noncoding DNA and the corresponding human–chimpanzee divergence, we show that the proportion of adaptive substitutions in these regions in hominids is very low. It therefore seems likely that the lack of conservation and increased rate of gene expression divergence are caused by a reduction in the effectiveness of natural selection against deleterious mutations because of the low effective population sizes of hominids. This has resulted in the accumulation of a large number of deleterious mutations in sequences containing gene control elements and hence a widespread degradation of the genome during the evolution of humans and chimpanzees.

4.
He C, Li Z, Chen P, Huang H, Hurst LD, Chen J. Nucleic Acids Research, 2012, 40(9): 4002-4012
MicroRNAs (miRNAs) have emerged as key regulators of gene expression. Intragenic miRNAs account for ~50% of mammalian miRNAs. Classic studies reported that they are usually coexpressed with host genes. Here, using genome-wide miRNA and gene expression profiles from five sample sets, we show that evolutionarily conserved ('old') intragenic miRNAs tend to be coexpressed with host genes, but non-conserved ('young') ones rarely do so. This result is robust: in all sample sets, the coexpression rate of young miRNAs is significantly lower than that of conserved ones even after controlling for abundance. As a result, although young miRNAs dominate in the human genome, the majority of intragenic miRNAs that show coexpression with host genes are phylogenetically old ones. For younger miRNAs, extrapolation of their expression profiles from those of their host genes should be treated with caution. We propose a model to explain this phenomenon in which the majority of young miRNAs are unlikely to be coexpressed with host genes; however, for some fraction of young miRNAs coexpression with their host genes, initially imbued by chromatin level effects, is advantageous and these are the ones likely to embed into the system and evolve ever higher levels of coexpression, possibly by evolving piggybacking mechanisms.

5.
Retroviral promoters in the human genome (total citations: 1; self-citations: 0; citations by others: 1)

6.
7.
Changes in genetic regulation contribute to adaptations in natural populations and influence susceptibility to human diseases. Despite their potential phenotypic importance, the selective pressures acting on regulatory processes in general and gene expression levels in particular are largely unknown. Studies in model organisms suggest that the expression levels of most genes evolve under stabilizing selection, although a few are consistent with adaptive evolution. However, it has been proposed that gene expression levels in primates evolve largely in the absence of selective constraints. In this article, we discuss the microarray-based observations that led to these disparate interpretations. We conclude that in both primates and model organisms, stabilizing selection is likely to be the dominant mode of gene expression evolution. An important implication is that mutations affecting gene expression will often be deleterious and might underlie many human diseases.

8.
Most research concerning the evolution of introns has largely considered introns within coding sequences (CDSs), without regard for introns located within untranslated regions (UTRs) of genes. Here, we directly determined intron size, abundance, and distribution in UTRs of genes using full-length cDNA libraries and complete genome sequences for four species, Arabidopsis thaliana, Drosophila melanogaster, human, and mouse. Overall intron occupancy (introns/exon kbp) is lower in 5' UTRs than in CDSs, but intron density (intron occupancy in regions containing introns) tends to be higher in 5' UTRs than in CDSs. Introns in 5' UTRs are roughly twice as large as introns in CDSs, and there is a sharp drop in intron size at the 5' UTR-CDS boundary. We propose a mechanistic explanation for the existence of selection for larger intron size in 5' UTRs, and outline several implications of this hypothesis. We found introns to be randomly distributed within 5' UTRs, so long as a minimum required exon size was assumed. Introns in 3' UTRs were much less abundant than in 5' UTRs. Though this was expected for human and mouse, which have intron-dependent nonsense-mediated decay (NMD) pathways that discourage the presence of introns within the 3' UTR, it was also true for A. thaliana and D. melanogaster, which may lack intron-dependent NMD. Our findings have several implications for theories of intron evolution and genome evolution in general.
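
The two measures defined in this abstract, intron occupancy (introns per exonic kbp) and intron density (occupancy restricted to regions that contain at least one intron), can be illustrated with a minimal sketch. The `Region` record, its field names, and the example numbers are hypothetical and are not taken from the paper; the code only mirrors the parenthetical definitions given in the abstract.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass
class Region:
    """One annotated region, e.g. the 5' UTR or CDS of a single gene (hypothetical record)."""
    exon_bp: int    # total exonic length of the region, in base pairs
    n_introns: int  # number of introns interrupting the region


def intron_occupancy(regions: Iterable[Region]) -> float:
    """Introns per exonic kbp, pooled over all regions."""
    regions = list(regions)
    total_kbp = sum(r.exon_bp for r in regions) / 1000
    if total_kbp == 0:
        raise ValueError("no exonic sequence to normalize by")
    return sum(r.n_introns for r in regions) / total_kbp


def intron_density(regions: Iterable[Region]) -> float:
    """Intron occupancy computed only over regions that contain introns."""
    with_introns: List[Region] = [r for r in regions if r.n_introns > 0]
    return intron_occupancy(with_introns)


if __name__ == "__main__":
    # Made-up example: three 5' UTRs, one of them intronless.
    utrs = [Region(500, 1), Region(1500, 0), Region(1000, 3)]
    print(intron_occupancy(utrs), intron_density(utrs))
```

Because intronless regions add exonic length but no introns, density is never lower than occupancy for the same set of regions, which is how a region class can have lower occupancy yet higher density than another.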

9.
Ficklin SP, Feltus FA. Plant Physiology, 2011, 156(3): 1244-1256
One major objective for plant biology is the discovery of molecular subsystems underlying complex traits. The use of genetic and genomic resources combined in a systems genetics approach offers a means for approaching this goal. This study describes a maize (Zea mays) gene coexpression network built from publicly available expression arrays. The maize network consisted of 2,071 loci that were divided into 34 distinct modules that contained 1,928 enriched functional annotation terms and 35 cofunctional gene clusters. Of note, 391 maize genes of unknown function were found to be coexpressed within modules along with genes of known function. A global network alignment was made between this maize network and a previously described rice (Oryza sativa) coexpression network. The IsoRankN tool was used, which incorporates both gene homology and network topology for the alignment. A total of 1,173 aligned loci were detected between the two grass networks, which condensed into 154 conserved subgraphs that preserved 4,758 coexpression edges in rice and 6,105 coexpression edges in maize. This study provides an early view into maize coexpression space and provides an initial network-based framework for the translation of functional genomic and genetic information between these two vital agricultural species.

10.
11.
H Liu, J Yin, M Xiao, C Gao, AS Mason, Z Zhao, Y Liu, J Li, D Fu. Gene, 2012, 507(2): 106-111
Untranslated regions (UTRs) in eukaryotes play a significant role in the regulation of translation and mRNA half-life, as well as interacting with specific RNA-binding proteins. However, UTRs receive less attention than more crucial elements such as genes, and the basic structural and evolutionary characteristics of UTRs of different species, and the relationship between these UTRs and the genome size and species gene number, are not well understood. To address these questions, we performed a comparative analysis of 5' and 3' untranslated regions of different species by analyzing the basic characteristics of 244,976 UTRs from three eukaryote kingdoms (Plantae, Fungi, and Protista). The results showed that the UTR lengths and SSR frequencies in UTRs increased significantly with increasing species gene number while the length and G+C content in 5' UTRs and different types of repetitive sequences in 3' UTRs increased with the increase of genome size. We also found that the sequence length of 5' UTRs was significantly positively correlated with the presence of transposons and SSRs while the sequence length of 3' UTRs was significantly positively correlated with the presence of tandem repeat sequences. These results suggested that evolution of species complexity from lower organisms to higher organisms is accompanied by an increase in the regulatory complexity of UTRs, mediated by increasing UTR length, increasing G+C content of 5' UTRs, and insertion and expansion of repetitive sequences.

12.
13.
Learning about the roles that duplicate genes play in the origins of novel phenotypes requires an understanding of how their functions evolve. A previous method for achieving this goal, CDROM, employs gene expression distances as proxies for functional divergence and then classifies the evolutionary mechanisms retaining duplicate genes from comparisons of these distances in a decision tree framework. However, CDROM does not account for stochastic shifts in gene expression or leverage advances in contemporary statistical learning for performing classification, nor is it capable of predicting the parameters driving duplicate gene evolution. Thus, here we develop CLOUD, a multi-layer neural network built on a model of gene expression evolution that can both classify duplicate gene retention mechanisms and predict their underlying evolutionary parameters. We show that not only is the CLOUD classifier substantially more powerful and accurate than CDROM, but that it also yields accurate parameter predictions, enabling a better understanding of the specific forces driving the evolution and long-term retention of duplicate genes. Further, application of the CLOUD classifier and predictor to empirical data from Drosophila recapitulates many previous findings about gene duplication in this lineage, showing that new functions often emerge rapidly and asymmetrically in younger duplicate gene copies, and that functional divergence is driven by strong natural selection. Hence, CLOUD represents a major advancement in classifying retention mechanisms and predicting evolutionary parameters of duplicate genes, thereby highlighting the utility of incorporating sophisticated statistical learning techniques to address long-standing questions about evolution after gene duplication.

14.
15.
16.
N-glycosylation is one of the most important forms of protein modification, serving key biological functions in multicellular organisms. N-glycans at the cell surface mediate the interaction between cells and the surrounding matrix and may act as pathogen receptors, making the genes responsible for their synthesis good candidates to show signatures of adaptation to different pathogen environments. Here, we study the forces that shaped the evolution of the genes involved in the synthesis of the N-glycans during the divergence of primates within the framework of their functional network. We have found that, despite their function of producing glycan repertoires capable of evading rapidly evolving pathogens, genes involved in the synthesis of the glycans are highly conserved, and no signals of positive selection have been detected within the time of divergence of primates. This suggests strong functional constraints as the main force driving their evolution. We studied the strength of the purifying selection acting on the genes in relation to the network structure considering the position of each gene along the pathway, its connectivity, and the rates of evolution in neighboring genes. We found a strong and highly significant negative correlation between the strength of purifying selection and the connectivity of each gene, indicating that genes encoding highly connected enzymes evolve more slowly and thus are subject to stronger selective constraints. This result confirms that network topology does shape the evolution of the genes and that the connectivity within metabolic pathways and networks plays a major role in constraining evolutionary rates.

17.
18.
Rapid rates of evolution can signify either a lack of selective constraint and the consequent accumulation of neutral alleles, or positive Darwinian selection driving the fixation of advantageous alleles. Based on a comparison of 1,350 orthologous gene pairs from human and mouse, we show that the evolution of gene expression profiles is so rapid that it is comparable to that of paralogous gene pairs or randomly paired genes. The expression divergence in the entire set of orthologous pairs neither strongly correlates with sequence divergence, nor focuses in any particular tissue. Moreover, comparing tissue expressions across the orthologous gene pairs, we observe that any human tissue is more similar to any other human tissue examined than to its corresponding mouse tissue. Collectively, these results indicate that, while some differences in expression profiles may be due to adaptive evolution, the levels of divergence are mostly compatible with a neutral mode of evolution, in which a mutation for ectopic expression may rise to fixation by random drift without significantly affecting the fitness. A disturbing corollary of these findings is that knowledge of where the gene is expressed may not carry information about its function.

19.
An important insight from comparative genomic analysis is that sequence conservation beyond neutral expectations is frequently found outside protein-coding regions, indicating important functional roles of noncoding DNA. Understanding the causes of constraint on noncoding sequence evolution forms an important area of research, not least in light of the importance for understanding the evolution of gene expression. We aligned all orthologous genes of chicken and zebra finch together with 5 kb of their upstream and downstream noncoding sequences, to study the evolution of gene flanking sequences in the avian genome. Using ancestral repeats as a neutral reference, we detected significant evolutionary constraint in the 3' flanking region, highest directly after termination (60%) and then gradually decreasing to about 20% 5 kb downstream. Constraint was higher in annotated 3' untranslated regions (UTRs) than in non-UTRs at the same distance from the stop codon and higher in sequences annotated as microRNA (miRNA)-binding sites than in non-miRNA-binding sites within 3' UTRs. Constraint was also higher when estimated for a smaller data set of genes from more closely related songbird species, indicating turnover of functional elements during avian evolution. On the 5' flanking side constraint was readily seen within the first 125 bp immediately upstream of the start codon (34%) and was about 10% for remaining sequence within 5 kb upstream. Analysis of chicken polymorphism data gave further support for the highest constraint directly before and after the translated region. Finally, we found that genes evolving under the highest constraint measured by d(N)/d(S) also had the highest level of constraint in the 3' flanking region. This study broadens the insights into gene flanking sequence evolution by adding new findings from a vertebrate lineage other than mammals.
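
The abstract reports constraint as a percentage relative to a neutral reference (ancestral repeats) but does not state the estimator. A common convention, assumed here and not confirmed by the abstract, is the fractional deficit of substitutions relative to the neutral rate, C = 1 - (observed rate / neutral rate). The function name and the rates in the example are hypothetical.

```python
def constraint(observed_rate: float, neutral_rate: float) -> float:
    """Fraction of substitutions missing relative to a neutral reference.

    Assumed definition (not taken from the abstract): C = 1 - observed / neutral.
    C = 0 means the region evolves at the neutral rate; C = 1 means no
    substitutions at all; negative values mean faster-than-neutral evolution.
    """
    if neutral_rate <= 0:
        raise ValueError("neutral_rate must be positive")
    if observed_rate < 0:
        raise ValueError("observed_rate must be non-negative")
    return 1.0 - observed_rate / neutral_rate


if __name__ == "__main__":
    # Made-up rates: 0.04 substitutions/site in a flanking window versus
    # 0.10 substitutions/site in the neutral reference gives roughly 60% constraint.
    print(f"{constraint(0.04, 0.10):.0%}")
```

Under this assumed definition, a window evolving at 40% of the neutral rate would be described as showing about 60% constraint, the scale on which the figures in the abstract are quoted.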

20.
One of the striking observations from recent whole-genome comparisons is that changes in the number of specialized genes in existing gene families, as opposed to novel taxon-specific gene families, are responsible for the majority of the difference in genome composition between major taxa. Previous models of duplicate gene evolution focused primarily on the role that neutral processes can play in evolutionary divergence after the duplicates are already fixed in the population. By instead including the entire cycle of duplication and divergence, we show that specialized functions are most likely to evolve through strong selection acting on segregating alleles at a single locus, even before the duplicate arises. We show that the fitness relationships that allow divergent alleles to evolve at a single locus largely overlap with the conditions that allow divergence of previously duplicated genes. Thus, a solution to the paradox of the origin of organismal complexity via the expansion of gene families exists in the form of the deterministic spread of novel duplicates via natural selection.
