首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
2.

Background  

A major mechanism for the preservation of gene duplicates in the genome is thought to be mediated via loss or modification of cis-regulatory subfunctions between paralogs following duplication (a process known as regulatory subfunctionalization). Despite a number of gene expression studies that support this mechanism, no comprehensive analysis of regulatory subfunctionalization has been undertaken at the level of the distal cis-regulatory modules involved. We have exploited fish-mammal genomic alignments to identify and compare more than 800 conserved non-coding elements (CNEs) that associate with genes that have undergone fish-specific duplication and retention.  相似文献   

3.
He X  Zhang J 《Current biology : CB》2005,15(11):1016-1021
Eukaryotic genes are on average more complex than prokaryotic genes in terms of expression regulation, protein length, and protein-domain structure [1-5]. Eukaryotes are also known to have a higher rate of gene duplication than prokaryotes do [6, 7]. Because gene duplication is the primary source of new genes [], the average gene complexity in a genome may have been increased by gene duplication if complex genes are preferentially duplicated. Here, we test this "gene complexity and gene duplicability" hypothesis with yeast genomic data. We show that, on average, duplicate genes from either whole-genome or individual-gene duplication have longer protein sequences, more functional domains, and more cis-regulatory motifs than singleton genes. This phenomenon is not a by-product of previously known mechanisms, such as protein function [10-13], evolutionary rate [14, 15], dosage [11], and dosage balance [16], that influence gene duplicability. Rather, it appears to have resulted from the sub-neo-functionalization process in duplicate-gene evolution [11]. Under this process, complex genes are more likely to be retained after duplication because they are prone to subfunctionalization, and gene complexity is regained via subsequent neofunctionalization. Thus, gene duplication increases both gene number and gene complexity, two important factors in the origin of genomic and organismal complexity.  相似文献   

4.
Hu C  Lin SY  Chi WT  Charng YY 《Plant physiology》2012,158(2):747-758
The duplication and divergence of heat stress (HS) response genes might help plants adapt to varied HS conditions, but little is known on the topic. Here, we examined the evolution and function of Arabidopsis (Arabidopsis thaliana) mitochondrial GrpE (Mge) proteins. GrpE acts as a nucleotide-exchange factor in the Hsp70/DnaK chaperone machinery. Genomic data show that AtMge1 and AtMge2 arose from a recent whole-genome duplication event. Phylogenetic analysis indicated that duplication and preservation of Mges occurred independently in many plant species, which suggests a common tendency in the evolution of the genes. Intron retention contributed to the divergence of the protein structure of Mge paralogs in higher plants. In both Arabidopsis and tomato (Solanum lycopersicum), Mge1 is induced by ultraviolet B light and Mge2 is induced by heat, which suggests regulatory divergence of the genes. Consistently, AtMge2 but not AtMge1 is under the control of HsfA1, the master regulator of the HS response. Heterologous expression of AtMge2 but not AtMge1 in the temperature-sensitive Escherichia coli grpE mutant restored its growth at 43°C. Arabidopsis T-DNA knockout lines under different HS regimes revealed that Mge2 is specifically required for tolerating prolonged exposure to moderately high temperature, as compared with the need of the heat shock protein 101 and the HS-associated 32-kD protein for short-term extreme heat. Therefore, with duplication and subfunctionalization, one copy of the Arabidopsis Mge genes became specialized in a distinct type of HS. We provide direct evidence supporting the connection between gene duplication and adaptation to environmental stress.  相似文献   

5.
6.
We used the classic example of the duplicated zebrafish sox11a and -b loci to test the duplication, degeneration, complementation (DDC) model of genome evolution through whole genome duplication. While recent reports have demonstrated sub-partitioning of regulatory sequences in duplicated regions, a comparison of the regulatory capabilities of extant regulatory sequences derived from ancient ancestral elements has been scarce. Consistent with the DDC model, we find that ancestral regulatory elements distributed over several hundred kb were lost in either one or the other zebrafish duplicate, leading to subpartitioning. However, regulatory sequences kept as duplicates near both sox11 co-orthologs diverged in sequence from each other and from human elements and in the regulatory patterns they drive in transgenic zebrafish. Evolutionary analysis of the loci suggested that both zebrafish protein coding sox11 orthologs have been maintained by purifying selection, and have evolved at comparable rates, indicative of non-diverged protein functions. The duplicated regulatory elements, conversely, evolved with different divergence rates and degrees of subfunctionalization. These data show that regulatory evolution of gene expression patterns occurred both through differential loss as well as through regulatory sequence evolution in zebrafish versus human genomes.  相似文献   

7.
Segmental duplication is particularly frequent within plant genomes and the ability of the original single-copy gene to gain a new function for the change of regulatory elements is one of the prominent consequences of duplication. Thus, it is important to study the pattern of conserved non-coding sequence (CNS) between paralogous genes. We report the result of a survey of CNSs among paralogous regions in rice (Oryza sativa L.), as well as the comparison of CNS dataset between rice and Arabidopsis thaliana. Some common properties, such as the change of A + T content near the CNS boundaries and CNS are enriched in regulatory genes, were observed. However, the content of CNSs differs between rice and Arabidopsis, and it is interesting that the rice metabolic network includes both CNS-poor and CNS-rich genes, which indicated a fine-tuned metabolic network presents in rice.  相似文献   

8.
Recent years have witnessed a breathtaking increase in the availability of genome sequence data, providing evidence of the highly duplicate nature of eukaryotic genomes. Plants are exceptional among eukaryotic organisms in that duplicate loci compose a large fraction of their genomes, partly because of the frequent occurrence of polyploidy (or whole-genome duplication) events. Tandem gene duplication and transposition have also contributed to the large number of duplicated genes in plant genomes. Evolutionary analyses allowed the dynamics of duplicate gene evolution to be studied and several models were proposed. It seems that, over time, many duplicated genes were lost and some of those that were retained gained new functions and/or expression patterns (neofunctionalization) or subdivided their functions and/or expression patterns between them (subfunctionalization). Recent studies have provided examples of genes that originated by duplication with successive diversification within plants. In this review, we focused on the TEL (TERMINAL EAR1-like) genes to illustrate such mechanisms. Emerged from the mei2 gene family, these TEL genes are likely to be land plant-specific. Phylogenetic analyses revealed one or two TEL copies per diploid genome. TEL gene degeneration and loss in several Angiosperm species such as in poplar and maize seem to have occurred. In Arabidopsis thaliana, whose genome experienced at least three polyploidy events followed by massive gene loss and genomic reorganization, two TEL genes were retained and two new shorter TEL-like (MCT) genes emerged. Molecular and expression analyses suggest for these genes sub- and neofunctionalization events, but confirmation will come from their functional characterization.  相似文献   

9.
Gene duplication plays an important role in the evolution of diversity and novel function and is especially prevalent in the nuclear genomes of flowering plants. Duplicate genes may be maintained through subfunctionalization and neofunctionalization at the level of expression or coding sequence. In order to test the hypothesis that duplicated regulatory genes will be differentially expressed in a specific manner indicative of regulatory subfunctionalization and/or neofunctionalization, we examined expression pattern shifts in duplicated regulatory genes in Arabidopsis. A two-way analysis of variance was performed on expression data for 280 phylogenetically identified paralogous pairs. Expression data were extracted from global expression profiles for wild-type root, stem, leaf, developing inflorescence, nearly mature flower buds, and seedpod. Gene, organ, and gene by organ interaction (G x O) effects were examined. Results indicate that 85% of the paralogous pairs exhibited a significant G x O effect indicative of regulatory subfunctionalization and/or neofunctionalization. A significant G x O effect was associated with complementary expression patterns in 45% of pairwise comparisons. No association was detected between a G x O effect and a relaxed evolutionary constraint as detected by the ratio of nonsynonymous to synonymous substitutions. Ancestral gene expression patterns inferred across a Type II MADS-box gene phylogeny suggest several cases of regulatory neofunctionalization and organ-specific nonfunctionalization. Complete linkage clustering of gene expression levels across organs suggests that regulatory modules for each organ are independent or ancestral genes had limited expression. We propose a new classification, regulatory hypofunctionalization, for an overall decrease in expression level in one member of a paralogous pair while still having a significant G x O effect. We conclude that expression divergence specifically indicative of subfunctionalization and/or neofunctionalization contributes to the maintenance of most if not all duplicated regulatory genes in Arabidopsis and hypothesize that this results in increasing expression diversity or specificity of regulatory genes after each round of duplication.  相似文献   

10.
Guan Y  Dunham MJ  Troyanskaya OG 《Genetics》2007,175(2):933-943
Gene duplication can occur on two scales: whole-genome duplications (WGD) and smaller-scale duplications (SSD) involving individual genes or genomic segments. Duplication may result in functionally redundant genes or diverge in function through neofunctionalization or subfunctionalization. The effect of duplication scale on functional evolution has not yet been explored, probably due to the lack of global knowledge of protein function and different times of duplication events. To address this question, we used integrated Bayesian analysis of diverse functional genomic data to accurately evaluate the extent of functional similarity and divergence between paralogs on a global scale. We found that paralogs resulting from the whole-genome duplication are more likely to share interaction partners and biological functions than smaller-scale duplicates, independent of sequence similarity. In addition, WGD paralogs show lower frequency of essential genes and higher synthetic lethality rate, but instead diverge more in expression pattern and upstream regulatory region. Thus, our analysis demonstrates that WGD paralogs generally have similar compensatory functions but diverging expression patterns, suggesting a potential of distinct evolutionary scenarios for paralogs that arose through different duplication mechanisms. Furthermore, by identifying these functional disparities between the two types of duplicates, we reconcile previous disputes on the relationship between sequence divergence and expression divergence or essentiality.  相似文献   

11.
Consequences of hoxb1 duplication in teleost fish   总被引:1,自引:0,他引:1  
Vertebrate evolution is characterized by gene and genome duplication events. There is strong evidence that a whole-genome duplication occurred in the lineage leading to the teleost fishes. We have focused on the teleost hoxb1 duplicate genes as a paradigm to investigate the consequences of gene duplication. Previous analysis of the duplicated zebrafish hoxb1 genes suggested they have subfunctionalized. The combined expression pattern of the two zebrafish hoxb1 genes recapitulates the expression pattern of the single Hoxb1 gene of tetrapods, possibly due to degenerative changes in complementary cis-regulatory elements of the duplicates. Here we have tested the hypothesis that all teleost duplicates had a similar fate post duplication, by examining hoxb1 genes in medaka and striped bass. Consistent with this theory, we found that the ancestral Hoxb1 expression pattern is subdivided between duplicate genes in a largely similar fashion in zebrafish, medaka, and striped bass. Further, our analysis of hoxb1 genes reveals that sequence changes in cis-regulatory regions may underlie subfunctionalization in all teleosts, although the specific changes vary between species. It was previously shown that zebrafish hoxb1 duplicates have also evolved different functional capacities. We used misexpression to compare the functions of hoxb1 duplicates from zebrafish, medaka and striped bass. Unexpectedly, we found that some biochemical properties, which were paralog specific in zebrafish, are conserved in both duplicates of other species. This work suggests that the fate of duplicate genes varies across the teleost group.  相似文献   

12.
The discovery of regulatory motifs embedded in upstream regions of plants is a particularly challenging bioinformatics task. Previous studies have shown that motifs in plants are short compared with those found in vertebrates. Furthermore, plant genomes have undergone several diversification mechanisms such as genome duplication events which impact the evolution of regulatory motifs. In this article, a systematic phylogenomic comparison of upstream regions is conducted to further identify features of the plant regulatory genomes, the component of genomes regulating gene expression, to enable future de novo discoveries. The findings highlight differences in upstream region properties between major plant groups and the effects of divergence times and duplication events. First, clear differences in upstream region evolution can be detected between monocots and dicots, thus suggesting that a separation of these groups should be made when searching for novel regulatory motifs, particularly since universal motifs such as the TATA box are rare. Second, investigating the decay rate of significantly aligned regions suggests that a divergence time of ~100 mya sets a limit for reliable conserved non-coding sequence (CNS) detection. Insights presented here will set a framework to help identify embedded motifs of functional relevance by understanding the limits of bioinformatics detection for CNSs.  相似文献   

13.
14.
Complex traits and other polygenic processes require coordinated gene expression. Co-expression networks model mRNA co-expression: the product of gene regulatory networks. To identify regulatory mechanisms underlying coordinated gene expression in a tissue-enriched context, ten Arabidopsis thaliana co-expression networks were constructed after manually sorting 4,566 RNA profiling datasets into aerial, flower, leaf, root, rosette, seedling, seed, shoot, whole plant, and global (all samples combined) groups. Collectively, the ten networks contained 30% of the measurable genes of Arabidopsis and were circumscribed into 5,491 modules. Modules were scrutinized for cis regulatory mechanisms putatively encoded in conserved non-coding sequences (CNSs) previously identified as remnants of a whole genome duplication event. We determined the non-random association of 1,361 unique CNSs to 1,904 co-expression network gene modules. Furthermore, the CNS elements were placed in the context of known gene regulatory networks (GRNs) by connecting 250 CNS motifs with known GRN cis elements. Our results provide support for a regulatory role of some CNS elements and suggest the functional consequences of CNS activation of co-expression in specific gene sets dispersed throughout the genome.  相似文献   

15.
16.
Arabidopsis thaliana is believed to have experienced at least two and possibly three whole-genome duplication events in its evolutionary history. In order to investigate the evolutionary relationships between these duplication events and diversification of disease resistance (R) genes, segmental-duplication events containing R genes belonging to the nucleotide binding-leucine rich repeat (NB-LRR) class were identified. Of 153 segmental-duplication events containing NB-LRR genes, only 22 contained NB-LRR genes in both members of the duplication pair, indicating a high frequency of NB-LRR gene loss after whole-genome duplication. The relative age of the duplication events was estimated based on the average synonymous substitution rate of the duplicated gene pairs in the segments. These data were combined with phylogenetic analyses. NB-LRR genes present in segment pairs derived from the most recent whole-genome duplication event, estimated to have occurred only 20 to 40 million years ago, occupy very distant branches of the NB-LRR phylogenetic tree. These data suggest that when NB-LRR clusters are duplicated as part of a whole-genome duplication, homoeologous NB-LRR genes are preferentially lost, either by eliminating one copy of the cluster or by eliminating individual genes such that only paralogous NB-LRR genes are maintained.  相似文献   

17.
New genes arise through duplication and modification of DNA sequences on a range of scales: single gene duplication, duplication of large chromosomal fragments and whole-genome duplication. Each duplication mechanism has specific characteristics that influence the fate of the resulting duplicates, such as the size of the duplicated fragment, the potential for dosage imbalance, the preservation or disruption of regulatory control and genomic context. The ability to diagnose or identify the mechanism that produced a pair of paralogs has the potential to increase our ability to reconstruct evolutionary history, to understand the processes that govern genome evolution and to make functional predictions based on paralogy. The recent availability of large amounts of whole-genome sequence, often from several closely related species, has stimulated a wealth of new computational methods to diagnose gene duplications.  相似文献   

18.
19.
20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号