首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Genome analysis shows that large-scale gene duplications have occurred in fungi, animals and plants, creating genomic regions that show similarity in gene content and order. However, the high frequency of gene loss reduces colinearity resulting in duplicated regions that, in the extreme, no longer share homologous genes. Here, we show that by comparison with an appropriate second genome, such paralogous regions can still be identified.  相似文献   

2.
3.
The important role of Hox genes in determining the regionalization of the body plan of the vertebrates makes them invaluable candidates for evolutionary analyses regarding functional and morphological innovation. Gene duplication and gene loss led to a variable number of Hox genes in different vertebrate lineages. The evolutionary forces determining the conservation or loss of Hox genes are poorly understood. In this study, we show that variable selective pressures acted on Hox7 genes in different evolutionary lineages, with episodes of positive selection occurring after gene duplications. Tests for functional divergence in paralogs detected significant differentiation in a region known to modulate HOX7 protein activity. Our results show that both positive and negative selection on coding regions are influencing Hox7 genes evolution.  相似文献   

4.
Partial and complete genome duplications occurred during evolution and resulted in the creation of new genes and gene families. We identified a novel and intricate human gene family located primarily in regions of segmental duplications on human chromosome 1. We named it NBPF, for neuroblastoma breakpoint family, because one of its members is disrupted by a chromosomal translocation in a neuroblastoma patient. The NBPF genes have a repetitive structure with high intragenic and intergenic sequence similarity in both coding and noncoding regions. These similarities might expose these genomic regions to illegitimate recombination, resulting in structural variation in the NBPF genes. The encoded proteins contain a highly conserved domain of unknown function, which we have named the NBPF repeat. In silico analysis combined with the isolation of multiple full-length cDNA clones showed that several members of this gene family are abundantly expressed in a large variety of tissues and cell lines. Strikingly, no discernable orthologues could be identified in the completed genomes of fruit fly, nematode, mouse, or rat, but sequences with low homology could be isolated from the draft canine and bovine genomes. Interestingly, this gene family shows primate-specific duplications that result in species-specific arrays of NBPF homologous sequences. Overall, this novel NBPF family reflects the continuous evolution of primate genomes that resulted in large physiological differences, and its potential role in this process is discussed.  相似文献   

5.
6.
Owing to duplication events in its progenitor, more than 90% of the genes in the Arabidopsis thaliana genome are members of multigene families. A set of 2108 gene families, each consisting of precisely two unlinked paralogous genes, was identified in the nuclear genome of A. thaliana on the basis of sequence similarity. A systematic method for the creation of double knock‐out lines for such gene pairs, designated as DUPLO lines, was established and 200 lines are now publicly available. Their initial phenotypic characterisation led to the identification of seven lines with defects that emerge only in the adult stage. A further six lines display seedling lethality and 23 lines were lethal before germination. Another 14 lines are known to show phenotypes under non‐standard conditions or at the molecular level. Knock‐out of gene pairs with very similar coding sequences or expression profiles is more likely to produce a mutant phenotype than inactivation of gene pairs with dissimilar profiles or sequences. High coding sequence similarity and highly similar expression profiles are only weakly correlated, implying that promoter and coding regions of these gene pairs display different degrees of diversification.  相似文献   

7.
Identification of conserved genomic regions within and between different genomes is crucial when studying genome evolution. Here, we described regions of strong synteny conservation between vertebrate deuterostomes (tetrapods and teleosts) and invertebrate deuterostomes (amphioxus and sea urchin). The shared gene contents across phylogenetically distant species demonstrate that the conservation of the regions stemmed from an ancestral segment instead of a series of independent convergent events. Comparison of the syntenic regions allows us to postulate the primitive gene organization in the last common ancestor of deuterostomes and the evolutionary events that occurred to the 3 distinct lineages of sea urchin, amphioxus, and vertebrates after their separation. In addition, alignment of the syntenic regions led to the identification of 8 noncoding evolutionarily conserved regions shared between amphioxus and vertebrates. To our knowledge, this is the first report of conserved noncoding sequences shared by vertebrates and nonvertebrates. These noncoding sequences have high possibility of being elements that regulate neighboring genes. They are likely to be a factor in the maintenance of conserved synteny over long phylogenetic distance in different deuterostome lineages.  相似文献   

8.
Abstract Analyses of whole-genome data often reveal that some genes have evolutionary histories that diverge from the majority phylogeny estimated for the entire genome. We present a probabilistic model that deals with heterogeneity among gene trees, implement it via the Gibbs sampler, and apply it to the plastid genome. Plastids and their genomes are transmitted as a single block without recombination, hence homogeneity among gene trees within this genome is expected. Nevertheless, previous work has revealed clear heterogeneity among plastid genes (e.g., Delwiche and Palmer 1996). Other studies, using whole plastid genomes of various algae and land plants, found little additional heterogeneity (Martin et al. 1998; Adachi et al. 2000). We augment the earlier studies by using a data set of 14 taxa: 6 land plants, 2 green algae, a diatom, 2 red algae and a cryptophyte, the cyanelle of the glaucocystophyte Cyanophora, and the blue–green alga Synechocystis as an outgroup. Contrary to the earlier analyses, we cannot find even a single, dominant consensus tree. Therefore, we formulate a probabilistic model that divides the genes into two sets: those that follow the consensus tree and those that have independent gene trees. No particular tree is supported by more than three-fourths of the genes. But the set of genes that follows a certain tree is fairly independent of data processing and the method of analysis. With one possible exception, we find no evidence for collinear or functionally related genes to follow similar trees. The phylogenetic pattern also seems independent of bias in amino acid composition. Among possible explanations for the observed phenomenon, the hypothesis that different genes have different covarion structures is difficult to assess. But gene duplication may be possible through the inverted or direct repeat regions, while horizontal gene transfer seems less likely. In contrast to green algae and land plants, inverted repeat regions in red algae and in Cyanophora show abundant differences among the copies. Thus, genes may get duplicated when they are recruited into the inverted repeat region and one of the two copies may be lost after leaving the inverted repeat region.  相似文献   

9.
To understand how changes in gene regulatory networks lead to novel morphologies, we have analysed the evolution of a key target gene, RAD, controlling floral asymmetry. In Antirrhinum, flower asymmetry depends on activation of RAD in dorsal regions of the floral meristem by the upstream regulators CYC and DICH. We show that Arabidopsis, a species with radially symmetric flowers, contains six RAD-like genes, reflecting at least three duplications since the divergence of Antirrhinum and Arabidopsis. Unlike the situation in Antirrhinum, none of the Arabidopsis RAD-like genes are activated in dorsal regions of the flower meristem. Rather, the RAD-like genes are expressed in distinctive domains along radial or ab-adaxial axes, consistent with a range of developmental roles. Introduction of a RAD genomic clone from Antirrhinum into Arabidopsis leads to a novel expression pattern that is distinct from the expression pattern of RAD in Antirrhinum and from the endogenous RAD-like genes of Arabidopsis. Nevertheless, RAD is able to influence developmental targets in Arabidopsis, as ectopic expression of RAD has developmental effects in this species. Taken together, our results suggest that duplication and divergence of RAD-like genes has involved a range of cis- and trans-regulatory changes. It is possible that such changes led to the coupling of RAD to CYC regulation in the Antirrhinum lineage and hence the co-option of RAD had a role in the generation of flower dorsoventral asymmetry.  相似文献   

10.
Gene duplication plays important roles in organismal evolution, because duplicate genes provide raw materials for the evolution of mechanisms controlling physiological and/or morphological novelties. Gene duplication can occur via several mechanisms, including segmental duplication, tandem duplication and retroposition. Although segmental and tandem duplications have been found to be important for the expansion of a number of multigene families, the contribution of retroposition is not clear. Here we show that plant SKP1 genes have evolved by multiple duplication events from a single ancestral copy in the most recent common ancestor (MRCA) of eudicots and monocots, resulting in 19 ASK (Arabidopsis SKP1-like) and 28 OSK (Oryza SKP1-like) genes. The estimated birth rates are more than ten times the average rate of gene duplication, and are even higher than that of other rapidly duplicating plant genes, such as type I MADS box genes, R genes, and genes encoding receptor-like kinases. Further analyses suggest that a relatively large proportion of the duplication events may be explained by tandem duplication, but few, if any, are likely to be due to segmental duplication. In addition, by mapping the gain/loss of a specific intron on gene phylogenies, and by searching for the features that characterize retrogenes/retrosequences, we show that retroposition is an important mechanism for expansion of the plant SKP1 gene family. Specifically, we propose that two and three ancient retroposition events occurred in lineages leading to Arabidopsis and rice, respectively, followed by repeated tandem duplications and chromosome rearrangements. Our study represents a thorough investigation showing that retroposition can play an important role in the evolution of a plant gene family whose members do not encode mobile elements.  相似文献   

11.
12.
Genes encoding wheat prolamins belong to complicated multi-gene families in the wheat genome. To understand the structural complexity of storage protein loci, we sequenced and analyzed orthologous regions containing both gliadin and LMW-glutenin genes from the A and B genomes of a tetraploid wheat species, Triticum turgidum ssp. durum. Despite their physical proximity to one another, the gliadin genes and LMW-glutenin genes are organized quite differently. The gliadin genes are found to be more clustered than the LMW-glutenin genes which are separated from each other by much larger distances. The separation of the LMW-glutenin genes is the result of both the insertion of large blocks of repetitive DNA owing to the rapid amplification of retrotransposons and the presence of genetic loci interspersed between them. Sequence comparisons of the orthologous regions reveal that gene movement could be one of the major factors contributing to the violation of microcolinearity between the homoeologous A and B genomes in wheat. The rapid sequence rearrangements and differential insertion of repetitive DNA has caused the gene islands to be not conserved in compared regions. In addition, we demonstrated that the i-type LMW-glutenin originated from a deletion of 33-bps in the 5′ coding region of the m-type gene. Our results show that multiple rounds of segmental duplication of prolamin genes have driven the amplification of the ω-gliadin genes in the region; such segmental duplication could greatly increase the repetitive DNA content in the genome depending on the amount of repetitive DNA present in the original duplicate region. Electronic supplementary material The online version of this article (doi:) contains supplementary material, which is available to authorized users. Shuangcheng Gao and Yong Qiang Gu contributed equally to the work.  相似文献   

13.
CYCLOIDEA (CYC) and DICHOTOMA (DICH) are paralogous genes that determine adaxial (dorsal) flower identity in the bilaterally symmetric flowers of Antirrhinum majus (snapdragon). We show here that the duplication leading to the existence of both CYC and DICH in Antirrhinum occurred before the radiation of the Antirrhineae (the tribe to which snapdragon belongs). We find no additional gene duplications within Antirrhineae. Using explicit codon-based models of evolution in a likelihood framework, we show that patterns of molecular evolution after the duplication that gave rise to CYC and DICH are consistent with purifying selection acting at both loci, despite their known functional redundancy in snapdragon. However, for specific gene regions, purifying selection is significantly relaxed across DICH lineages, relative to CYC lineages. In addition, we find evidence for relaxed purifying selection along the lineage leading to snapdragon in one of two putative functional domains of DICH. A model of selection accounting for the persistence of paralogous genes in the absence of diversifying selection is presented. This model takes into account differences in the degree of purifying selection acting at the two loci and is consistent with subfunctionalization models of paralogous gene evolution.  相似文献   

14.

Background

Interlocus gene conversion (IGC) is a recombination-based mechanism that results in the unidirectional transfer of short stretches of sequence between paralogous loci. Although IGC is a well-established mechanism of human disease, the extent to which this mutagenic process has shaped overall patterns of segregating variation in multi-copy regions of the human genome remains unknown. One expected manifestation of IGC in population genomic data is the presence of one-to-one paralogous SNPs that segregate identical alleles.

Results

Here, I use SNP genotype calls from the low-coverage phase 3 release of the 1000 Genomes Project to identify 15,790 parallel, shared SNPs in duplicated regions of the human genome. My approach for identifying these sites accounts for the potential redundancy of short read mapping in multi-copy genomic regions, thereby effectively eliminating false positive SNP calls arising from paralogous sequence variation. I demonstrate that independent mutation events to identical nucleotides at paralogous sites are not a significant source of shared polymorphisms in the human genome, consistent with the interpretation that these sites are the outcome of historical IGC events. These putative signals of IGC are enriched in genomic contexts previously associated with non-allelic homologous recombination, including clear signals in gene families that form tandem intra-chromosomal clusters.

Conclusions

Taken together, my analyses implicate IGC, not point mutation, as the mechanism generating at least 2.7 % of single nucleotide variants in duplicated regions of the human genome.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1681-3) contains supplementary material, which is available to authorized users.  相似文献   

15.
Polyploidy in vertebrate ancestry: Ohno and beyond   总被引:1,自引:0,他引:1  
Over 30 years ago, Susumu Ohno proposed that two rounds of polyploidy occurred early in vertebrate evolution. We re-examine this proposal using three recent lines of evidence. First, total gene number estimates from completely sequenced genomes suggest an increase in total gene number somewhere along the vertebrate or prevertebrate lineage, compatible with Ohno's model. Second, analyses of homeobox and other genes from amphioxus reveal very extensive gene duplication specifically on the vertebrate lineage. This refines the timing of putative polyploidy to after the divergence of amphioxus and vertebrates. Third, the existence of four-fold paralogy regions in the human genome is suggestive of two rounds of polyploidy, although other explanations are possible. We propose an experimental test, based on chromosomal localization of genes in amphioxus, that should resolve whether paralogy regions are indeed remnants of duplication in vertebrate ancestry.  © 2004 The Linnean Society of London, Biological Journal of the Linnean Society , 2004, 82 , 425–430.  相似文献   

16.
A substantial proportion of human genes contain tissue-specifically DNA-methylated regions (TDMRs). However, little is known about the evolutionary conservation of differentially methylated loci, how they evolve, and the signals that regulate them. We have studied TDMR conservation in the PLG and TBX gene families and in 32 pseudogene–parental gene pairs. Among the members of the recently evolved PLG gene family, 5′-UTR methylation is conserved and inversely correlated with the cognate gene expression, indicating as well a conserved regulatory role of DNA methylation. Conversely, many genes of the much older TBX family display complementary tissue-specific methylation, suggesting an epigenetic complementation in the evolution of this gene family. Similar to gene families, unprocessed pseudogenes arose from gene duplications and we found TDMR conservation in some pseudogene–parental gene pairs displaying short evolutionary distances. However, for the majority of unprocessed pseudogenes and for all processed pseudogenes examined, we found that tissue-specific methylation arose de novo after gene duplication.  相似文献   

17.
基因倍增研究进展   总被引:2,自引:0,他引:2  
李鸿健  谭军 《生命科学》2006,18(2):150-154
基因倍增是指DNA片段在基因组中复制出一个或更多的拷贝,这种DNA片段可以是一小段基因组序列、整条染色体,甚至是整个基因组。基因倍增是基因组进化最主要的驱动力之一,是产生具有新功能的基因和进化出新物种的主要原因之一。本文综述了脊椎动物、模式植物和酵母在进化过程中基因倍增研究领域的最新进展,并讨论了基因倍增研究的发展方向。  相似文献   

18.
The aims of the study were to outline the sequence of eventsthat gave rise to the vertebrate insulin-relaxin gene familyand the chromosomal regions in which they reside. We analyzedthe gene content surrounding the human insulin/relaxin geneswith respect to what family they belonged to and if the duplicationhistory of investigated families parallels the evolution ofthe insulin-relaxin family members. Markov Clustering and phylogeneticanalysis were used to determine family identity. More than 15%of the genes belonged to families that have paralogs in theregions, defining two sets of quadruplicate paralogy regions.Thereby, the localization of insulin/relaxin genes in humansis in accordance with those regions on human chromosomes 1,11, 12, 19q (insulin/insulin-like growth factors) and 1, 6p/15q,9/5, 19p (insulin-like factors/relaxins) were formed duringtwo genome duplications. We compared the human genome with thatof Ciona intestinalis, a species that split from the vertebratelineage before the two suggested genome duplications. Two insulin-likeorthologs were discovered in addition to the already describedCi-insulin gene. Conserved synteny between the Ciona regionshosting the insulin-like genes and the two sets of human paralogonsimplies their common origin. Linkage of the two human paralogons,as seen in human chromosome 1, as well as the two regions hostingthe Ciona insulin-like genes suggests that a segmental duplicationgave rise to the region prior to the genome doublings. Thus,preserved gene content provides support that genome duplication(s)in addition to segmental and single-gene duplications shapedthe genomes of extant vertebrates.  相似文献   

19.
Whole‐genome duplications are major evolutionary events with a lasting impact on genome structure. Duplication events complicate genetic analyses as paralogous sequences are difficult to distinguish; consequently, paralogs are often excluded from studies. The effects of an ancient whole‐genome duplication (approximately 88 MYA) are still evident in salmonids through the persistence of numerous paralogous gene sequences and partial tetrasomic inheritance. We use restriction site‐associated DNA sequencing on 10 collections of chum salmon from the Salish Sea in the USA and Canada to investigate genetic diversity and population structure in both tetrasomic and rediploidized regions of the genome. We use a pedigree and high‐density linkage map to identify paralogous loci and to investigate genetic variation across the genome. By applying multivariate statistical methods, we show that it is possible to characterize paralogous loci and that they display similar patterns of population structure as the diploidized portion of the genome. We find genetic associations with the adaptively important trait of run‐timing in both sets of loci. By including paralogous loci in genome scans, we can observe evolutionary signals in genomic regions that have routinely been excluded from population genetic studies in other polyploid‐derived species.  相似文献   

20.
Gene duplication and the accompanying release of negative selective pressure on the duplicate pair is thought to be the key process that makes functional change in the coding and regulatory regions of genomes possible. However, the nature of these changes remains unresolved. There are a number of models for the fate of gene duplicates, the two most prominent of which are neofunctionalisation and subfunctionalisation, but it is still unclear which is the dominant fate. Using a dataset consisting of smaller-scale (tandem and segmental) duplications identified from the genomes of four fully sequenced mammalian genomes, we characterise two key features of smaller-scale duplicate evolution: the rate of pseudogenisation and the rate of accumulation of replacement substitutions in the coding sequence. We show that the best fitting model for gene duplicate survival is a Weibull function with a downward sloping convex hazard function which implies that the rate of pseudogenisation of a gene declines rapidly with time since duplication. Our analysis of the accumulation of replacement substitutions per replacement site shows that they accumulate on average at 64% of the neutral expectation immediately following duplication and as high as 73% in the human lineage. Although this rate declines with time since duplication, it takes several tens of millions of years before it has declined to half its initial value. We show that the properties of the gene death rate and of the accumulation of replacement substitutions are more consistent with neofunctionalisation (or subfunctionalisation followed by neofunctionalisation) than they are with subfunctionalisation alone or any of the other alternative modes of evolution of smaller-scale duplicates. Electronic Supplementary Material The online version of this article (doi: ) contains supplementary material, which is available to authorized users.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号