Similar literature
 20 similar documents found (search time: 15 ms)
1.
2.
Expansion of genome coding regions by acquisition of new genes   Total citations: 7, self-citations: 0, citations by others: 7
Betrán E  Long M 《Genetica》2002,115(1):65-80
As is the case for non-coding regions, the coding regions of organisms can expand or shrink during evolution. However, the dynamics of coding regions are expected to be more closely correlated with functional complexity and diversity than are the dynamics of non-coding regions. Hence, it is interesting to investigate the increase of diversity in coding regions (the origin and evolution of new genes) because this provides a new component of the genetic variation underlying the diversity of living organisms. Here, we examine what is known about the mechanisms responsible for the increase in gene number. Every mechanism affects genomes in a distinct way and to a different extent, and it appears that certain organisms favor particular mechanisms. The details of some interesting gene acquisitions reveal the extreme dynamism of genomes. Finally, we discuss what is known about the fate of new genes and conclude that many of the acquisitions are likely to have been driven by natural selection; they increase functional complexity, diversity, and/or adaptation of species. Despite this, the correlation between complexity of life and gene number is low, and closely related species (with very similar life histories) can have very different numbers of genes. We call this phenomenon the G-value paradox.

3.
Genes with overlapping expression and function may gradually diverge despite retaining some common functions. To test whether such genes show distinct patterns of molecular evolution within species, we examined sequence variation at the bric à brac (bab) locus of Drosophila melanogaster. This locus is composed of two anciently duplicated paralogs, bab1 and bab2, which are involved in patterning the adult abdomen, legs, and ovaries. We have sequenced the 148 kb genomic region spanning the bab1 and bab2 genes from 94 inbred lines of D. melanogaster sampled from a single location. Two non-coding regions, one in each paralog, appear to be under selection. The strongest evidence of directional selection is found in a region of bab2 that has no known functional role. The other region is located in the bab1 paralog and is known to contain a cis-regulatory element that controls sex-specific abdominal pigmentation. The coding region of bab1 appears to be under stronger functional constraint than the bab2 coding sequences. Thus, the two paralogs are evolving under different selective regimes in the same natural population, illuminating the different evolutionary trajectories of partially redundant duplicate genes.

4.
Comparison of polymorphism at synonymous and non-synonymous sites in protein-coding DNA can provide evidence for selective constraint. Non-coding DNA that forms part of the regulatory landscape presents more of a challenge since there is not such a clear-cut distinction between sites under stronger and weaker selective constraint. Here, we consider putative regulatory elements termed Conserved Non-coding Elements (CNEs) defined by their high level of sequence identity across all vertebrates. Some mutations in these regions have been implicated in developmental disorders; we analyse CNE polymorphism data to investigate whether such deleterious effects are widespread in humans. Single nucleotide variants from the HapMap and 1000 Genomes Projects were mapped across nearly 2000 CNEs. In the 1000 Genomes data we find a significant excess of rare derived alleles in CNEs relative to coding sequences; this pattern is absent in HapMap data, apparently obscured by ascertainment bias. The distribution of polymorphism within CNEs is not uniform; we could identify two categories of sites by exploiting deep vertebrate alignments: stretches that are non-variant, and those that have at least one substitution. The conserved category has fewer polymorphic sites and a greater excess of rare derived alleles, which can be explained by a large proportion of sites under strong purifying selection within humans, higher than that for non-synonymous sites in most protein coding regions, and comparable to that at the strongly conserved trans-dev genes. Conversely, the more evolutionarily labile CNE sites have an allele frequency distribution not significantly different from non-synonymous sites. Future studies should exploit genome-wide re-sequencing to obtain better coverage in selected non-coding regions, given the likelihood that mutations in evolutionarily conserved enhancer sequences are deleterious. Discovery pipelines should validate non-coding variants to aid in identifying causal and risk-enhancing variants in complex disorders, in contrast to the current focus on exome sequencing.

5.
6.
The genus Rickettsia consists of intracellular bacteria that cause a variety of arthropod-vectored human diseases. I have examined the evolutionary processes that are generating variation in antigens that are potential vaccine candidates. The surface proteins rOmpA and rOmpB are subject to intense positive natural selection, causing rapid diversification of their amino acid sequences between species. The positively selected amino acids were mapped and found to cluster together in regions that may indicate the location of functionally important sites such as epitopes. In contrast to the rOmp antigens, there is no evidence of positive selection on the intracytoplasmic antigen PS120 despite low selective constraints on this gene. All three genes showed evidence of recombination between species, and certain sequences are clear chimeras of two parental sequences. However, recombination has been sufficiently infrequent that the phylogenies of the three genes are similar, although not identical. [Reviewing Editor: Dr. Willie J. Swanson]

7.
Since plant mitochondrial genomes exhibit some of the slowest known synonymous substitution rates, it is generally believed that they experience exceptionally low mutation rates. However, the use of synonymous substitution rates to infer mutation rates depends on the implicit assumption that synonymous sites are evolving neutrally (or nearly so). To assess the validity of this assumption in plant mitochondrial genomes, we examined coding sequences for footprints of selection acting at synonymous sites. We found that synonymous sites exhibit an AT-rich and pyrimidine-skewed nucleotide composition compared to both non-synonymous sites and non-coding regions. We also found some evidence for selection associated with both biased codon usage and conservation of regulatory sequences involved in mRNA processing, although some of these findings are subject to alternative non-adaptive interpretations. Regardless, the inferred strength of selection appears too weak to account for the variation in substitution rates between the mitochondrial genomes of plants and other multicellular eukaryotes. Therefore, these results are consistent with the interpretation that plant mitochondrial genomes experience a substantially lower mutation rate rather than increased functional constraints acting on synonymous sites. Nevertheless, there are important nucleotide composition patterns (particularly the differences between synonymous sites and non-coding DNA) that remain largely unexplained.

8.
9.
The 1986 Chernobyl disaster contaminated vast regions of Ukraine and Belarus with a variety of radioactive isotopes and heavy metals. While over 90% of the radioactive isotopes have decayed into stable compounds, radiation levels in contaminated areas are still extraordinarily high. In fact, some rodents living near the reactor have internal 134,137Cs concentrations approaching 80 000 Bq/g. Several recent genetic analyses of vertebrates have illustrated that mutation rates of organisms exposed to radiation from Chernobyl are higher than in control groups, but none have studied DNA sequences. Nucleotide sequences of rodent mitochondrial genes were originally reported to have been hypervariable, but those results were subsequently retracted. Herein, I report the results of a pilot study to determine the extent of nucleotide variation at the p53 gene in four species of rodents (voles) from Chernobyl and from control sites. I sequenced a 788 bp region (coding and non-coding) of p53 in 30 individuals comprising four species of Microtus. Nucleotide variation at the population level was due to deletions and substitutions; both were limited to introns. There were no significant differences between the number of haplotypes in radioactive and control populations (p=0.60). Rare or private alleles might have arisen due to unique mutational pressures at Chernobyl. Alternatively, natural selection might have favored one allele over others (i.e., a selective sweep). Neither scenario is strongly supported by these data. Thus, no apparent genetic effects of the Chernobyl disaster on the p53 gene of resident voles were revealed; more extensive surveys will be necessary to determine if mutation rates are indeed elevated in rodents from Chernobyl. However, two salient points emerge: the first involves the utility of introns as markers for mutations in coding regions, and the second considers the relative merits of cloning in mutation detection studies.

10.
Contrary to the classical view, a large amount of non-coding DNA seems to be selectively constrained in Drosophila and other species. Here, using Drosophila miranda BAC sequences and the Drosophila pseudoobscura genome sequence, we aligned coding and non-coding sequences between D. pseudoobscura and D. miranda, and investigated their patterns of evolution. We found two patterns that have previously been observed in comparisons between Drosophila melanogaster and its relatives. First, there is a negative correlation between intron divergence and intron length, suggesting that longer non-coding sequences may contain more regulatory elements than shorter sequences. Our other main finding is a negative correlation between the rate of non-synonymous substitutions (dN) and codon usage bias (Fop), showing that fast-evolving genes have a lower codon usage bias, consistent with strong positive selection interfering with weak selection for codon usage.

11.
Xiao M  Zhu ZZ  Liu J  Zhang CY 《Acta biotheoretica》2002,50(3):155-165
We have refined entropy theory to explore more conveniently the meaning of the growing body of sequence data on nucleic acids and proteins. The concept of selection constraint was not introduced; only the analyzed sequences themselves were considered. The refined theory serves as a basis for deriving a method to analyze non-coding regions (NCRs) as well as coding regions. Positions with maximal entropy might play the most important role in genome functions, as opposed to positions with minimal entropy. This method was tested on the well-characterized coding regions of 12 strains of Classical Swine Fever Virus (CSFV) and the non-coding regions of 20 strains of CSFV. It is suitable for analyzing the nucleic acid sequences of a complete genome and for detecting sensitive positions for mutagenesis. As such, the method provides a basis for elucidating the functional mechanism.
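The abstract above does not spell out the refined entropy measure, so the following is only a minimal sketch of the general idea, assuming plain Shannon entropy computed per column of an alignment. The function names and the toy alignment are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def column_entropy(column):
    """Shannon entropy (in bits) of the residues found in one alignment column."""
    counts = Counter(column)
    total = len(column)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def positional_entropy(aligned_seqs):
    """Entropy of every column in a list of equal-length aligned sequences."""
    if len({len(s) for s in aligned_seqs}) != 1:
        raise ValueError("sequences must be aligned to the same length")
    # zip(*seqs) walks the alignment column by column.
    return [column_entropy(col) for col in zip(*aligned_seqs)]


# Toy alignment (hypothetical data, not CSFV sequences).
toy_alignment = ["ACGT", "ACGA", "ACTC", "ACTG"]
entropies = positional_entropy(toy_alignment)

# Rank positions from most to least variable.
ranked_positions = sorted(range(len(entropies)), key=lambda i: entropies[i], reverse=True)
```

Under this assumption, fully conserved columns score 0 bits and a column holding all four nucleotides equally often scores 2 bits, so ranking positions by entropy gives a simple way to flag the most variable sites.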

12.
13.
Using a comparative genomics approach to reconstruct the fate of genomic regulatory blocks (GRBs) and identify exonic remnants that have survived the disappearance of their host genes after whole-genome duplication (WGD) in teleosts, we discover a set of 38 candidate cis-regulatory coding exons (RCEs) with predicted target genes. These elements demonstrate evolutionary separation of overlapping protein-coding and regulatory information after WGD in teleosts. We present evidence that the corresponding mammalian exons are still under both coding and non-coding selection pressure, are more conserved than other protein coding exons in the host gene and several control sets, and share key characteristics with highly conserved non-coding elements in the same regions. Their dual function is corroborated by existing experimental data. Additionally, we show examples of human exon remnants stemming from the vertebrate 2R WGD. Our findings suggest that long-range cis-regulatory inputs for developmental genes are not limited to non-coding regions, but can also overlap the coding sequence of unrelated genes. Thus, exonic regulatory elements in GRBs might be functionally equivalent to those in non-coding regions, calling for a re-evaluation of the sequence space in which to look for long-range regulatory elements and experimentally test their activity.

14.
Spider dragline (major ampullate) silk outperforms virtually all other natural and manmade materials in terms of tensile strength and toughness. For this reason, the mass-production of artificial spider silks through transgenic technologies has been a major goal of biomimetics research. Although all known arthropod silk proteins are extremely large (>200 kilodaltons), recombinant spider silks have been designed from short and incomplete cDNAs, the only available sequences. Here we describe the first full-length spider silk gene sequences and their flanking regions. These genes encode the MaSp1 and MaSp2 proteins that compose the black widow's high-performance dragline silk. Each gene includes a single enormous exon (>9000 base pairs) that translates into a highly repetitive polypeptide. Patterns of variation among sequence repeats at the amino acid and nucleotide levels indicate that the interaction of selection, intergenic recombination, and intragenic recombination governs the evolution of these highly unusual, modular proteins. Phylogenetic footprinting revealed putative regulatory elements in non-coding flanking sequences. Conservation of both upstream and downstream flanking sequences was especially striking between the two paralogous black widow major ampullate silk genes. Because these genes are co-expressed within the same silk gland, there may have been selection for similarity in regulatory regions. Our new data provide complete templates for synthesis of recombinant silk proteins that significantly improve the degree to which artificial silks mimic natural spider dragline fibers.

15.
The best known and most thoroughly studied epigenetic phenomenon is DNA methylation, which plays an important role in regulating gene expression during plant regeneration and development. In this study, the methylation-sensitive amplified polymorphism (MSAP) technique was used to determine differences in methylation profiles between two forms of protocorm-like bodies (PLBs), continuously proliferating PLBs (cPLBs) and spontaneously differentiating PLBs (sdPLBs), derived from cultures of Cymbidium hybridium. A total of 72 selective primer combinations were used to assess the status of cytosine methylation of DNA in these tissues. Of the 4,440 fragments obtained, 911 fragments, each representing a recognition site cleaved by one or both of the isoschizomers (Hpa II and Msp I), were amplified and were significantly different between the two forms of PLBs. The frequencies of total and full methylation were 26.7% and 12.2% in cPLBs and 24.1% and 11.1% in sdPLBs, respectively. In addition, the 14 types of MSAP patterns detected in the two forms of PLBs belonged to two classes, type I and II. Sequencing of 14 differentially methylated fragments and a subsequent BLAST search revealed that cytosine-methylated 5′-CCGG-3′ sequences were equally distributed in the coding and non-coding regions. Southern blotting was conducted to verify the methylation polymorphism.

16.
The fire ant Solenopsis invicta and its close relatives display an important social polymorphism involving differences in colony queen number. Colonies are headed by either a single reproductive queen (monogyne form) or multiple queens (polygyne form). This variation in social organization is associated with variation at the gene Gp-9, with monogyne colonies harboring only B-like allelic variants and polygyne colonies always containing b-like variants as well. We describe naturally occurring variation at Gp-9 in fire ants based on 185 full-length sequences, 136 of which were obtained from S. invicta collected over much of its native range. While there is little overall differentiation between most of the numerous alleles observed, a surprising amount is found in the coding regions of the gene, with such substitutions usually causing amino acid replacements. This elevated coding-region variation may result from a lack of negative selection acting to constrain amino acid replacements over much of the protein, different mutation rates or biases in coding and non-coding sequences, negative selection acting with greater strength on non-coding than coding regions, and/or positive selection acting on the protein. Formal selection analyses provide evidence that the latter force played an important role in the basal b-like lineages coincident with the emergence of polygyny. While our data set reveals considerable paraphyly and polyphyly of S. invicta sequences with respect to those of other fire ant species, the b-like alleles of the socially polymorphic species are monophyletic. An expanded analysis of colonies containing alleles of this clade confirmed the invariant link between their presence and expression of polygyny. Finally, our discovery of several unique alleles bearing various combinations of b-like and B-like codons allows us to conclude that no single b-like residue is completely predictive of polygyne behavior and, thus, potentially causally involved in its expression. Rather, all three typical b-like residues appear to be necessary.

17.
Rapid evolution of snake venom genes by positive selection has been reported previously but key features of this process such as the targets of selection, rates of gene turnover, and functional diversity of toxins generated remain unclear. This is especially true for closely related species with divergent diets. We describe the evolution of PLA2 gene sequences isolated from genomic DNA from four taxa of Sistrurus rattlesnakes which feed on different prey. We identified four to seven distinct PLA2 sequences in each taxon and phylogenetic analyses suggest that these sequences represent a rapidly evolving gene family consisting of both paralogous and homologous loci with high rates of gene gain and loss. Strong positive selection was implicated as a driving force in the evolution of these protein coding sequences. Exons coding for amino acids that make up mature proteins have levels of variation two to three times greater than those of the surrounding noncoding intronic sequences. Maximum likelihood models of coding sequence evolution reveal that a high proportion (~30%) of all codons in the mature protein fall into a class of codons with an estimated dN/dS (ω) ratio of at least 2.8. An analysis of selection on individual codons identified nine residues as being under strong (p < 0.01) positive selection, with a disproportionately high proportion of these residues found in two functional regions of the PLA2 protein (surface residues and putative anticoagulant region). This is direct evidence that diversifying selection has led to high levels of functional diversity due to structural differences in proteins among these snakes. Overall, our results demonstrate that both gene gain and loss and protein sequence evolution via positive selection are important evolutionary forces driving adaptive divergence in venom proteins in closely related species of venomous snakes.

18.
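Recognition of splice sites in the human 5' untranslated region based on support vector machines   Total citations: 5, self-citations: 0, citations by others: 5
Recognizing splice sites in the non-coding regions of genes is a very challenging problem in gene identification, especially for splice sites in the 5' untranslated region (5' UTR). Unlike ordinary splice sites, splice sites in the 5' UTR are not flanked by a transition from a coding to a non-coding state, so conventional splice-site recognition algorithms perform poorly in untranslated regions. This paper uses a method based on support vector machines (SVMs) to recognize splice sites in the 5' UTR. To improve recognition accuracy, a kernel-parameter selection method based on a matrix similarity measure is adopted; it determines suitable kernel parameters simply and quickly, and thereby improves the recognition performance of the kernel function. Experiments confirm that the SVM obtained after parameter selection recognizes 5' UTR splice sites well.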
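The abstract does not say which matrix similarity measure is used. As an illustration only, the sketch below assumes one common choice, kernel-target alignment: the cosine similarity (under the Frobenius inner product) between an RBF kernel matrix and the ideal label matrix y yᵀ, with the kernel width chosen to maximize it. All function names, the toy feature vectors, and the candidate gamma values are hypothetical; real splice-site features would come from encoded sequence windows.

```python
import math


def rbf_kernel_matrix(xs, gamma):
    """Gram matrix of the RBF kernel exp(-gamma * ||u - v||^2) over the points in xs."""
    return [
        [math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(u, v))) for v in xs]
        for u in xs
    ]


def alignment(kernel, labels):
    """Cosine similarity between a kernel matrix and the ideal matrix y y^T.

    Assumes labels are +1 / -1, so the Frobenius norm of y y^T equals len(labels).
    """
    n = len(labels)
    inner = sum(kernel[i][j] * labels[i] * labels[j] for i in range(n) for j in range(n))
    kernel_norm = math.sqrt(sum(kernel[i][j] ** 2 for i in range(n) for j in range(n)))
    return inner / (kernel_norm * n)


def select_gamma(xs, labels, candidates):
    """Return the candidate gamma whose kernel matrix aligns best with the labels."""
    return max(candidates, key=lambda g: alignment(rbf_kernel_matrix(xs, g), labels))


# Toy two-class data (hypothetical feature vectors, not real splice-site encodings).
points = [(0.0, 0.1), (0.2, 0.0), (1.0, 0.9), (0.9, 1.1)]
labels = [1, 1, -1, -1]
candidates = [0.01, 0.1, 1.0, 10.0, 100.0]

best_gamma = select_gamma(points, labels, candidates)
```

The appeal of this kind of criterion is that it needs only the kernel matrix and the labels, so no SVM has to be trained for each candidate value; the selected gamma would then be passed to whatever SVM implementation is in use.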

19.
Evolutionary innovation relies partially on changes in gene regulation. While a growing body of evidence demonstrates that such innovation is generated by functional changes or translocation of regulatory elements via mobile genetic elements, the de novo generation of enhancers from non-regulatory/non-mobile sequences has, to our knowledge, not previously been demonstrated. Here we show evidence for the de novo genesis of enhancers in vertebrates. For this, we took advantage of the massive gene loss following the last whole genome duplication in teleosts to systematically identify regions that have lost their coding capacity but retain sequence conservation with mammals. We found that these regions show enhancer activity while the orthologous coding regions have no regulatory activity. These results demonstrate that these enhancers were generated de novo in fish. By revealing that minor changes in non-regulatory sequences are sufficient to generate new enhancers, our study highlights an important playground for creating new regulatory variability and evolutionary innovation.

20.
Comparative polymorphism of the first exon and first intron of the shuttle craft (stc) and Lim3 genes and their putative regulatory 5'-flanking sequences was analyzed using 20 sequenced natural alleles. A comparison of the stc and Lim3 genes showed that the extent of polymorphism was similar in their introns and corresponded to the variation level characteristic of Drosophila melanogaster, while the putative regulatory region and first intron of the stc gene proved to be more variable than the corresponding regions of the Lim3 gene. Since the genes under study occurred on the same chromosomes isolated from one population and were close together in a region having a high recombination rate, the difference in the extent of polymorphism between the regulatory and coding regions was explained by individual characteristics of each gene. The results made it possible to assume that the extent of polymorphism of the coding gene regions is maintained by balancing selection.
