首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Protein-coding genes often contain long overlapping open-reading frames (ORFs), which may or may not be functional. Current methods that utilize the signature of purifying selection to detect functional overlapping genes are limited to the analysis of sequences from divergent species, thus rendering them inapplicable to genes found only in closely related sequences. Here, we present a method for the detection of selection signatures on overlapping reading frames by using closely related sequences, and apply the method to several known overlapping genes, and to an overlapping ORF on the negative strand of segment 8 of influenza A virus (NEG8), for which the suggestion has been made that it is functional. We find no evidence that NEG8 is under selection, suggesting that the intact reading frame might be non-functional, although we cannot fully exclude the possibility that the method is not sensitive enough to detect the signature of selection acting on this gene. We present the limitations of the method using known overlapping genes and suggest several approaches to improve it in future studies. Finally, we examine alternative explanations for the sequence conservation of NEG8 in the absence of selection. We show that overlap type and genomic context affect the conservation of intact overlapping ORFs and should therefore be considered in any attempt of estimating the signature of selection in overlapping genes.  相似文献   

2.
The cloverleaf secondary structure of transfer RNA (tRNA) is highly conserved across all forms of life. Here, we provide sequence data and inferred secondary structures for all tRNA genes from 8 new arachnid mitochondrial genomes, including representatives from 6 orders. These data show remarkable reductions in tRNA gene sequences, indicating that T-arms are missing from many of the 22 tRNAs in the genomes of 4 out of 7 orders of arachnids. Additionally, all opisthothele spiders possess some tRNA genes that lack sequences that could form well-paired aminoacyl acceptor stems. We trace the evolution of T-arm loss onto phylogenies of arachnids and show that a genome-wide propensity to lose sequences that encode canonical cloverleaf structures likely evolved multiple times within arachnids. Mapping of structural characters also shows that certain tRNA genes appear more evolutionarily prone to lose the sequence coding for the T-arm and that once a T-arm is lost, it is not regained. We use tRNA structural data to construct a phylogeny of arachnids and find high bootstrap support for a clade that is not supported in phylogenies that are based on more traditional morphological characters. Together, our data demonstrate variability in structural evolution among different tRNAs as well as evidence for parallel evolution of the loss of sequence coding for tRNA arms within an ancient and diverse group of animals.  相似文献   

3.
Mitochondrial genomes of onychophorans (velvet worms) present an interesting problem: Some previous studies reported them lacking several transfer RNA (tRNA) genes, whereas others found that all their tRNA genes were present but severely reduced. To resolve this discrepancy, we determined complete mitochondrial DNA (mtDNA) sequences of the onychophorans Oroperipatus sp. and Peripatoides sympatrica as well as cDNA sequences from 14 and 10 of their tRNAs, respectively. We show that tRNA genes in these genomes are indeed highly reduced and encode truncated molecules, which are restored to more conventional structures by extensive tRNA editing. During this editing process, up to 34 nucleotides are added to the tRNA sequences encoded in Oroperipatus sp. mtDNA, rebuilding the aminoacyl acceptor stem, the TΨC arm, and in some extreme cases, the variable arm and even a part of the anticodon stem. The editing is less extreme in P. sympatrica in which at least a part of the TΨC arm is always encoded in mtDNA. When the entire TΨC arm is added de novo in Oroperipatus sp., the sequence of this arm is either identical or similar among different tRNA species, yet the sequences show substantial variation for each tRNA. These observations suggest that the arm is rebuilt, at least in part, by a template-independent mechanism and argue against the alternative possibility that tRNA genes or their parts are imported from the nucleus. By contrast, the 3' end of the aminoacyl acceptor stem is likely restored by a template-dependent mechanism. The extreme tRNA editing reported here has been preserved for >140 My as it was found in both extant families of onychophorans. Furthermore, a similar type of tRNA editing may be present in several other groups of arthropods, which show a high degree of tRNA gene reduction in their mtDNA.  相似文献   

4.
5.
We propose a method by which the intensity of purifying selection on a functional protein-coding gene is estimated by using three aligned homologous sequences: a processed pseudogene (psi), a functional paralog from the same species (g), and a functional ortholog from a different species (o). For each such trio, we calculate the numbers of nucleotide substitutions along the branches leading to psi and g, i.e., K psi and K(g). If we assume that the mutation rates are the same in the genes and the pseudogenes and that mutations occurring in a pseudogene do not affect the fitness of the organism, we can show that the fraction of mutations that are selectively neutral, fg, is equal to the ratio K(g)/K psi. Since advantageous mutations occur only very rarely, such that they do not contribute significantly to the rate of molecular evolution, the fraction of deleterious mutations that are subject to purifying selection is 1-fg. Therefore, the K(g)/K psi ratio can be used directly to estimate the intensity of purifying selection, thereby isolating its effects on the rate of evolution from those of mutation. We compared the selection intensities of 12 orthologous protein-coding pairs from humans and murids. As expected, the fraction of mutations that are subject to purifying selection is strongest in the second codon position and weakest in the third. Interestingly, the mean fractions of effectively neutral mutations in the third codon position were only 41% and 42% for murids and humans, respectively, indicating that many synonymous mutations are subject to selective constraint. In several orthologous genes, we found that the intensity of purifying selection is very different between murid and human orthologous genes. There was no statistically significant difference in overall intensity of purifying selection between humans and murids. Thus, purifying selection does not seem to be an important factor contributing to the observed differences in the rates of evolution between these two taxa.  相似文献   

6.
The reconstruction of correct genealogies among biological entities, the estimation of the divergence time between organisms or the study of the different events that occur along evolutionary lineages are not always based on suitable genes. For reliable results, it is necessary to look at full-length sequences of genes under stabilizing selection (neutral or purifying) and behaving as good molecular clocks. In bacteria it has been proved that the malate dehydrogenase gene (mdh) can be used to determine the inter- and intraspecies divergence, and hence this gene constitutes a potential marker for phylogeny and bacterial population genetics. We have sequenced the full-length mdh gene in 36 type and reference strains of Aeromonas. The species grouping obtained in the phylogenetic tree derived from mdh sequences was in agreement with that currently accepted for the genus Aeromonas. The maximum likelihood models applied to our sequences indicated that the mdh gene is highly conserved among the Aeromonas species and the main evolutionary force acting on it is purifying selection. Only two sites under potential diversifying selection were identified (T 108 and S 193). In order to determine if these two residues could have an influence on the MDH structure, we mapped them in a three-dimensional model constructed from the sequence of A. hydrophila using the human mitochondrial MDH as a template. The presence of purifying selection together with the linear relationship between substitutions and gene divergence makes the mdh an excellent candidate gene for a phylogeny of Aeromonas and probably for other bacterial groups.  相似文献   

7.
Several temperate bacteriophage utilize chromosomal sequences encoding putative tRNA genes for phage attachment. However, whether these sequences belong to genes which are functional as tRNA is generally not known. In this article, we demonstrate that the attachment site of temperate phage 16-3 (attB) nests within an active proline tRNA gene in Rhizobium meliloti 41. A loss-of-function mutation in this tRNA gene leads to significant delay in switching from lag to exponential growth phase. We converted the putative Rhizobium gene to an active amber suppressor gene which suppressed amber mutant alleles of genes of 16-3 phage and of Escherichia coli origin in R. meliloti 41 and in Agrobacterium tumefaciens GV2260. Upon lysogenization of R. meliloti by phage 16-3, the proline tRNA gene retained its structural and functional integrity. Aspects of the co-evolution of a temperate phage and its bacterium host is discussed. The side product of this work, i.e. construction of amber suppressor tRNA genes in Rhizobium and Agrobacterium, for the first time widens the options of genetic study.  相似文献   

8.

Background

Although mitochondrial (mt) gene order is highly conserved among vertebrates, widespread gene rearrangements occur in anurans, especially in neobatrachians. Protein coding genes in the mitogenome experience adaptive or purifying selection, yet the role that selection plays on genomic reorganization remains unclear. We sequence the mitogenomes of three species of Glandirana and hot spots of gene rearrangements of 20 frog species to investigate the diversity of mitogenomic reorganization in the Neobatrachia. By combing these data with other mitogenomes in GenBank, we evaluate if selective pressures or functional constraints act on mitogenomic reorganization in the Neobatrachia. We also look for correlations between tRNA positions and codon usage.

Results

Gene organization in Glandirana was typical of neobatrachian mitogenomes except for the presence of pseudogene trnS (AGY). Surveyed ranids largely exhibited gene arrangements typical of neobatrachian mtDNA although some gene rearrangements occurred. The correlation between codon usage and tRNA positions in neobatrachians was weak, and did not increase after identifying recurrent rearrangements as revealed by basal neobatrachians. Codon usage and tRNA positions were not significantly correlated when considering tRNA gene duplications or losses. Change in number of tRNA gene copies, which was driven by genomic reorganization, did not influence codon usage bias. Nucleotide substitution rates and dN/dS ratios were higher in neobatrachian mitogenomes than in archaeobatrachians, but the rates of mitogenomic reorganization and mt nucleotide diversity were not significantly correlated.

Conclusions

No evidence suggests that adaptive selection drove the reorganization of neobatrachian mitogenomes. In contrast, protein-coding genes that function in metabolism showed evidence for purifying selection, and some functional constraints appear to act on the organization of rRNA and tRNA genes. As important nonadaptive forces, genetic drift and mutation pressure may drive the fixation and evolution of mitogenomic reorganizations.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-691) contains supplementary material, which is available to authorized users.  相似文献   

9.
10.
Bivalve species are characterized by extraordinary variability in terms of mitochondrial (mt) genome size, gene arrangement and tRNA gene number. Many species are thought to lack the mitochondrial protein-coding gene atp8. Of these species, the Mytilidae appears to be the only known taxon with doubly uniparental inheritance of mtDNA that does not possess the atp8 gene. This raises the question as to whether mytilids have completely lost the ATP8 protein, whether the gene has been transferred to the nucleus or whether they possess a highly modified version of the gene/protein that has led to its lack of annotation. In the present study, we re-investigated all complete (or nearly complete) F and M mytilid mt genomes previously sequenced for the presence of conserved open reading frames (ORFs) that might code for ATP8 and/or have other functional importance in these bivalves. We also revised the annotations of all available complete mitochondrial genomes of bivalves and nematodes that are thought to lack atp8 in an attempt to detect it. Our results indicate that a novel mytilid ORF of significant length (i.e., the ORF is >85 amino acids in length), with complete start and stop codons, is a candidate for the atp8 gene: (1) it possesses a pattern of evolution expected for a protein-coding gene evolving under purifying selection (i.e., the 3rd>1st>2nd codon pattern of evolution), (2) it is actively transcribed in Mytilus species, (3) it has one predicted transmembrane helix (as do other metazoan ATP8 proteins), (4) it has conserved functional motifs and (5), comparisons of its amino acid sequence with ATP8 sequences of other molluscan or bivalve species reveal similar hydropathy profiles. Furthermore, our revised annotations also confirmed the mt presence of atp8 in almost all bivalve species and in one nematode species. Our results thus support recognizing the presence of ATPase 8 in most bivalves mt genomes (if not all) rather than the continued characterization of these genomes as lacking this gene.  相似文献   

11.
Overlapping genes are two protein-coding sequences sharing a significant part of the same DNA locus in different reading frames. Although in recent times an increasing number of examples have been found in bacteria the underlying mechanisms of their evolution are unknown. In this work we explore how selective pressure in a protein-coding sequence influences its overlapping genes in alternative reading frames. We model evolution using a time-continuous Markov process and derive the corresponding model for the remaining frames to quantify selection pressure and genetic noise. Our findings lead to the presumption that, once information is embedded in the reverse reading frame −2 (relative to the mother gene in +1) purifying selection in the protein-coding reading frame automatically protects the sequences in both frames. We also found that this coincides with the fact that the genetic noise measured using the conditional entropy is minimal in frame −2 under selection in the coding frame.  相似文献   

12.
13.
The assumption that conservation of sequence implies the action of purifying selection is central to diverse methodologies to infer functional importance. GC-biased gene conversion (gBGC), a meiotic mismatch repair bias strongly favouring GC over AT, can in principle mimic the action of selection, this being thought to be especially important in mammals. As mutation is GC→AT biased, to demonstrate that gBGC does indeed cause false signals requires evidence that an AT-rich residue is selectively optimal compared to its more GC-rich allele, while showing also that the GC-rich alternative is conserved. We propose that mammalian stop codon evolution provides a robust test case. Although in most taxa TAA is the optimal stop codon, TGA is both abundant and conserved in mammalian genomes. We show that this mammalian exceptionalism is well explained by gBGC mimicking purifying selection and that TAA is the selectively optimal codon. Supportive of gBGC, we observe (i) TGA usage trends are consistent at the focal stop codon and elsewhere (in UTR sequences); (ii) that higher TGA usage and higher TAA→TGA substitution rates are predicted by a high recombination rate; and (iii) across species the difference in TAA <-> TGA substitution rates between GC-rich and GC-poor genes is largest in genomes that possess higher between-gene GC variation. TAA optimality is supported both by enrichment in highly expressed genes and trends associated with effective population size. High TGA usage and high TAA→TGA rates in mammals are thus consistent with gBGC’s predicted ability to “drive” deleterious mutations and supports the hypothesis that sequence conservation need not be indicative of purifying selection. A general trend for GC-rich trinucleotides to reside at frequencies far above their mutational equilibrium in high recombining domains supports the generality of these results.

Is sequence conservation a sign of purifying selection and hence functional importance? This analysis of why mammals use and conserve the most error-prone stop codon suggests not, consistent with GC-biased gene conversion’s predicted ability to “drive” deleterious mutations and supporting the hypothesis that sequence conservation need not be indicative of purifying selection.  相似文献   

14.
A J Lohan  K H Wolfe 《Genetics》1998,150(1):425-433
The plastid genome of the nonphotosynthetic parasitic plant Epifagus virginiana contains only 17 of the 30 tRNA genes normally found in angiosperm plastid DNA. Although this is insufficient for translation, the genome is functional, so import of cytosolic tRNAs into plastids has been suggested. This raises the question of whether the tRNA genes that remain in E. virginiana plastid DNA are active or have just fortuitously escaped deletion. We report the sequences of 20 plastid tRNA loci from Orobanche minor, which shares a nonphotosynthetic ancestor with E. virginiana. The two species have 9 intact tRNA genes in common, the others being defunct in one or both species. The intron-containing trnLUAA gene is absent from E. virginiana, but it is intact, transcribed, and spliced in O. minor. The shared intact genes are better conserved than intergenic sequences, which indicates that these genes are being maintained by natural selection and, therefore, must be functional. For the most part, the tRNA species conserved in nonphotosynthetic plastids are also those that have never been found to be imported in plant mitochondria, which suggests that the same rules may govern tRNA import in the two organelles. A small photosynthesis gene, psbI, is still intact in O. minor, and computer simulations show that some small nonessential genes have an appreciable chance of escaping deletion.  相似文献   

15.
Identification of the full complement of genes and other functional elements in any virus is crucial to fully understand its molecular biology and guide the development of effective control strategies. RNA viruses have compact multifunctional genomes that frequently contain overlapping genes and non-coding functional elements embedded within protein-coding sequences. Overlapping features often escape detection because it can be difficult to disentangle the multiple roles of the constituent nucleotides via mutational analyses, while high-throughput experimental techniques are often unable to distinguish functional elements from incidental features. However, RNA viruses evolve very rapidly so that, even within a single species, substitutions rapidly accumulate at neutral or near-neutral sites providing great potential for comparative genomics to distinguish the signature of purifying selection. Computationally identified features can then be efficiently targeted for experimental analysis. Here we analyze alignments of protein-coding virus sequences to identify regions where there is a statistically significant reduction in the degree of variability at synonymous sites, a characteristic signature of overlapping functional elements. Having previously tested this technique by experimental verification of discoveries in selected viruses, we now analyze sequence alignments for ∼700 RNA virus species to identify hundreds of such regions, many of which have not been previously described.  相似文献   

16.
Widespread positive selection in synonymous sites of mammalian genes   总被引:5,自引:0,他引:5  
Evolution of protein sequences is largely governed by purifying selection, with a small fraction of proteins evolving under positive selection. The evolution at synonymous positions in protein-coding genes is not nearly as well understood, with the extent and types of selection remaining, largely, unclear. A statistical test to identify purifying and positive selection at synonymous sites in protein-coding genes was developed. The method compares the rate of evolution at synonymous sites (Ks) to that in intron sequences of the same gene after sampling the aligned intron sequences to mimic the statistical properties of coding sequences. We detected purifying selection at synonymous sites in approximately 28% of the 1,562 analyzed orthologous genes from mouse and rat, and positive selection in approximately 12% of the genes. Thus, the fraction of genes with readily detectable positive selection at synonymous sites is much greater than the fraction of genes with comparable positive selection at nonsynonymous sites, i.e., at the level of the protein sequence. Unlike other genes, the genes with positive selection at synonymous sites showed no correlation between Ks and the rate of evolution in nonsynonymous sites (Ka), indicating that evolution of synonymous sites under positive selection is decoupled from protein evolution. The genes with purifying selection at synonymous sites showed significant anticorrelation between Ks and expression level and breadth, indicating that highly expressed genes evolve slowly. The genes with positive selection at synonymous sites showed the opposite trend, i.e., highly expressed genes had, on average, higher Ks. For the genes with positive selection at synonymous sites, a significantly lower mRNA stability is predicted compared to the genes with negative selection. Thus, mRNA destabilization could be an important factor driving positive selection in nonsynonymous sites, probably, through regulation of expression at the level of mRNA degradation and, possibly, also translation rate. So, unexpectedly, we found that positive selection at synonymous sites of mammalian genes is substantially more common than positive selection at the level of protein sequences. Positive selection at synonymous sites might act through mRNA destabilization affecting mRNA levels and translation.  相似文献   

17.
MOTIVATION: Viral genomes tend to code in overlapping reading frames to maximize informational content. This may result in atypical codon bias and particular evolutionary constraints. Due to the fast mutation rate of viruses, there is additional strong evidence for varying selection between intra- and intergenomic regions. The presence of multiple coding regions complicates the concept of K(a)/K(s) ratio, and thus begs for an alternative approach when investigating selection strengths. Building on the paper by McCauley and Hein, we develop a method for annotating a viral genome coding in overlapping reading frames. We introduce an evolutionary model capable of accounting for varying levels of selection along the genome, and incorporate it into our prior single sequence HMM methodology, extending it now to a phylogenetic HMM. Given an alignment of several homologous viruses to a reference sequence, we may thus achieve an annotation both of coding regions as well as selection strengths, allowing us to investigate different selection patterns and hypotheses. RESULTS: We illustrate our method by applying it to a multiple alignment of four HIV2 sequences, as well as of three Hepatitis B sequences. We obtain an annotation of the coding regions, as well as a posterior probability for each site of the strength of selection acting on it. From this we may deduce the average posterior selection acting on the different genes. Whilst we are encouraged to see in HIV2, that the known to be conserved genes gag and pol are indeed annotated as such, we also discover several sites of less stringent negative selection within the env gene. To the best of our knowledge, we are the first to subsequently provide a full selection annotation of the Hepatitis B genome by explicitly modelling the evolution within overlapping reading frames, and not relying on simple K(a)/K(s) ratios.  相似文献   

18.
19.
Inferring the intensity of positive selection in protein-coding genes is important since it is used to shed light on the process of adaptation. Recently, it has been reported that overlapping genes, which are ubiquitous in all domains of life, seem to exhibit inordinate degrees of positive selection. Here, we present a new method for the simultaneous estimation of selection intensities in overlapping genes. We show that the appearance of positive selection is caused by assuming that selection operates independently on each gene in an overlapping pair, thereby ignoring the unique evolutionary constraints on overlapping coding regions. Our method uses an exact evolutionary model, thereby voiding the need for approximation or intensive computation. We test the method by simulating the evolution of overlapping genes of different types as well as under diverse evolutionary scenarios. Our results indicate that the independent estimation approach leads to the false appearance of positive selection even though the gene is in reality subject to negative selection. Finally, we use our method to estimate selection in two influenza A genes for which positive selection was previously inferred. We find no evidence for positive selection in both cases.  相似文献   

20.
CYCLOIDEA (CYC) and DICHOTOMA (DICH) are paralogous genes that determine adaxial (dorsal) flower identity in the bilaterally symmetric flowers of Antirrhinum majus (snapdragon). We show here that the duplication leading to the existence of both CYC and DICH in Antirrhinum occurred before the radiation of the Antirrhineae (the tribe to which snapdragon belongs). We find no additional gene duplications within Antirrhineae. Using explicit codon-based models of evolution in a likelihood framework, we show that patterns of molecular evolution after the duplication that gave rise to CYC and DICH are consistent with purifying selection acting at both loci, despite their known functional redundancy in snapdragon. However, for specific gene regions, purifying selection is significantly relaxed across DICH lineages, relative to CYC lineages. In addition, we find evidence for relaxed purifying selection along the lineage leading to snapdragon in one of two putative functional domains of DICH. A model of selection accounting for the persistence of paralogous genes in the absence of diversifying selection is presented. This model takes into account differences in the degree of purifying selection acting at the two loci and is consistent with subfunctionalization models of paralogous gene evolution.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号