首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
2.
3.

Background

Animal mitochondrial genomes are physically separate from the much larger nuclear genomes and have proven useful both for phylogenetic studies and for understanding genome evolution. Within the phylum Arthropoda the subphylum Crustacea includes over 50,000 named species with immense variation in body plans and habitats, yet only 23 complete mitochondrial genomes are available from this subphylum.

Results

I describe here the complete mitochondrial genome of the crustacean Squilla mantis (Crustacea: Malacostraca: Stomatopoda). This 15994-nucleotide genome, the first described from a hoplocarid, contains the standard complement of 13 protein-coding genes, 22 transfer RNA genes, two ribosomal RNA genes, and a non-coding AT-rich region that is found in most other metazoans. The gene order is identical to that considered ancestral for hexapods and crustaceans. The 70% AT base composition is within the range described for other arthropods. A single unusual feature of the genome is a 230 nucleotide non-coding region between a serine transfer RNA and the nad1 gene, which has no apparent function. I also compare gene order, nucleotide composition, and codon usage of the S. mantis genome and eight other malacostracan crustaceans. A translocation of the histidine transfer RNA gene is shared by three taxa in the order Decapoda, infraorder Brachyura; Callinectes sapidus, Portunus trituberculatus and Pseudocarcinus gigas. This translocation may be diagnostic for the Brachyura. For all nine taxa nucleotide composition is biased towards AT-richness, as expected for arthropods, and is within the range reported for other arthropods. Codon usage is biased, and much of this bias is probably due to the skew in nucleotide composition towards AT-richness.

Conclusion

The mitochondrial genome of Squilla mantis contains one unusual feature, a 230 base pair non-coding region has so far not been described in any other malacostracan. Comparisons with other Malacostraca show that all nine genomes, like most other mitochondrial genomes, share a bias toward AT-richness and a related bias in codon usage. The nine malacostracans included in this analysis are not representative of the diversity of the class Malacostraca, and additional malacostracan sequences would surely reveal other unusual genomic features that could be useful in understanding mitochondrial evolution in this taxon.  相似文献   

4.
Wang X  Wang J  He S  Mayden RL 《Gene》2007,399(1):11-19
The complete mitochondrial genome sequence of the Chinese hook snout carp, Opsariichthys bidens, was newly determined using the long and accurate polymerase chain reaction method. The 16,611-nucleotide mitogenome contains 13 protein-coding genes, two rRNA genes (12S, 16S), 22 tRNA genes, and a noncoding control region. We use these data and homologous sequence data from multiple other ostariophysan fishes in a phylogenetic evaluation to test hypothesis pertaining to codon usage pattern of O. bidens mitochondrial protein genes as well as to re-examine the ostariophysan phylogeny. The mitochondrial genome of O. bidens reveals an alternative pattern of vertebrate mitochondrial evolution. For the mitochondrial protein genes of O. bidens, the most frequently used codon generally ends with either A or C, with C preferred over A for most fourfold degenerate codon families; the relative synonymous codon usage of G-ending codons is greatly elevated in all categories. The codon usage pattern of O. bidens mitochondrial protein genes is remarkably different from the general pattern found previously in the relatively closely related zebrafish and most other vertebrate mitochondria. Nucleotide bias at third codon positions is the main cause of codon bias in the mitochondrial protein genes of O. bidens, as it is biased particularly in favor of C over A. Bayesian analysis of 12 concatenated mitochondrial protein sequences for O. bidens and 46 other teleostean taxa supports the monophyly of Cypriniformes and Otophysi and results in a robust estimate of the otophysan phylogeny.  相似文献   

5.
Understanding the extent and causes of biases in codon usage and nucleotide composition is essential to the study of viral evolution, particularly the interplay between viruses and host cells or immune responses. To understand the common features and differences among viruses we analyzed the genomic characteristics of a representative collection of all sequenced vertebrate-infecting DNA viruses. This revealed that patterns of codon usage bias are strongly correlated with overall genomic GC content, suggesting that genome-wide mutational pressure, rather than natural selection for specific coding triplets, is the main determinant of codon usage. Further, we observed a striking difference in CpG content between DNA viruses with large and small genomes. While the majority of large genome viruses show the expected frequency of CpG, most small genome viruses had CpG contents far below expected values. The exceptions to this generalization, the large gammaherpesviruses and iridoviruses and the small dependoviruses, have sufficiently different life-cycle characteristics that they may help reveal some of the factors shaping the evolution of CpG usage in viruses. Electronic Supplementary Material Electronic Supplementary material is available for this article at and accessible for authorised users. [Reviewing Editor: Dr. Nicolas Galtier]  相似文献   

6.
Eugenia uniflora is a plant native to tropical America that holds great ecological and economic importance. The complete chloroplast (cp) genome sequence of Eugenia uniflora, a member of the Neotropical Myrtaceae family, is reported here. The genome is 158,445 bp in length and exhibits a typical quadripartite structure of the large (LSC, 87,459 bp) and small (SSC, 18,318 bp) single-copy regions, separated by a pair of inverted repeats (IRs, 26,334 bp). It contains 111 unique genes, including 77 protein-coding genes, 30 tRNAs and 4 rRNAs. The genome structure, gene order, GC content and codon usage are similar to the typical angiosperm cp genomes. Comparison of the entire cp genomes of E. uniflora L. and three other Myrtaceae revealed an expansion of 43 bp in the intergenic spacer located between the IRA/large single-copy (LSC) border and the first gene of LSC region. Simple sequence repeat (SSR) analysis revealed that most SSRs are AT rich, which contribute to the overall AT richness of the cp genome. Additionally, fewer SSRs are distributed in the protein-coding sequences compared to the noncoding regions. Phylogenetic analysis among 58 species based on 57 cp genes demonstrated a closer relationship between E. uniflora L. and Syzygium cumini (L). Skeels compared to the Eucalyptus clade in the Myrtaceae family. The complete cp genome sequence of E. uniflora reported here has importance for population genetics, as well as phylogenetic and evolutionary studies in this species and other Myrtaceae species from Neotropical regions.  相似文献   

7.
《遗传学报》2020,47(1):49-60
Noncoding RNAs(ncRNAs) play important roles in many biological processes and provide materials for evolutionary adaptations beyond protein-coding genes, such as in the arms race between the host and pathogen. However, currently, a comprehensive high-resolution analysis of primate genomes that includes the latest annotated ncRNAs is not available. Here, we developed a computational pipeline to estimate the selections that act on noncoding regions based on comparisons with a large number of reference sequences in introns adjacent to the interested regions. Our method yields result comparable with those of the established codon-based method and phyloP method for coding genes; thus, it provides a holistic framework for estimating the selection on the entire genome. We further showed that fastevolving protein-coding genes and their corresponding 50 UTRs have a significantly lower frequency of the CpG dinucleotides than those evolving at an average pace, and these fast-evolving genes are enriched in the process of immunity and host defense. We also identified fast-evolving miRNAs with antiviral functions in cells. Our results provide a resource for high-resolution evolution analysis of the primate genomes.  相似文献   

8.
Phytophthora is a genus entirely comprised of destructive plant pathogens. It belongs to the Stramenopila, a unique branch of eukaryotes, phylogenetically distinct from plants, animals, or fungi. Phytophthora genes show a strong preference for usage of codons ending with G or C (high GC3). The presence of high GC3 in genes can be utilized to differentiate coding regions from noncoding regions in the genome. We found that both selective pressure and mutation bias drive codon bias in Phytophthora. Indicative for selection pressure is the higher GC3 value of highly expressed genes in different Phytophthora species. Lineage specific GC increase of noncoding regions is reminiscent of whole-genome mutation bias, whereas the elevated Phytophthora GC3 is primarily a result of translation efficiency-driven selection. Heterogeneous retrotransposons exist in Phytophthora genomes and many of them vary in their GC content. Interestingly, the most widespread groups of retroelements in Phytophthora show high GC3 and a codon bias that is similar to host genes. Apparently, selection pressure has been exerted on the retroelement’s codon usage, and such mimicry of host codon bias might be beneficial for the propagation of retrotransposons. Reviewing Editor: Dr. Yves van de Peer  相似文献   

9.
Overlapping genes are defined, in this paper, as a pair of adjacent genes whose coding regions are partly overlapping. We systematically analyzed all overlapping genes in the genomes of two closely related species: Mycoplasma genitalium and Mycoplasma pneumoniae. Careful comparisons were made for homologous genes that are overlapped in one species but not in the other. This comparative analysis allows us to propose a model of how overlapping genes emerged in the course of evolution. It was found that overlapping genes were generated primarily due to the loss of a stop codon in either gene, in many cases, the absence of which resulted in elongation of the 3' end of the gene's coding region. More specifically, the loss of the stop codon took place as a result of the following events: deletion of the stop codon (64.4%), point mutation at the stop codon (4.4%), and frame shift at the end of the coding region (6.7%). Overlapping genes, in a sense, can be thought of as the results of evolutionary pressure to minimize genome size. However, our analysis indicates that many overlapping genes, at least in the genomes of M.genitalium and M.pneumoniae, are due to incidental elongation of the coding regions.  相似文献   

10.
11.
The nucleotide divergence in the protein-coding region for replication-dependent and replication-independent histone 3 and 4 genes of Drosophila melanogaster and Drosophila hydei occurred mostly at the synonymous site. Therefore, the pattern of codon usage was analyzed in the two species, considering the genomic codon bias, which is proposed for estimating the genomic composition pressure in the protein-coding regions. The results indicated that the codon usage in the histone gene family could be explained mostly by the genomic codon bias. However, biases for Ala and Arg were commonly observed for the histone 3 and histone 4 gene families, and biases for Ser, Leu, and Glu were observed in a gene-specific manner. This suggests that both genomic codon bias and gene- or codon-specific bias are responsible for the nucleotide differentiation in the protein-coding region of the histone genes.  相似文献   

12.
13.
Simple sequence repeats (SSRs) are omnipresent in prokaryotes and eukaryotes, and are found anywhere in the genome in both protein encoding and noncoding regions. In present study the whole genome sequences of seven chromosomes (Shigella flexneri 2a str301 and 2457T, Shigella sonnei, Escherichia coli k12, Mycobacterium tuberculosis, Mycobacterium leprae and Staphylococcus saprophyticus) have downloaded from the GenBank database for identifying abundance, distribution and composition of SSRs and also to determine difference between the tandem repeats in real genome and randomness genome (using sequence shuffling tool) of the organisms included in this study. The data obtained in the present study show that: (i) tandem repeats are widely distributed throughout the genomes; (ii) SSRs are differentially distributed among coding and noncoding regions in investigated Shigella genomes; (iii) total frequency of SSRs in noncoding regions are higher than coding regions; (iv) in all investigated chromosomes ratio of Trinucleotide SSRs in real genomes are much higher than randomness genomes and Di nucleotide SSRs are lower; (v) Ratio of total and mononucleotide SSRs in real genome is higher than randomness genomes in E. coli K12, S. flexneri str 301 and S. saprophyticus, while it is lower in S. flexneri str 2457T, S.sonnei and M. tuberculosis and it is approximately same in M. leprae; (vi) frequency of codon repetitions are vary considerably depending on the type of encoded amino acids.  相似文献   

14.
张乃心  张玉娟  余果  陈斌 《昆虫学报》2013,56(4):398-407
研究双翅目昆虫线粒体基因组的结构特点, 并设计其测序的通用引物, 为今后双翅目昆虫线粒体基因组的研究提供参考和依据。利用比较基因组学和生物信息学方法, 分析了已经完全测序的26个双翅目昆虫线粒体基因组的结构特点、 碱基组成和保守区, 并据此设计了双翅目昆虫基因组测序的通用引物。结果表明: 双翅目昆虫线粒体基因组长14 503~19 517 bp, 其结构保守, 含有37个编码基因, 包括13个蛋白质编码基因, 22个tRNA编码基因和2个rRNA编码基因, 此外还包含一段长度差异很大的非编码区(AT富含区)。基因组内基因排列次序稳定, 除个别基因外, 其余都与黑腹果蝇Drosophila melanogaster基因排列次序一致。基因组的碱基组成不均衡, AT含量在72.59%~85.15%之间, 碱基使用存在偏向性, 偏好使用AC碱基。全基因组的核苷酸和氨基酸序列保守, 共鉴定了11个保守区。在保守区内共设计了26对双翅目线粒体基因组测序通用引物, 扩增的目标片段都在1 200 bp以内。将该套通用引物用于葱蝇Delia antiqua线粒体全基因组测序, 结果证明其高效、 合用。  相似文献   

15.
Oligonucleotide usage in archaeal and bacterial genomes can be linked to a number of properties, including codon usage (trinucleotides), DNA base-stacking energy (dinucleotides), and DNA structural conformation (di- to tetranucleotides). We wanted to assess the statistical information potential of different DNA ‘word-sizes’ and explore how oligonucleotide frequencies differ in coding and non-coding regions. In addition, we used oligonucleotide frequencies to investigate DNA composition and how DNA sequence patterns change within and between prokaryotic organisms. Among the results found was that prokaryotic chromosomes can be described by hexanucleotide frequencies, suggesting that prokaryotic DNA is predominantly short range correlated, i.e., information in prokaryotic genomes is encoded in short oligonucleotides. Oligonucleotide usage varied more within AT-rich and host-associated genomes than in GC-rich and free-living genomes, and this variation was mainly located in non-coding regions. Bias (selectional pressure) in tetranucleotide usage correlated with GC content, and coding regions were more biased than non-coding regions. Non-coding regions were also found to be approximately 5.5% more AT-rich than coding regions, on average, in the 402 chromosomes examined. Pronounced DNA compositional differences were found both within and between AT-rich and GC-rich genomes. GC-rich genomes were more similar and biased in terms of tetranucleotide usage in non-coding regions than AT-rich genomes. The differences found between AT-rich and GC-rich genomes may possibly be attributed to lifestyle, since tetranucleotide usage within host-associated bacteria was, on average, more dissimilar and less biased than free-living archaea and bacteria.  相似文献   

16.
To study the possible codon usage and base composition variation in the bacteriophages, fourteen mycobacteriophages were used as a model system here and both the parameters in all these phages and their plating bacteria, M. smegmatis had been determined and compared. As all the organisms are GC-rich, the GC contents at third codon positions were found in fact higher than the second codon positions as well as the first + second codon positions in all the organisms indicating that directional mutational pressure is strongly operative at the synonymous third codon positions. Nc plot indicates that codon usage variation in all these organisms are governed by the forces other than compositional constraints. Correspondence analysis suggests that: (i) there are codon usage variation among the genes and genomes of the fourteen mycobacteriophages and M. smegmatis, i.e., codon usage patterns in the mycobacteriophages is phage-specific but not the M. smegmatis-specific; (ii) synonymous codon usage patterns of Barnyard, Che8, Che9d, and Omega are more similar than the rest mycobacteriophages and M. smegmatis; (iii) codon usage bias in the mycobacteriophages are mainly determined by mutational pressure; and (iv) the genes of comparatively GC rich genomes are more biased than the GC poor genomes. Translational selection in determining the codon usage variation in highly expressed genes can be invoked from the predominant occurrences of C ending codons in the highly expressed genes. Cluster analysis based on codon usage data also shows that there are two distinct branches for the fourteen mycobacteriophages and there is codon usage variation even among the phages of each branch.  相似文献   

17.
We sequenced most of the mitochondrial (mt) genomes of 2 apocritan taxa: Vanhornia eucnemidarum and Primeuchroeus spp. These mt genomes have similar nucleotide composition and codon usage to those of mt genomes reported for other Hymenoptera, with a total A + T content of 80.1% and 78.2%, respectively. Gene content corresponds to that of other metazoan mt genomes, but gene organization is not conserved. There are a total of 6 tRNA genes rearranged in V. eucnemidarum and 9 in Primeuchroeus spp. Additionally, several noncoding regions were found in the mt genome of V. eucnemidarum, as well as evidence of a sustained gene duplication involving 3 tRNA genes. We also report an inversion of the large and small ribosomal RNA genes in Primeuchroeus spp. mt genome. However, none of the rearrangements reported are phylogenetically informative with respect to the current taxon sample.  相似文献   

18.
Qin H  Wu WB  Comeron JM  Kreitman M  Li WH 《Genetics》2004,168(4):2245-2260
To study the roles of translational accuracy, translational efficiency, and the Hill-Robertson effect in codon usage bias, we studied the intragenic spatial distribution of synonymous codon usage bias in four prokaryotic (Escherichia coli, Bacillus subtilis, Sulfolobus tokodaii, and Thermotoga maritima) and two eukaryotic (Saccharomyces cerevisiae and Drosophila melanogaster) genomes. We generated supersequences at each codon position across genes in a genome and computed the overall bias at each codon position. By quantitatively evaluating the trend of spatial patterns using isotonic regression, we show that in yeast and prokaryotic genomes, codon usage bias increases along translational direction, which is consistent with purifying selection against nonsense errors. Fruit fly genes show a nearly symmetric M-shaped spatial pattern of codon usage bias, with less bias in the middle and both ends. The low codon usage bias in the middle region is best explained by interference (the Hill-Robertson effect) between selections at different codon positions. In both yeast and fruit fly, spatial patterns of codon usage bias are characteristically different from patterns of GC-content variations. Effect of expression level on the strength of codon usage bias is more conspicuous than its effect on the shape of the spatial distribution.  相似文献   

19.
A well-known mechanism through which new protein-coding genes originate is by modification of pre-existing genes, e.g. by duplication or horizontal transfer. In contrast, many viruses generate protein-coding genes de novo, via the overprinting of a new reading frame onto an existing (“ancestral”) frame. This mechanism is thought to play an important role in viral pathogenicity, but has been poorly explored, perhaps because identifying the de novo frames is very challenging. Therefore, a new approach to detect them was needed. We assembled a reference set of overlapping genes for which we could reliably determine the ancestral frames, and found that their codon usage was significantly closer to that of the rest of the viral genome than the codon usage of de novo frames. Based on this observation, we designed a method that allowed the identification of de novo frames based on their codon usage with a very good specificity, but intermediate sensitivity. Using our method, we predicted that the Rex gene of deltaretroviruses has originated de novo by overprinting the Tax gene. Intriguingly, several genes in the same genomic region have also originated de novo and encode proteins that regulate the functions of Tax. Such “gene nurseries” may be common in viral genomes. Finally, our results confirm that the genomic GC content is not the only determinant of codon usage in viruses and suggest that a constraint linked to translation must influence codon usage.  相似文献   

20.
In this paper, the complete mitochondrial genome of Acraea issoria (Lepidoptera: Nymphalidae: Heliconiinae: Acraeini) is reported; a circular molecule of 15,245 bp in size. For A. issoria, genes are arranged in the same order and orientation as the complete sequenced mitochondrial genomes of the other lepidopteran species, except for the presence of an extra copy of tRNAIle(AUR)b in the control region. All protein-coding genes of A. issoria mitogenome start with a typical ATN codon and terminate in the common stop codon TAA, except that COI gene uses TTG as its initial codon and terminates in a single T residue. All tRNA genes possess the typical clover leaf secondary structure except for tRNASer(AGN), which has a simple loop with the absence of the DHU stem. The sequence, organization and other features including nucleotide composition and codon usage of this mitochondrial genome were also reported and compared with those of other sequenced lepidopterans mitochondrial genomes. There are some short microsatellite-like repeat regions (e.g., (TA)9, polyA and polyT) scattered in the control region, however, the conspicuous macro-repeats units commonly found in other insect species are absent.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号