Similar literature
1.
The relative contribution of mutation and selection to the G+C content of DNA was analyzed in bacterial species having widely different G+C contents. The analysis used two methods that were developed previously. The first method was to plot the average G+C content of a set of nucleotides against the G+C content of the third codon position for each gene. This method was used to present the G+C distribution of the third codon position and to assess the neutrality of a set of nucleotides relative to that of the third codon position. The second method was to plot the intrastrand bias of the third codon position from Parity Rule 2 (PR2), where A=T and G=C. It was found that whereas intragenomic distributions of the DNA G+C content of these bacteria are narrow in the majority of species, in some species the G+C content of the minor class of genes is distributed over a wider range than that of the major class of genes. On the other hand, ubiquitous PR2 biases are amino acid specific and independent of the G+C content of DNA, so that when averaged over the amino acids, the biases are small and not correlated with the DNA G+C content. Therefore, translation-coupled PR2 biases are unlikely to explain the wide range of G+C contents among different species. Considering all data available, it was concluded that the amino acid-specific PR2 bias has only a minor effect, if any, on the average G+C content. In addition, PR2 bias patterns of different species show phylogenetic relationships, and the pattern can serve as a taxon fingerprint. Received: 5 November 1998 / Accepted: 1 March 1999
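The two quantities plotted in this kind of analysis can be illustrated with a minimal sketch. It assumes an in-frame coding sequence given as a plain string, and it computes gene-level values only (the abstract describes amino acid-specific biases, which this sketch does not separate). The function names `third_position_counts` and `gc3_and_pr2` are hypothetical, and the bias measures G3/(G3+C3) and A3/(A3+T3) are the usual PR2 coordinates, where 0.5 means no intrastrand bias.

```python
from typing import Dict, Optional


def third_position_counts(cds: str) -> Dict[str, int]:
    """Count A, C, G and T at the third codon positions of an in-frame CDS."""
    seq = cds.upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    third = seq[2:usable:3]           # every third base, starting at codon position 3
    return {base: third.count(base) for base in "ACGT"}


def ratio(numerator: int, denominator: int) -> Optional[float]:
    """Return numerator/denominator, or None when the denominator is zero."""
    return numerator / denominator if denominator else None


def gc3_and_pr2(cds: str) -> Dict[str, Optional[float]]:
    """Return GC3 and the two PR2 bias coordinates for one gene (hypothetical helper)."""
    n = third_position_counts(cds)
    return {
        "gc3": ratio(n["G"] + n["C"], sum(n.values())),  # G+C content at position 3
        "gc_bias": ratio(n["G"], n["G"] + n["C"]),       # G3/(G3+C3); 0.5 = parity
        "at_bias": ratio(n["A"], n["A"] + n["T"]),       # A3/(A3+T3); 0.5 = parity
    }


# Toy sequence with codons ATG GCC AAA TTT GGC (third bases G, C, A, T, C).
example = gc3_and_pr2("ATGGCCAAATTTGGC")
```

Plotting `gc_bias` against `at_bias` for many genes would give a PR2-style scatter; a real analysis would also need to restrict the counts to four-codon amino acids, which is left out here.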

2.
The principal intracellular symbiotic bacteria of the cereal weevil Sitophilus oryzae were characterized using the sequence of the 16S rDNA gene (rrs gene) and G + C content analysis. Polymerase chain reaction amplification with universal eubacterial primers of the rrs gene showed a single expected sequence of 1,501 bp. Comparison of this sequence with the available database sequences placed the intracellular bacteria of S. oryzae as members of the Enterobacteriaceae family, closely related to the free-living bacteria, Erwinia herbicola and Escherichia coli, and the endocytobiotic bacteria of the tsetse fly and aphids. Moreover, by high-performance liquid chromatography, we measured the genomic G + C content of the S. oryzae principal endocytobiotes (SOPE) as 54%, while the known genomic G + C content of most intracellular bacteria is about 39.5%. Furthermore, based on the third codon position G + C content and the rrs gene G + C content, we demonstrated that most intracellular bacteria except SOPE are A + T biased irrespective of their phylogenetic position. Finally, using the hsp60 gene sequence, the codon usage of SOPE was compared with that of two phylogenetically closely related bacteria: E. coli, a free-living bacterium, and Buchnera aphidicola, the intracellular symbiotic bacteria of aphids. Taken together, these results show a peculiar and distinctly different DNA composition of SOPE with respect to the other obligate intracellular bacteria, and, combined with biological and biochemical data, they elucidate the evolution of symbiosis in S. oryzae. Received: 8 September 1997 / Accepted: 24 October 1997

3.
Genes of a multicellular organism are heterogeneous in G+C content, particularly at the third codon position. The extent of deviation from the intra-strand equality rule of A = T and G = C (Parity Rule 2, or PR2) is specific for individual amino acids and has been expressed as the PR2-bias fingerprint. Previous results suggested that the PR2-bias fingerprints tend to be similar among the genes of an organism, and that the fingerprint of the organism is specific for different taxa, reflecting phylogenetic relationships of organisms. In this study, using coding sequences of a large number of human genes, we examined the intragenomic heterogeneity of their PR2-bias fingerprints in relation to the G+C content of the third codon position (P3). The results show that the PR2-bias fingerprint is similar over a wide range of the G+C content at the third codon position (0.30–0.80). This range covers approximately 89% of the genes, and further analysis of the high G+C range (0.80–1.00), where genes with normal PR2-bias fingerprints and those with anomalous fingerprints are mixed, shows that a total of 95% of genes have similar fingerprints. The result indicates that the PR2-bias fingerprint is a unique property of an organism and represents the overall characteristics of the genome. Combined with the previous results that the evolutionary change of the PR2-bias fingerprint is a slow process, PR2-bias fingerprints may be used for phylogenetic analyses to supplement and augment the conventional methods that use the differences of the sequences of orthologous proteins and nucleic acids. Potential advantages and disadvantages of the PR2-bias fingerprint analysis are discussed. Received: 21 December 2000 / Accepted: 16 February 2001

4.
Analysis of DNA sequences of 132 introns and 140 exons from 42 pairs of orthologous genes of mouse and rat was used to compare patterns of evolutionary change between introns and exons. The mean of the absolute difference in length (measured in base pairs) between the two species was nearly five times as high in the case of introns as in the case of exons. The average rate of nucleotide substitution in introns was very similar to the rate of synonymous substitution in exons, and both were about three times the rate of substitution at nonsynonymous sites in exons. The G+C contents of introns and exons of the same gene were correlated; but mean G+C content at the third positions of exons was significantly higher than that of introns or positions 1–2 of exons from the same gene. G+C content was conserved over evolutionary time, as indicated by strong correlations between mouse and rat; but the change in G+C content was greatest at position 3 of exons, intermediate in introns, and lowest at positions 1–2 of exons. Received: 23 December 1996 / Accepted: 1 April 1997

5.
Synonymous codon choices vary considerably among Schistosoma mansoni genes. Principal components analysis detects a single major trend among genes, which correlates strongly with GC content in third codon positions and exons, but does not discriminate between putatively highly and lowly expressed genes. The effective number of codons used in each gene, and its distribution when plotted against GC3, suggest that codon usage is shaped mainly by mutational biases. The GC contents of exons, GC3, 5′, 3′, and flanking (5′+ 3′+ introns) regions are all correlated with one another, suggesting that variations in GC content may exist among different regions of the S. mansoni genome. We propose that this genome structure might be among the most important factors shaping codon usage in this species, although the action of selection on certain sequences cannot be excluded. Received: 10 March 1997 / Accepted: 27 June 1997

6.
Identifying the G + C difference between closely related bacterial species or between different strains of the same species is one of the first steps in understanding the evolutionary mechanisms accounting for the differences observed among bacterial species. The G + C content can be one of the most important factors in the evolution of genomic structures. In this paper, we describe a new method for detecting an initial stage of differentiation of the G + C content at the third codon base position between two strains of the same bacterial species. We apply this method to the two strains of Helicobacter pylori. A group of genes is detected with large variations of G + C in the third positions, apparently genes of early response to pressures of changing G + C. We discuss our findings from the viewpoint of genomic evolution. Received: 26 February 2001 / Accepted: 16 May 2001

7.
Studies of the distribution of the three group I introns (intron A, intron T, and intron AT) in the 26S rDNA of Gaeumannomyces graminis had suggested that they were transferred to a common ancestor of G. graminis var. avenae and var. tritici after it had branched off from var. graminis. Intron AT and intron A exhibited vertical inheritance and coevolved in concert with their hosts. Intron loss could occur after its acquisition. Loss of any one of the three introns could occur in var. tritici whereas only loss of intron T had been found in the majority of var. avenae isolates. The existence of isolates of var. tritici and var. avenae with three introns suggested that intron loss could be reversed by intron acquisition and that the whole process is a dynamic one. This process of intron acquisition and intron loss reached different equilibrium points for different varieties and subgroups, which explained the irregular distribution of these introns in G. graminis. Each of the three group I introns was more closely related to other intron sequences that share the same insertion point in the 26S rDNA than to each other. These introns in distantly related organisms appeared to have a common ancestry. This system had provided a good model for studies on both the lateral transfer and common ancestry of group I introns in the 26S rRNA genes. Received: 17 May 1996 / Accepted: 14 January 1997

8.
Genes with atypical G+C content and pattern of codon usage in a certain genome are possibly of exotic origin, and this idea has been applied to identify horizontal events. In this way, it was postulated that a total of 755 genes in the E. coli genome are relics of horizontal events after the divergence of E. coli from the Salmonella lineage 100 million years ago (Lawrence and Ochman, 1998). In this paper we propose a new way to study sequence composition more thoroughly. We found that although the 755 genes differ in composition from other genes in the E. coli genome, the difference is minor. If we accepted that these genes are horizontally transferred, then (1) it would be more likely that they were transferred from genomes evolutionarily closely related to E. coli; but (2) the dating method used by Lawrence and Ochman (1997, 1998) largely underestimated the average age of introduced sequences in the E. coli genome; in particular, most of the 755 genes would have been introduced into E. coli before, instead of after, the divergence of E. coli from the Salmonella lineage. Our study reveals that atypical G+C content and pattern of codon usage are not reliable indicators of horizontal gene transfer events. Received: 27 September 2000 / Accepted: 9 April 2001

9.
We compared the codon usage of sequences of transposable elements (TEs) with that of host genes from the species Drosophila melanogaster, Arabidopsis thaliana, Caenorhabditis elegans, Saccharomyces cerevisiae, and Homo sapiens. Factorial correspondence analysis showed that, regardless of the base composition of the genome, the TEs differed from the genes of their host species by their AT-richness. In all species, the percentage of A + T on the third codon position of the TEs was higher than that on the first codon position and lower than that in the noncoding DNA of the genomes. This indicates that the codon choice is not simply the outcome of mutational bias but is also subject to selection constraints. A tendency toward higher A + T on the third position than on the first position was also found in the host genes of A. thaliana, C. elegans, and S. cerevisiae but not in those of D. melanogaster and H. sapiens. This strongly suggests that the AT choice is a host-independent characteristic common to all TEs. The codon usage of TEs generally appeared to be different from the mean of the host genes. In the AT-rich genomes of Arabidopsis thaliana, Caenorhabditis elegans, and Saccharomyces cerevisiae, the codon usage bias of TEs was similar to that of weakly expressed genes. In the GC-rich genome of D. melanogaster, however, the bias in codon usage of the TEs clearly differed from that of weakly expressed genes. These findings suggest that selection acts on TEs and that TEs may display specific behavior within the host genomes. Received: 2 May 2001 / Accepted: 29 October 2001

10.
11.
While the two amylase genes of Drosophila melanogaster are intronless, the three genes of D. pseudoobscura harbor a short intron. This raises the question of the common structure of the Amy gene in Drosophila species. We have investigated the presence or absence of an intron in the amylase genes of 150 species of Drosophilids. Using polymerase chain reaction (PCR), we have amplified a region that surrounds the intron site reported in D. pseudoobscura and a few other species. The results revealed that most species contain an intron, with a variable size ranging from 50 to 750 bp, although the predominant size was around 60–80 bp. Several species belonging to different lineages were found to lack an intron. This loss of intervening sequence was likely due to evolutionarily independent and rather frequent events. Some other species had both types of genes: in the obscura group, and to a lesser extent in the ananassae subgroup, intronless copies had diverged considerably from intron-containing genes. Base composition of short introns was found to be variable and correlated with that of the surrounding exons, whereas long introns were all A-T rich. We have extended our study to non-Drosophilid insects. In species from other orders of Holometabola, Lepidoptera and Hymenoptera, an intron was found at an identical position in the Amy gene, suggesting that the intron was ancestral. Received: 23 October 1995 / Accepted: 5 March 1996

12.
The 22,704-bp circular mitochondrial DNA (mtDNA) of the chlamydomonad alga Chlorogonium elongatum was completely cloned and sequenced. The genome encodes seven proteins of the respiratory electron transport chain: subunit 1 of the cytochrome oxidase complex (cox1), apocytochrome b (cob), and five subunits of the NADH dehydrogenase complex (nad1, nad2, nad4, nad5, and nad6). It also encodes a set of three tRNAs (Q, W, M) and the large (LSU)- and small (SSU)-subunit ribosomal RNAs. Six group-I introns were found, two each in the cox1, cob, and nad5 genes. In each intron an open reading frame (ORF) related to maturases or endonucleases was identified. Both the LSU and the SSU rRNA genes are split into fragments intermingled with each other and with other genes. Although the average A + T content is 62.2%, GC-rich clusters were detected in intergenic regions, in variable domains of the rRNA genes, and in introns and intron-encoded ORFs. A comparison of the genome maps reveals that C. elongatum and Chlamydomonas eugametos mtDNAs are more closely related to one another than either is to Chlamydomonas reinhardtii mtDNA. Received: 3 November 1997 / Accepted: 12 January 1998

13.
Variation in GC content, GC skew and AT skew along genomic regions was examined at third codon positions in completely sequenced prokaryotes. Eight out of nine eubacteria studied show GC and AT skews that change sign at the origin of replication. The leading strand in DNA replication is G-T rich at codon position 3 in six eubacteria, but C-T rich in two Mycoplasma species. In M. genitalium the AT and GC skews are symmetrical around the origin and terminus of replication, whereas its GC content variation has been shown to have a centre of symmetry elsewhere in the genome. Borrelia burgdorferi and Treponema pallidum show extraordinary extents of base composition skew correlated with direction of DNA replication. Base composition skews measured at third codon positions probably reflect mutational biases, whereas those measured over all bases in a sequence (or at codon positions 1 and 2) can be strongly affected by protein considerations due to the tendency in some bacteria for genes to be transcribed in the same direction that they are replicated. Consequently in some species the direction of skew for total genomic DNA is opposite to that for codon position 3. Received: 2 February 1998 / Accepted: 15 June 1998
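The skew measures named in this abstract are commonly defined as (G − C)/(G + C) and (A − T)/(A + T). A minimal sketch of computing them at third codon positions follows; it assumes an in-frame coding sequence read on the strand of interest, and the function name `third_position_skews` is hypothetical rather than taken from the study.

```python
from typing import Optional, Tuple


def third_position_skews(cds: str) -> Tuple[Optional[float], Optional[float]]:
    """Return (GC skew, AT skew) computed from third codon positions only.

    GC skew = (G - C) / (G + C) and AT skew = (A - T) / (A + T).
    A value is None when its denominator is zero.
    """
    seq = cds.upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    third = seq[2:usable:3]           # bases at codon position 3
    a, c = third.count("A"), third.count("C")
    g, t = third.count("G"), third.count("T")
    gc_skew = (g - c) / (g + c) if (g + c) else None
    at_skew = (a - t) / (a + t) if (a + t) else None
    return gc_skew, at_skew


# Toy sequence with codons ATG GCC AAA TTT GGC (third bases G, C, A, T, C).
gc_skew, at_skew = third_position_skews("ATGGCCAAATTTGGC")
```

Under these definitions, a strand that is G-T rich at position 3 would show a positive GC skew and a negative AT skew. Detecting a sign change at the replication origin would require evaluating the skews gene by gene along the chromosome, with each gene oriented consistently, which this sketch does not do.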

14.
A+T content, phylogenetic relationships, codon usage, evolutionary rates, and ratio of synonymous versus non-synonymous substitutions have been studied in partial sequences of the atpD and aroQ/pheA genes of primary (Buchnera) and secondary symbionts of aphids and a set of selected non-symbiotic bacteria, belonging to the five subdivisions of the Proteobacteria. Compared to the homologous genes of the last group, both genes belonging to Buchnera behave in a similar way, showing a higher A+T content, forming a monophyletic group, and exhibiting a loss in codon bias, especially in the third base position, an evolutionary acceleration, and an increase in the number of non-synonymous substitutions, confirming previous results reported elsewhere for other genes. When available, these properties have been partly observed with the secondary symbionts, but with values that are intermediate between Buchnera and free-living Proteobacteria. They show high A+T content, but not as high as Buchnera; an unresolved phylogenetic position between Buchnera and the other γ-Proteobacteria; a loss in codon bias, again not as high as in Buchnera; and a significant evolutionary acceleration in the case of the three atpD genes, but not when considering aroQ/pheA genes. These results give support to the hypothesis that they are symbionts at different stages of the symbiotic accommodation to the host.

15.
16.
The extent to which base composition and codon usage vary among RNA viruses, and the possible causes of this bias, are undetermined in most cases. A maximum-likelihood statistical method was used to test whether base composition and codon usage bias covary with arthropod association in the genus Flavivirus, a major source of disease in humans and animals. Flaviviruses are transmitted by mosquitoes, by ticks, or directly between vertebrate hosts. Those viruses associated with ticks were found to have a significantly lower G+C content than non-vector-borne flaviviruses and this difference was present throughout the genome at all amino acids and codon positions. In contrast, mosquito-borne viruses had an intermediate G+C content which was not significantly different from those of the other two groups. In addition, biases in dinucleotide and codon usage that were independent of base composition were detected in all flaviviruses, but these did not covary with arthropod association. However, the overall effect of these biases was slight, suggesting only weak selection at synonymous sites. A preliminary analysis of base composition, codon usage, and vector specificity in other RNA virus families also revealed a possible association between base composition and vector specificity, although with biases different from those seen in the Flavivirus genus. Received: 29 August 2000 / Accepted: 19 December 2000

17.
The phylogenetic relationships of genus Passer (Old World sparrows) have been studied with species covering their complete world living range. Mitochondrial (mt) cyt b genes and pseudogenes have been analyzed, the latter being strikingly abundant in genus Passer compared with other studied songbirds. The significance of these Passer pseudogenes is presently unclear. The mechanisms by which mt cyt b genes become pseudogenes after nuclear translocation are discussed together with their mode of evolution, i.e., the mitochondrial transition/transversion ratio is decreased in the nucleus, as is the constraint for variability at the three codon positions. However, the skewed base composition according to codon position (in 1st position the percentage is very similar for the four bases, in 2nd position there is a lower percentage of A and G and a higher percentage of T, and the 3rd codon position has a lower percentage of G and T and is very rich in A and C) is maintained in the translocated nuclear pseudogenes. Different nuclear internal mechanisms and/or selective pressures must exist to explain this differential nuclear/mitochondrial variability in DNA base evolution. Also, the phylogenetic usefulness of pseudogenes for defining relationships between closely related lineages is stressed. The analyses suggest that the primitive genus Passer species comes from Africa, the Cape sparrow being the oldest; P. hispaniolensis italiae is more likely conspecific with P. domesticus than with P. hispaniolensis. Also, Passer species are not included within weavers or Estrildinae or Emberizinae, as previously suggested. European and American Emberizinae sparrows are closely related to each other and seem to be the earliest species that radiated among the studied songbirds (all in the Miocene Epoch). Received: 29 November 2000 / Accepted: 22 March 2001

18.
While globin genes ctt-2β and ctt-9.1 in Chironomus thummi thummi each have a single intron, all of the other insect globin genes reported so far are intronless. We analyzed four globin genes linked to the two intron-bearing genes in C. th. thummi. Three have a single intron at the same position as ctt-2β and ctt-9.1; the fourth is intronless and lies between intron-bearing genes. Finally, in addition to its intron, one gene (ctt-13RT) was recently interrupted by retrotransposition. Phylogenetic analyses show that the six genes in C. th. thummi share common ancestry with five globin genes in the distantly related species C. tentans, and that a 5-gene ancestral cluster predates the divergence of the two species. One gene in the ancestral cluster gave rise to ctn-ORFB in C. tentans, and duplicated in C. th. thummi to create ctt-11 and ctt-12. From parsimonious calculations of evolutionary distances since speciation, ctt-11, ctt-12, and ctn-ORFB evolved rapidly, while ctn-ORFE in C. tentans evolved slowly compared to other globin genes in the clusters. While these four globins are under selective pressure, we suggest that most chironomid globin genes were not selected for their unique function. Instead, we propose that high gene copy number itself was selected because conditions favored organisms that could synthesize more hemoglobin. High gene copy number selection to produce more of a useful product may be the basis of forming multigene families, all of whose members initially accumulate neutral substitutions while retaining essential function. Maintenance of a large family of globin genes not only ensured high levels of hemoglobin production, but may have facilitated the extensive divergence of chironomids into as many as 5000 species. Received: 31 December 1996 / Accepted: 16 May 1997

19.
The evolutionary relationship of muscle and nonmuscle actin isoforms in deuterostomia was studied by the isolation and characterization of two actin genes from the cephalochordate Branchiostoma lanceolatum and two from the hemichordate Saccoglossus kowalevskii. The Branchiostoma genes specify a muscle and a nonmuscle actin type, respectively. Together with earlier results on muscle actins from vertebrates and urochordates, an N-terminal sequence signature is defined for chordate muscle actins. These diagnostic amino acid residues separate the chordates from the echinoderms and other metazoa. Although the two Saccoglossus actins characterized so far lack the diagnostic residues, in line with the presumptive phylogenetic position of hemichordates outside the chordates, a definitive conclusion can only be expected once the full complement of actin genes of Saccoglossus is established. Comparison of the intron patterns of the various deuterostomic actin genes shows that intron 330-3, which is present in all vertebrate genes, is conspicuously absent from nonvertebrate genes. The possible origin of this intron is discussed. Received: 4 July 1997 / Accepted: 29 August 1997

20.
Complete sequences of seven protein-coding genes from Penaeus notialis mitochondrial DNA were compared in base composition and codon usage with homologous genes from Artemia franciscana and four insects. The crustacean genes are significantly less A + T-rich than their counterparts in insects and the pattern of codon usage (ratio of G + C-rich versus A + T-rich codons) is less biased. A phylogenetic analysis using amino acid sequences of the seven corresponding polypeptides supports a sister-taxon status for mollusks–annelids and arthropods. Furthermore, a distance matrix-based tree and two most-parsimonious trees both suggest that crustaceans are paraphyletic with respect to insects. This is also supported by the inclusion of Panulirus argus COII (complete) and COI and COIII (partial) sequence data. From analysis of single and combined genes to infer phylogenies, it is observed that topologies obtained from single genes are not well supported in most cases and differ notably from that of the tree based on all seven genes. Received: 25 August 1998 / Accepted: 8 March 1999
