首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 46 毫秒
1.
Codon usage in higher plants, green algae, and cyanobacteria   总被引:3,自引:1,他引:2  
Codon usage is the selective and nonrandom use of synonymous codons by an organism to encode the amino acids in the genes for its proteins. During the last few years, a large number of plant genes have been cloned and sequenced, which now permits a meaningful comparison of codon usage in higher plants, algae, and cyanobacteria. For the nuclear and organellar genes of these organisms, a small set of preferred codons are used for encoding proteins. Codon usage is different for each genome type with the variation mainly occurring in choices between codons ending in cytidine (C) or guanosine (G) versus those ending in adenosine (A) or uridine (U). For organellar genomes, chloroplastic and mitochrondrial proteins are encoded mainly with codons ending in A or U. In most cyanobacteria and the nuclei of green algae, proteins are encoded preferentially with codons ending in C or G. Although only a few nuclear genes of higher plants have been sequenced, a clear distinction between Magnoliopsida (dicot) and Liliopsida (monocot) codon usage is evident. Dicot genes use a set of 44 preferred codons with a slight preference for codons ending in A or U. Monocot codon usage is more restricted with an average of 38 codons preferred, which are predominantly those ending in C or G. But two classes of genes can be recognized in monocots. One set of monocot genes uses codons similar to those in dicots, while the other genes are highly biased toward codons ending in C or G with a pattern similar to nuclear genes of green algae. Codon usage is discussed in relation to evolution of plants and prospects for intergenic transfer of particular genes.  相似文献   

2.
Among a sample of 39 Geodia cydonium (Demospongiae, Porifera) genes, with an average G + C content of 51.2%, extensive structural heterogeneity and considerable variations in synonymous codon usage were found. The G + C content of coding sequences and G + C content at silent codon positions (GC3S) varied from 42.4 to 59.2% and from 35.6 to 76.5%, respectively. Correspondence analysis of 39 genes revealed that putative highly expressed genes preferentially use a limited subset of codons, which were therefore defined as preferred codons in G. cydonium . A total of 22 preferred codons for 18 amino acids with synonyms in codons were identified and they all (with one exception) end with C or G. Among these codons there are also C- and G-ending codons which were previously identified as codons optimal for translation in a variety of eukaryotes, including metazoans and plants. The bias in synonymous codon usage in putative highly expressed G. cydonium genes is moderate, indicating that these genes are not shaped under strong natural selection. We postulate that the preference for C- and G-ending codons was already established in the ancestor of all Metazoa, including also sponges. This ancestor most probably also had a G + C rich genome. The selection toward C- and G-ending codons has been largely conserved throughout eukaryote evolution; exceptions are, for example, mammals for which strong mutational biases caused switches from that rule.  相似文献   

3.
Site-Specific Codon Bias in Bacteria   总被引:1,自引:1,他引:1       下载免费PDF全文
J. M. Smith  N. H. Smith 《Genetics》1996,142(3):1037-1043
Sequences of the gapA and ompA genes from 10 genera of enterobacteria have been analyzed. There is strong bias in codon usage, but different synonymous codons are preferred at different sites in the same gene. Site-specific preference for unfavored codons is not confined to the first 100 codons and is usually manifest between two codons utilizing the same tRNA. Statistical analyses, based on conclusions reached in an accompanying paper, show that the use of an unfavored codon at a given site in different genera is not due to common descent and must therefore be caused either by sequence-specific mutation or sequence-specific selection. Reasons are given for thinking that sequence-specific mutation cannot be responsible. We are unable to explain the preference between synonymous codons ending in C or T, but synonymous choice between A and G at third sites is largely explained by avoidance of AG-G (where the hyphen indicates the boundary between codons). We also observed that the preferred codon for proline in Enterobacter cloacea has changed from CCG to CCA.  相似文献   

4.
An increasing number of cases where tri-nucleotide stop codons do not signal the termination of protein synthesis are being reported. In order to identify what constitutes an efficient stop signal, we analysed the region around natural stop codons in genes from a wide variety of eukaryotic species and gene families. Certain stop codons and nucleotides following stop codons are over-represented, and this pattern is accentuated in highly expressed genes. For example, the preferred signal for Saccharomyces cerevisiae and Drosophila melanogaster highly expressed genes is UAAG, and generally the signals UAA(A/G) and UGA(A/G) are preferred in eukaryotes. The GC% of the organism or DNA region can affect whether there is A or G in the second or fourth positions. We suggest therefore, that the stop codon and the nucleotide following it comprise a tetra-nucleotide stop signal. A model is proposed in which the polypeptide chain release factor, a protein, recognises this sequence, but will tolerate some substitution, particularly A to G in the second or third positions.  相似文献   

5.
Codon-anticodon recognition and transfer RNA utilization for the leucine tRNA isoaccepting species of Escherichia coli have been studied by protein synthesis in vitro directed by sequenced bacteriophage MS2 RNA. We have added radioactive Leu-tRNALeu isoaccepting species as tracers, rather than use a tRNA-dependent system, since in the presence of an excess of non-radioactive leucine, there is no transfer of radioactive leucine from one isoaccepting species to another. MS2-specific peptides containing leucine residues encoded by known codons were isolated and identified, and the relative abilities of the Leu-tRNALeu isoaccepting species to transfer leucine into these peptides compared. Sequenced tRNA1Leu and sequenced tRNA3Leu are of roughly equal efficiency in their ability to recognize CUC and CUA codons, while tRNA3Leu is highly preferred for the CUU codon; tRNA4Leu and tRNA5Leu both recognize UUA and UUG codons, with tRNA4Leu slightly preferred for the UUA codon. We conclude that: (1) wobble is greater than permitted by the wobble hypothesis; (2) there is still some discrimination in the third code letter, and that the CUX4 (CUC, CUA, CUU, CUG) portion of the leucine family of six codons is not read by a simple “two out of three” mechanism; (3) a Watson-Crick pair (C · G) between codon and anticodon does not appear to be preferred over an unorthodox pair (C · C) in the wobble position; (4) a standard wobble pair (U · G) between codon and anticodon is preferred over an unorthodox pair (U · C); and (5) the extensive wobble observed in the CUX4 leucine codon series is not paralleled in the UUX4 leucine (UUG, UUA) and phenylalanine (UUU, UUC) codon series, where mistranslation would be the consequence of such wobble.  相似文献   

6.
Preferential codon usage in genes   总被引:1,自引:0,他引:1  
We present a method which permits comparison of the preferential use of degenerate codons within any gene. The method makes use of the triplet frequencies in the noncoding frames to assess whether a preference is specific to the reading frame. Preference is given a statistical meaning by use of the analysis of variance coupled to Duncan's multiple range test.Preferential use of degenerate codons is gene-specific and independent of gene size. The data suggest that any correlation between codon frequency distribution and tRNA levels is unreliable. In those animal genes examined, codons ending in C or G are preferred; in animal viruses tested, codons ending in U or A are preferred. Similarly, the bacterial genes and the genes of single-stranded DNA phages that we analyzed differed from each other as well as from eukaryotic genes in the third base of the codon.  相似文献   

7.
A gene tree is an evolutionary reconstruction of the genealogical history of the genetic variation found in a sample of homologous genes or DNA regions that have experienced little or no recombination. Gene trees have the potential of straddling the interface between intra- and interspecific evolution. It is precisely at this interface that the process of speciation occurs, and gene trees can therefore be used as a powerful tool to probe this interface. One application is to infer species status. The cohesion species is defined as an evolutionary lineage or set of lineages with genetic exchangeability and/or ecological interchangeability. This species concept can be phrased in terms of null hypotheses that can be tested rigorously and objectively by using gene trees. First, an overlay of geography upon the gene tree is used to test the null hypothesis that the sample is from a single evolutionary lineage. This phase of testing can indicate that the sampled organisms are indeed from a single lineage and therefore a single cohesion species. In other cases, this null hypothesis is not rejected due to a lack of power or inadequate sampling. Alternatively, this null hypothesis can be rejected because two or more lineages are in the sample. The test can identify lineages even when hybridization and lineage sorting occur. Only when this null hypothesis is rejected is there the potential for more than one cohesion species. Although all cohesion species are evolutionary lineages, not all evolutionary lineages are cohesion species. Therefore, if the first null hypothesis is rejected, a second null hypothesis is tested that all lineages are genetically exchangeable and/or ecologically interchangeable. This second test is accomplished by direct contrasts of previously identified lineages or by overlaying reproductive and/or ecological data upon the gene tree and testing for significant transitions that are concordant with the previously identified lineages. Only when this second null hypothesis is rejected is a lineage elevated to the status of cohesion species. By using gene trees in this manner, species can be identified with objective, a priori criteria with an inference procedure that automatically yields much insight into the process of speciation. When one or more of the null hypotheses cannot be rejected, this procedure also provides specific guidance for future work that will be needed to judge species status.  相似文献   

8.
Studies on codon usage in Entamoeba histolytica   总被引:13,自引:0,他引:13  
Codon usage bias of Entamoeba histolytica, a protozoan parasite, was investigated using the available DNA sequence data. Entamoeba histolytica having AT rich genome, is expected to have A and/or T at the third position of codons. Overall codon usage data analysis indicates that A and/or T ending codons are strongly biased in the coding region of this organism. However, multivariate statistical analysis suggests that there is a single major trend in codon usage variation among the genes. The genes which are supposed to be highly expressed are clustered at one end, while the majority of the putatively lowly expressed genes are clustered at the other end. The codon usage pattern is distinctly different in these two sets of genes. C ending codons are significantly higher in the putatively highly expressed genes suggesting that C ending codons are translationally optimal in this organism. In the putatively lowly expressed genes A and/or T ending codons are predominant, which suggests that compositional constraints are playing the major role in shaping codon usage variation among the lowly expressed genes. These results suggest that both mutational bias and translational selection are operational in the codon usage variation in this organism.  相似文献   

9.
In Drosophila melanogaster, synonymous codons corresponding to the most abundant cognate tRNAs are used more frequently, especially in highly expressed genes. Increased use of such "optimal" codons is considered an adaptation for translational efficiency. Need it always be the case that selection should favor the use of a translationally optimal codon? Here, we investigate one possible confounding factor, namely, the need to specify information in exons necessary to enable correct splicing. As expected from such a model, in Drosophila many codons show different usage near intron-exon boundaries versus exon core regions. However, this finding is in principle also consistent with Hill-Robertson effects modulating usage of translationally optimal codons. However, several results support the splice model over the translational selection model: 1) the trends in codon usage are strikingly similar to those in mammals in which codon usage near boundaries correlates with abundance in exonic splice enhancers (ESEs), 2) codons preferred near boundaries tend to be enriched for A and avoid C (conversely those avoided near boundaries prefer C rather than A), as expected were ESEs involved, and 3) codons preferred near boundaries are typically not translationally optimal. We conclude that usage of translationally optimal codons usage is compromised in the vicinity of splice junctions in intron-containing genes, to the effect that we observe higher levels of usage of translationally optimal codons at the center of exons. On the gene level, however, controlling for known correlates of codon bias, the impact on codon usage patterns is quantitatively small. These results have implications for inferring aspects of the mechanism of splicing given nothing more than a well-annotated genome.  相似文献   

10.
Ryman N  Jorde PE 《Molecular ecology》2001,10(10):2361-2373
A variety of statistical procedures are commonly employed when testing for genetic differentiation. In a typical situation two or more samples of individuals have been genotyped at several gene loci by molecular or biochemical means, and in a first step a statistical test for allele frequency homogeneity is performed at each locus separately, using, e.g. the contingency chi-square test, Fisher's exact test, or some modification thereof. In a second step the results from the separate tests are combined for evaluation of the joint null hypothesis that there is no allele frequency difference at any locus, corresponding to the important case where the samples would be regarded as drawn from the same statistical and, hence, biological population. Presently, there are two conceptually different strategies in use for testing the joint null hypothesis of no difference at any locus. One approach is based on the summation of chi-square statistics over loci. Another method is employed by investigators applying the Bonferroni technique (adjusting the P-value required for rejection to account for the elevated alpha errors when performing multiple tests simultaneously) to test if the heterogeneity observed at any particular locus can be regarded significant when considered separately. Under this approach the joint null hypothesis is rejected if one or more of the component single locus tests is considered significant under the Bonferroni criterion. We used computer simulations to evaluate the statistical power and realized alpha errors of these strategies when evaluating the joint hypothesis after scoring multiple loci. We find that the 'extended' Bonferroni approach generally is associated with low statistical power and should not be applied in the current setting. Further, and contrary to what might be expected, we find that 'exact' tests typically behave poorly when combined in existing procedures for joint hypothesis testing. Thus, while exact tests are generally to be preferred over approximate ones when testing each particular locus, approximate tests such as the traditional chi-square seem preferable when addressing the joint hypothesis.  相似文献   

11.
Codon usage in a sample of 28 genes from the pathogenic yeast Candida albicans has been analysed using multivariate statistical analysis. A major trend among genes, correlated with gene expression level, was identified. We have focussed on the extent and nature of divergence between C.albicans and the closely related yeast Saccharomyces cerevisiae. It was recently suggested that significant differences exist between the subsets of preferred codons in these two species [Brown et al. (1991) Nucleic Acids Res. 19, 4293]. Overall, the genes of C.albicans are more A + T-rich, reflecting the lower genomic G + C content of that species, and presumably resulting from a different pattern of mutational bias. However, in both species highly expressed genes preferentially use the same subset of 'optimal' codons. A suggestion that the low frequency of NCG codons in both yeast species results from selection against the presence of codons that are potentially highly mutable is discounted. Codon usage in C.albicans, as in other unicellular species, can be interpreted as the result of a balance between the processes of mutational bias and translational selection. Codon usage in two related Candida species, C.maltosa and C.tropicalis, is briefly discussed.  相似文献   

12.
A novel method using the nonparametric bootstrap is proposed for testing whether a quantitative trait locus (QTL) at one chromosomal position could explain effects on two separate traits. If the single-QTL hypothesis is accepted, pleiotropy could explain the effect on two traits. If it is rejected, then the effects on two traits are due to linked QTLs. The method can be used in conjunction with several QTL mapping methods as long as they provide a straightforward estimate of the number of QTLs detectable from the data set. A selection step was introduced in the bootstrap procedure to reduce the conservativeness of the test of close linkage vs. pleiotropy, so that the erroneous rejection of the null hypothesis of pleiotropy only happens at a frequency equal to the nominal type I error risk specified by the user. The approach was assessed using computer simulations and proved to be relatively unbiased and robust over the range of genetic situations tested. An example of its application on a real data set from a saline stress experiment performed on a recombinant population of wheat (Triticum aestivum L. ) doubled haploid lines is also provided.  相似文献   

13.
14.
In a 15-month study of anthropoid primates in the forest of eastern Borneo, observers made 1230 contacts with groups of five different species within a study area of approximately 3 km2. A test of the hypothesis that there is no effect on the presence of one species by the presence of another (using the method of partial correlation) shows that in every pair of species the presence of one has a negative effect on the presence of the other, and that in seven of ten pairs the hypothesis may be rejected at the 97% level of confidence or better. Three cases in which the hypothesis cannot be rejected are found to have biological meaning: behavioral data show that niche separation of the pairs of species is such that their distributions should be independent. In spite of widely divergent feeding and locomotor adaptations, it is apparent that the species are competing for use of the forest.  相似文献   

15.
Biased codon usage is common in eukaryotic and prokaryotic genes. Evidence from Escherichia, Saccharomyces, and Drosophila indicates that it favors translational efficiency and accuracy. However, to date no functional advantages have been identified in the codon–anticodon interactions involving the most frequently used (preferred) codons. Here we present evidence that forces not related to the individual codon–anticodon interaction may be involved in determining which synonymous codons are preferred or avoided. We show that the ``off-frame' trinucleotide motif preferences inferrable from Drosophila coding regions are often in the same direction as Drosophila's ``in-frame' codon preferences, i.e., its codon usage. The off-frame preferences were inferred from the nonrandomness of the location of confamilial synonymous codons along coding regions—a pattern often described as a context dependence of nucleotide choice at synonymous positions or as codon-pair bias. We relied on randomizations of the location of confamilial codons that do not alter, and cannot be influenced by, the encoded amino acid sequences, codon usage, or base composition of the genes examined. The statistically significant congruency of in-frame and off-frame trinucleotide preferences suggests that the same kind of reading-frame-independent force(s) may also influence synonymous codon choice. These forces may have produced biases in codon usage that then led to the evolution of the translational advantages of these motifs as preferred codons. Under this scenario, tRNA pool size differences between preferred and nonpreferred codons initially were evolved to track the default overrepresentation of codons with preferred motifs. The motif preference hypothesis can explain the structuring of codon preferences and the similarities in the codon usages of distantly related organisms. Received: 10 November 1998 / Accepted: 23 February 1999  相似文献   

16.
Detection of major gene for Gilles de la Tourette syndrome   总被引:15,自引:12,他引:3  
The families of 250 consecutive, unselected patients with Tourette syndrome (TS) were analyzed. If the parents had either motor or vocal tics, but not both, there was an increased risk of both TS and tics in the offspring. The mode of inheritance of the combined tic-Tourette trait was evaluated in both nuclear families and extended pedigrees. Complex segregation analysis was carried out allowing for possible contributions from both a major autosomal locus and multifactorial inheritance of variation in the background of each genotype. The most likely mode of inheritance was a major semidominant gene, Ts, with low heritability of the multifactorial background variation. This was true regardless of assumptions about the prevalence of the disorder. The hypothesis of strict multifactorial inheritance could not be rejected with nuclear family data alone. However, the hypothesis of no major gene effect was rejected using data on 3 generations for any estimate of lifetime risk less than 12 per 1,000 in the general population. A pure recessive major gene effect was also rejected. With a gene frequency of approximately .5%, the penetrance was estimated to be about 94% in abnormal Ts/Ts homozygotes, 50% in Ts/ts heterozygotes, and less than 0.3% in normal ts/ts homozygotes. More than two of every three cases are heterozygotes, and nearly all other cases are phenocopies or new mutations. This is the first demonstration by segregation analysis of a major gene in a human neuropsychiatric disorder with a frequency approaching 1% of the population.  相似文献   

17.
Carlini DB  Stephan W 《Genetics》2003,163(1):239-243
The evolution of codon bias, the unequal usage of synonymous codons, is thought to be due to natural selection for the use of preferred codons that match the most abundant species of isoaccepting tRNA, resulting in increased translational efficiency and accuracy. We examined this hypothesis by introducing 1, 6, and 10 unpreferred codons into the Drosophila alcohol dehydrogenase gene (Adh). We observed a significant decrease in ADH protein production with number of unpreferred codons, confirming the importance of natural selection as a mechanism leading to codon bias. We then used this empirical relationship to estimate the selection coefficient (s) against unpreferred synonymous mutations and found the value (s >or= 10(-5)) to be approximately one order of magnitude greater than previous estimates from population genetics theory. The observed differences in protein production appear to be too large to be consistent with current estimates of the strength of selection on synonymous sites in D. melanogaster.  相似文献   

18.
During the evolution of living organisms, a natural selection event occurs toward the optimization of their genomes regarding the usage of codons. During this process which is known as codon bias, a set of preferred codons is naturally defined in the genome of a given organism, since there are 61 possible codons (plus 3 stop codons) to 20 amino acids. Such event leads to optimization of metabolic cellular processes such as translational efficiency, RNA stability and energy saving. Although we know why, we do not know how exactly a set of preferred codons for each amino acid is defined for a given genome considering that the usage frequency of each synonymous codons is peculiar to each organism. In order to help answering this question, we analyzed the usage frequency of codons which are similar to stop codons, since a minor mutation on these codons may lead to a stop codon into the open reading frame compromising the protein expression as a result. We found a reduced use of those codons in Xanthomomas axonopodis pv. citri which presents an optimized genome regarding codon usage. On the other hand, such codons are more often used in Xylella fastidiosa, which does not seem to have established codon preferences as previously shown. Our results support that a set of preferred codons is not randomly selected and propose new ideas to the field warranting further experiments in this regard.  相似文献   

19.
The pattern of codon utilization in the variable and constant regions of immunoglobulin genes are compared. It is shown that, in these regions, codon utilizations are quite distinct from one another: For most degenerate codons, there is a selective bias that prefers C and/or G ending codons to U and/or A ending codons in the constant region compared with the bias in the variable region. This would strongly suggest that, in immunoglobulin genes, the bias in code word usage is determined by other factors than those concerning with the translational mechanism such as tRNA availability and codon-anticodon interaction. A possibility is also suggested that this differance of code word usage between them is due to the existence of secondary structure in the constant region but not in the variable region.  相似文献   

20.
人类基因同义密码子偏好的特征以及与基因GC含量的关系   总被引:24,自引:0,他引:24  
对人类的728个基因,按其编码区中GC的含量分成四组(从GC<0.43到GC>0.58),分别考察了这四组样本对同义密码子偏好的特征,发现在全部样本中都呈现NTG(N代表四种碱基中的任一种)特受偏爱和NCG尽量避免的特征.基因环境中GC含量与C3/G3含量(密码子第三位C和G的含量)的相关分析,以及四组样本对密码子的偏好都支持以C结尾的密码子在编码中有特殊的优势,这种优势有利于保证翻译的准确性.还考察了各种氨基酸含量随编码区GC含量不同而变化的趋势.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号