首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 0 毫秒
1.
Summary We have investigated the relationship between the G + C content of silent (synonymous) sites in codons and the amino acid composition of encoded proteins for approximately 1,600 human genes. There are positive correlations between silent site G + C and the proportions of codons for Arg, Pro, Ala, Trp, His, Gln, and Leu and negative ones for Tyr, Phe, Asn, Ile, Lys, Asp, Thr, and Glu. The median proteins coded by groups of genes that differ in silent-site G + C content also differ in amino acid composition, as do some proteins coded by homologous genes. The pattern of compositional change can be largely explained by directional mutation pressure, the genetic code, and differences in the frequencies of accepted amino acid substitutions; the shifts in protein composition are likely to be selectively neutral.Offprint requests to: D.W. Collins  相似文献   

2.
FramePlot is a web-based tool for predicting protein-coding regions in bacterial DNA with a high G + C content, such as Streptomyces. The graphical output provides for easy distinction of protein-coding regions from non-coding regions. The plot is a clickable map. Clicking on an ORF provides not only the nucleotide sequence but also its deduced amino acid sequence. These sequences can then be compared to the NCBI sequence database over the Internet. The program is freely available for academic purposes at http://www.nih.go.jp/jun/cgi-bin/frameplot.pl.  相似文献   

3.
The correlation between genomic G+C content and optimal growth temperature in prokaryotes has gained renewed interest after Musto et al. [H. Musto, H. Naya, A. Zavala, H. Romero, F. Alvarex-Valin, G. Bernardi, Correlations between genomic GC levels and optimal growth temperatures in prokaryotes, FEBS Lett. 573 (2004) 73-77], reported that positive correlations exist in 15 families studied. We have reanalyzed their data and found that when genome size and data quality were adjusted for, there was no significant evidence of relationship between optimal temperature and GC content for two of the families that had previously shown strongly significant correlations. Using updated temperature optima for Halobacteriaceae species we found the correlation is insignificant in this family. For the family Enterobacteriaceae when genome size and optimal temperature are included in a multiple linear regression, only genome size is significant as a predictor of GC content. We showed that more profound statistical methods than simple two factor correlation analysis should be used for analyzing complex intrinsic and extrinsic factors that affect genomic GC content. We further found that a positive correlation between temperature and genomic GC is only evident in free-living species of low optimal growth temperatures.  相似文献   

4.
5.
Identifying the G + C difference between closely related bacterial species or between different strains of the same species is one of the first steps in understanding the evolutionary mechanisms accounting for the differences observed among bacterial species. The G + C content can be one of the most important factors in the evolution of genomic structures. In this paper, we describe a new method for detecting an initial stage of differentiation of the G + C content at the third codon base position between two strains of the same bacterial species. We apply this method to the two strains of Helicobacter pylori. A group of genes is detected with large variations of G + C in the third positions—apparently genes of early response to pressures of changing G + C. We discuss our findings from the viewpoint of genomic evolution. Received: 26 February 2001 / Accepted: 16 May 2001  相似文献   

6.
目的:使用反相高效液相色谱法测定色盐杆菌新种ST307的DNA G+C mol%含量.方法:以Escherichia coli DH5α为标准菌株,采用90%重蒸水10%甲醇为流动相,检测波长260nm,流速1ml·min-1,在Venusil MP C18柱上对四种碱基进行分离.结果:DNA碱基分离效果好,以外标法计算得到标准菌株DH5α的DNA G+C mol%含量为50.3%,待测菌株ST307的DNA G+C mol%含量为60.5%.结论:采用反向高效液相色谱法测定色盐杆菌的DNA G+C mol%含量准确可靠.  相似文献   

7.
Summary We develop a mathematical model for estimating evolutionary distance from restriction enzyme maps, which incorporates non-uniformity of the rate of base substitution into the theory and allows for an arbitrary G+C content at equilibrium. When the G+C content differs significantly from 1/2, the traditional model of base changes can introduce a systematic bias which depends upon the base composition of the restriction site. In addition, the accuracy of estimated evolutionary distance depends heavily upon the choice of restriction enzyme in that the expected number of sites is also affected. Monte Carlo experiments are conducted to check the validity of the present theoretical treatment and from which we draw several cautionary notes on estimation. An application is made to the available data on restriction enzyme maps of human mitochondrial DNA where the G+C content is approximately 1/3.Contribution No. 1372 from the National Institute of Genetics, Mishima, 411 Japan  相似文献   

8.
DNA of some anaerobic rumen fungi: G + C content determination   总被引:2,自引:0,他引:2  
The nuclear DNAs from five species of anaerobic rumen fungi have been isolated and purified by means of two extraction methods (with and without 8 M urea). Their G + C contents have been characterized by the thermal denaturation procedure of Marmur and Doty. As has already been shown in Neocallimastix frontalis, the results obtained by the two techniques demonstrated a very low G + C content (less than 20%) and the constant presence of satellite DNA.  相似文献   

9.
Summary One hundred twelve human DNA sequences were analyzed with respect to dinucleotide frequency and amino acid composition. The variation in guanine and cytosine (G+C) content revealed: (1) at 2–3 and 3-1 doublet positions CG discrimination is attenuated at high G+C, but TA disfavor is enhanced, and (2) several amino acids are subject to G+C change. These findings have been reported in part for collections of sequences from various species. The present study confirms that in a single organism-the human-the G+C effects do exist. Aspects of the argument that connects G+C with protein thermal stability are also discussed.  相似文献   

10.
For the isolation of gene space representative sequence elements, a new methodology—high C+G patch (HCGP) filtration—has been developed using rice as a model. The method is based on the fragmentation of the genomic DNA by methylation-sensitive HpaII and MspI restriction endonucleases having exclusively G/C base pair-containing recognition sites. These enzymes fragment the genome at high C+G content and hypomethylated regions. Cloning fragments spanning such regions in close vicinity (200–2,000 bp) revealed that about 60% of the clones represented gene space sequences resulting in twofold enrichment of these sequences, which is close to the theoretical maximum in rice. The sequence information of clones used in the present study was deposited in the NCBI database under the accession numbers EI 365676–EI 366364.  相似文献   

11.
刘林梦  温权  欧竑宇 《微生物学通报》2014,41(12):2583-2592
【目的】为识别已完成全测序细菌基因组中的ncRNA基因,对3个常用ncRNA预测工具s RNAPredict、PORTRAIT和s RNAscanner进行评估。【方法】选择了细菌ncRNA数据库BSRD收录的含有已知ncRNA基因数目大于30的9个细菌基因组,并按基因组G+C含量进行分类,比较s RNAPredict和PORTRAIT工具的预测准确性。提取不同G+C含量基因组中ncRNA基因转录起始和终止区的序列特征,对s RNAscanner预测结果进行评估。【结果】s RNAPredict对细菌ncRNA基因的预测特异性和阳性检出率均高于PORTRAIT,而敏感性则较差;两种工具预测效果均随基因组G+C含量不同而产生明显变化。在不同G+C含量的细菌基因组中,ncRNA基因启动子和终止子区域的序列特征有明显差异。利用这些序列特征能提高s RNAscanner预测ncRNA基因的平均水平。【结论】3种ncRNA基因工具预测效果随基因组G+C含量变化而不同。不同G+C含量基因组中ncRNA基因的转录起始和终止区特征可作为ncRNA基因预测的重要参数之一。  相似文献   

12.
G:C pairs are more stable than A:T pairs because they have an additional hydrogen bond. This has led to many studies on the correlation between the guanine+cytosine (G+C) content of nucleic acids and temperature over the last 20 years. We collected the optimal growth temperatures (Topt) and the G+C contents of genomic DNA; 23S, 16S, and 5S ribosomal RNAs; and transfer RNAs for 764 prokaryotic species. No correlation was found between genomic G+C content and Topt, but there were striking correlations between the G+C content of ribosomal and transfer RNA stems and Topt. Two explanations have been proposed—neutral evolution and selection pressure—for the approximate equalities of G and C (respectively, A and T) contents within each strand of DNA molecules. Our results do not support the notion that selection pressure induces complementary oligonucleotides in close proximity and therefore numerous secondary structures in prokaryotic DNA, as the genomic G+C content does not behave in the same way as that of folded RNA with respect to optimal growth temperature. Received: 25 September 1996 / Accepted: 21 January 1997  相似文献   

13.
Ou HY  Guo FB  Zhang CT 《FEBS letters》2003,540(1-3):188-194
The nucleotide distribution of all 33 527 open reading frames (ORFs) (≥300 bp) in the genome of Streptomyces coelicolor A3(2) has been analyzed using the Z curve method. Each ORF is mapped onto a point in a 9-dimensional space. To visualize the distribution of mapping points, the points are projected onto the principal plane based on principal component analysis. Consequently, the distribution pattern of the 33 527 points in the principal plane shows a flower-like shape, in which there are seven distinct regions. In addition to the central region, there are six petal-like regions around the center, one of which corresponds to 7172 coding sequences. The central region and the remaining five petal-like regions correspond to the intergenic sequences and out-of-frame non-coding ORFs, respectively. It is shown that selective pressure produces a remarkable bias of the G+C content among three codon positions, resulting in the interesting phenomenon observed. A similar phenomenon is also observed for other bacterial genomes with high genomic G+C content, such as Pseudomonas aeruginosa PA01 (G+C=66.6%). However, for the genomes of Bacillus subtilis (G+C=43.5%) and Clostridium perfringens (G+C=28.6%), no similar phenomenon was observed. The finding presented here may be useful to improve the gene-finding algorithms for genomes with high G+C content. A set of supplementary materials including the plots displaying the base distribution patterns of ORFs in 12 prokaryotes is provided on the website http://tubic.tju.edu.cn/highGC/.  相似文献   

14.
We conducted a genome-wide analysis of variations in guanine plus cytosine (G+C) content at the third codon position at silent substitution sites of orthologous human and mouse protein-coding nucleotide sequences. Alignments of 3776 human protein-coding DNA sequences with mouse orthologs having >50 synonymous codons were analyzed, and nucleotide substitutions were counted by comparing sequences in the alignments extracted from gap-free regions. The G+C content at silent sites in these pairs of genes showed a strong negative correlation (r = -0.93). Some gene pairs showed significant differences in G+C content at the third codon position at silent substitution sites. For example, human thymine-DNA glycosylase was A+T-rich at the silent substitution sites, while the orthologous mouse sequence was G+C-rich at the corresponding sites. In contrast, human matrix metalloproteinase 23B was G+C-rich at silent substitution sites, while the mouse ortholog was A+T-rich. We discuss possible implications of this significant negative correlation of G+C content at silent sites.  相似文献   

15.
Summary The G+C content of DNA varies widely in different organisms, especially microorganisms. This variation is accompanied by changes in the nucleotide composition of silent positions in codons. (Silent positions are defined and explained in the text.) These changes are mostly neutral or near neutral, and appear to result from mutation pressure in the direction of increasing either A+T (AT pressure) or G+C(GC pressure) content. Variations in G+C content are also accompanied by substitutions at replacement positions in codons. These substituions produce changes in the amino acid content of homologous proteins. The examples studied were genes for 13 mitochondrial proteins in five species, and A and B genes for bacterial tryptophan synthase in four species.In microorganisms, varying AT and GC mutational pressures, presumably resulting from shifts in the DNA polymerase system, exert strong effects on molecular evolution by changing the G+C content of DNA. These effects may be greater than those of random drift. The effects of GC pressure on silent substitutions in the systems examined are several times as great as the effects on replacement substitutions.GC pressure is exerted on noncoding as well as coding regions in mitochondrial DNA. This is shown by the close correlation (correlation coefficient, 0.99) of the G+C content of the noncoding D loop of mitochondria with the G+C content of silent positions in the corresponding mitochondrial genes.  相似文献   

16.
Abstract

We analysed complete or almost complete nucleotide sequences of the human, chimp, mouse, rat, chicken, dog, and other genomes to find that they contain extremely long (A+T) a (G+C) blocks that do not occur at all in the corresponding randomized sequences. The longest is an (A+T) block containing 1040 consecutive AT pairs that occurs in the 16th human chromosome. The longest human (G+C) block has 261 bp in length. About a half of the longest blocks occur in introns. The (A+T) blocks are discrete units whereas the (G+C) blocks are diffuse. They are embeeded in the genome through connectors longer than 1 kilobase where the (G+C) content gradually decreases to the value of 50%. Remarkably, the (A+T) as well as (G+C) blocks are substantially shorter in the chimp genome. Chicken is characteristic by very long (G+C) blocks that are even longer than in the human genome. Though much shorter, long (G+C) and especially (A+T) blocks occur in lower organisms as well, which means that AT and GC pair clustering is an ancient property that has evolved into large scales in higher eukaryote genomes and the human genome in particular. Very long (A+T) and (G+C) blocks confer specific biophysical properties on DNA that are likely to influence genome folding in cell nuclei and its functional properties.  相似文献   

17.
Sueoka N 《Gene》2002,300(1-2):141-154
The intra-strand Parity Rule 2 of DNA (PR2) states that A=T and G=C within each strands. Useful corollaries of PR2 are G/(G+C)=A/(A+T)=0.5, G/(G+A)=C/(C+T)=G+C, G/(G+T)=C/(C+A)=G+C. Here. A, T, G, and C represent relative contents of the four nucleotide residues in a specific strand of DNA, so that A+T+G+C=1. Thus, deviations from the PR2 is a sign of strand-specific (or asymmetric) mutation and/or selection pressures. The present study delineates the symmetric and asymmetric effects of mutations on the intra-genomic heterogeneity of the G+C content in the human genome. The results of this study on the human genome are: (1) When both two- and four-codon amino acids were combined, only slight departures from the PR2 were observed in the total ranges of G+C content of the third-codon position. Thus, the G+C heterogeneity is likely to be caused by symmetric mutagenesis between the two strands. (2) The above result makes the deamination of cytosine due to double-strand breathing of DNA [Mol. Biol. Evol. 17 (2000) 1371] and/or incorporation of the oxidized guanine (8-oxo-guanine) opposite adenine during DNA replication (dGTP-oxidation hypothesis) as the most likely candidates for the major cause of the diversities of the G+C content. (3) Patterns of amino acid-specific PR2-biases detected by plotting PR2 corollaries against the G+C content of third codon position revealed that eight four-codon amino acids can be divided into three types by the second codon letter: (a) C2-type (Ala, Pro, Ser4, and Thr), (b) G2-type (Arg4 and Gly), and (c) T2-type (Leu4 and Val). (4) Most of the asymmetric plot patterns of the above three classes in PR2 biases can be explained by C2→T2 deamination of C2pG3 of C2-type to T2pG3 (T2-type) in both human and chicken. This explains the existence of some preferred codons in human and chicken. However, these biases (asymmetric) hardly contribute to the overall G+C content diversity of the third codon position.  相似文献   

18.
The relative contribution of mutation and selection to the G+C content of DNA was analyzed in bacterial species having widely different G+C contents. The analysis used two methods that were developed previously. The first method was to plot the average G+C content of a set of nucleotides against the G+C content of the third codon position for each gene. This method was used to present the G+C distribution of the third codon position and to assess the relative neutrality of a set of nucleotides to that of the G+C content of the third codon position. The second method was to plot the intrastrand bias of the third codon position from Parity Rule 2 (PR2), where A=T and G=C. It was found that whereas intragenomic distributions of the DNA G+C content of these bacteria are narrow in the majority of species, in some species the G+C content of the minor class of genes distributes over wider ranges than the major class of genes. On the other hand, ubiquitous PR2 biases are amino acid specific and independent of the G+C content of DNA, so that when averaged over the amino acids, the biases are small and not correlated with the DNA G+C content. Therefore, translation coupled PR2-biases are unlikely to explain the wide range of G+C contents among different species. Considering all data available, it was concluded that the amino acid-specific PR2 bias has only a minor effect, if any, on the average G+C content. In addition, PR2 bias patterns of different species show phylogenetic relationships, and the pattern can be as a taxal fingerprint. Received: 5 November 1998 / Accepted: 1 March 1999  相似文献   

19.
20.
A novel C3G isoform, designated p87C3G, lacking the most amino terminal region of the cognate protein has been found to be overexpressed in two CML cell lines, K562 and Boff 210, both expressing Bcr-Abl p210. p87C3G expression is also highly augmented in patients diagnosed with chronic myeloid leukemia (CML) Ph+, in comparison with healthy individuals, and returns to basal levels after treatment with STI571. p87C3G co-immunoprecipitates with both CrkL and Bcr-Abl in CML cell lines and co-immunoprecipitation between p87C3G and Bcr-Abl was also detected in primary cells from CML patients. These interactions have been confirmed by in vitro pull down experiments. The interaction between p87C3G and Bcr-Abl involves the SH3-binding domain of p87C3G and the SH3 domain of Abl and depends mostly on the first polyproline region of p87C3G. Furthermore, we also demonstrated that p87C3G is phosphorylated in vitro by a Bcr-Abl-dependent mechanism. These results indicate that p87C3G overexpression is linked to CML phenotype and that p87C3G may exert productive functional interactions with Bcr-Abl signaling components suggesting the implication of this C3G isoform in the pathogenesis of chronic myeloid leukemia.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号