Similar articles
 20 similar articles found (search time: 15 ms)
1.
A protein-gene linkage map of the cyanobacterium Anabaena sp. strain PCC7120 was successfully constructed for 123 relatively abundant proteins. The total proteins extracted from the cell were resolved by two-dimensional electrophoresis, and the amino-terminal sequences of the protein spots were determined. By comparing the determined amino-terminal sequences with the entire genome sequence, the putative translation initiation sites of 87 genes were successfully assigned on the genome. The elucidated sequence features surrounding the translation initiation sites were as follows: (1) GTG and TTG, in addition to ATG, were used as rare initiation codons; (2) the core sequences (GAGG, GGAG and AGGA) of the Shine-Dalgarno sequence were identified in the appropriate position preceding 51 of the initiation sites (58.6%); (3) the nucleotides in the two regions from -35 to -33 and from -19 to -17 (relative to the first nucleotide in the initiation codon) were preferentially adenines or thymines; (4) the nucleotides in the region from -14 to -8 were preferentially purines; (5) the nucleotide at position -1 was biased towards non-guanine (96.6%); (6) the nucleotide at position +5 was preferentially cytosine (63.2%). It was evident that removal of the translation initiator methionine was dependent on the side-chain bulkiness of the penultimate amino acid residue. The predicted putative signal peptide sequences were also indicated. Besides confirming the existence of many predicted proteins, the data will serve as a starting point for the study of signals important in post-translational processing and nucleotide sequences important in the initiation of translation.
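
As a rough illustration of feature (2) above, the following sketch scans the region upstream of a candidate initiation codon for one of the Shine-Dalgarno core sequences named in the abstract. The motif list and the three start codons come from the abstract; the function name `find_sd_core`, the search window of -20 to -4, and the choice to report the most upstream hit are assumptions made for this example, not the authors' procedure.

```python
# Core motifs and start codons are taken from the abstract.
SD_CORES = ("GAGG", "GGAG", "AGGA")
START_CODONS = ("ATG", "GTG", "TTG")


def find_sd_core(seq, start, window=(-20, -4)):
    """Return (motif, offset) for the most upstream SD core before `start`.

    `start` is the 0-based index of the first nucleotide of the candidate
    initiation codon. Offsets are relative to that nucleotide, so -1 is the
    base immediately before the codon. The default window is an assumption.
    Returns None if the codon is not ATG/GTG/TTG or no core motif is found.
    """
    seq = seq.upper()
    if seq[start:start + 3] not in START_CODONS:
        return None
    lo = max(0, start + window[0])
    hi = max(0, start + window[1])
    upstream = seq[lo:hi]
    best = None
    for motif in SD_CORES:
        pos = upstream.find(motif)
        if pos != -1:
            offset = lo + pos - start
            # Keep the hit that lies furthest upstream.
            if best is None or offset < best[1]:
                best = (motif, offset)
    return best


# Hypothetical toy sequence: an AGGAGG stretch followed by an ATG codon.
example = "TTTTAGGAGGTTTTTTATGAAA"
hit = find_sd_core(example, 16)
```

Applied to every experimentally assigned initiation site, a count of non-`None` results divided by the number of sites would give a fraction comparable to the 51 of 87 (58.6%) reported above, provided the window matches what the authors considered the "appropriate position".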

2.
We developed a computer program, GeneHackerTL, which predicts the most probable translation initiation site for a given nucleotide sequence. The program requires that information be extracted from the nucleotide sequence data surrounding the translation initiation sites according to the framework of the Hidden Markov Model. Since the translation initiation sites of 72 highly abundant proteins have already been assigned on the genome of Synechocystis sp. strain PCC6803 by amino-terminal analysis, we extracted the necessary information for GeneHackerTL from the nucleotide sequence data. The prediction rate of GeneHackerTL for these proteins was estimated to be 86.1%. We then used GeneHackerTL to predict the translation initiation sites of 24 other proteins whose initiation sites had not been assigned experimentally because of the lack of a potential initiation codon at the amino-terminal position. For 20 out of the 24 proteins, the initiation sites were predicted upstream of their amino-terminal positions. According to this assignment, the processed regions represent a typical feature of signal peptides. We could also predict multiple translation initiation sites for a particular gene for which at least two initiation sites were experimentally detected. This program would be effective for the prediction of translation initiation sites of other proteins, not only in this species but also in other prokaryotes.

3.
L H Soe, C K Shieh, S C Baker, M F Chang, M M Lai. Journal of Virology, 1987, 61(12):3968-3976
A 28-kilodalton protein has been suggested to be the amino-terminal protein cleavage product of the putative coronavirus RNA polymerase (gene A) (M.R. Denison and S. Perlman, Virology 157:565-568, 1987). To elucidate the structure and mechanism of synthesis of this protein, the nucleotide sequence of the 5' 2.0 kilobases of the coronavirus mouse hepatitis virus strain JHM genome was determined. This sequence contains a single, long open reading frame and predicts a highly basic amino-terminal region. Cell-free translation of RNAs transcribed in vitro from DNAs containing gene A sequences in pT7 vectors yielded proteins initiated from the 5'-most optimal initiation codon at position 215 from the 5' end of the genome. The sequence preceding this initiation codon predicts the presence of a stable hairpin loop structure. The presence of an RNA secondary structure at the 5' end of the RNA genome is supported by the observation that gene A sequences were more efficiently translated in vitro when upstream noncoding sequences were removed. By comparing the translation products of virion genomic RNA and in vitro transcribed RNAs, we established that our clones encompassing the 5' end of the mouse hepatitis virus genomic RNA encode the 28-kilodalton N-terminal cleavage product of the gene A protein. Possible cleavage sites for this protein are proposed.

4.
5.

Background  

Utilization of alternative initiation sites for protein translation directed by non-AUG codons in mammalian mRNAs is observed with increasing frequency. Alternative initiation sites are utilized for the synthesis of important regulatory proteins that control distinct biological functions. It is, therefore, of high significance to define the parameters that allow accurate bioinformatic prediction of alternative translation initiation sites (aTIS). This study has investigated the 5'-UTRs of mRNAs to define consensus sequence properties and structural features that allow identification of alternative initiation sites for protein translation.

6.
7.
As a part of the Bacillus subtilis genome sequencing project, we have determined a 25-kb sequence covering the 17°–19° region. This region contains 26 complete open reading frames (ORFs) including the alkA and adaA/B operon, which encode genes for adaptive response to DNA alkylation. A homology search for the newly identified 21 ORFs revealed that 4 of them exhibit a significant similarity to known proteins, e.g., methicillin-resistant Staphylococcus aureus (MRSA) protein homolog, proteins involved in chloramphenicol resistance, glucosamine synthase and an ABC transporter protein. The remaining 17 ORFs did not show any significant sequence similarities to known gene products in the database.

8.
We searched partial sequences of over 22,706 rice cDNA and 1220 genomic DNA clones to find and characterize simple sequence repeats (SSRs) in the rice genome. The most frequently found repeated SSR motif in both cDNA and genomic DNA sequences was d(CCG/CGG)n. The second most frequently found SSR was d(AG/CT)n. In contrast with mammalian genomes, in which d(AC/GT)n sequences are the most abundant, d(AC/GT)n sequences were not frequently observed in rice. Sequences containing d(CCG/CGG)n, d(AG/CT)n repeats, and other SSRs were chosen for polymorphism detection. It was predicted that 17 of 20 SSRs in cDNA sequences were located in 5'-untranslated regions near initiation codons. Twenty-two loci can be mapped on our RFLP linkage map by these SSRs. Six markers were tested with 16 japonica rice varieties as templates for PCR. Two markers exhibited amplified fragment length polymorphism among these rice varieties, implying that SSRs are polymorphic among rice varieties which have similar genetic backgrounds. Even these polymorphic SSRs are located within or around genes which code ubiquitous proteins.
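
To make the idea of an SSR search concrete, here is a minimal sketch that reports perfect tandem repeats of di- and trinucleotide motifs using a regular expression. The abstract does not describe the search procedure, so everything here (the function name `find_ssrs`, the motif lengths, and the minimum of four repeats) is an assumption chosen for illustration.

```python
import re


def find_ssrs(seq, motif_len=(2, 3), min_repeats=4):
    """Return sorted (start, motif, repeat_count) tuples for perfect repeats.

    Motifs of each length in the inclusive range `motif_len` are searched;
    a hit needs at least `min_repeats` consecutive copies. Both defaults are
    illustrative assumptions, not values from the abstract.
    """
    seq = seq.upper()
    results = []
    for k in range(motif_len[0], motif_len[1] + 1):
        # A k-mer followed by at least (min_repeats - 1) further copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, min_repeats - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            if len(set(motif)) == 1:
                continue  # skip homopolymer runs such as AAAAAAAA
            results.append((m.start(), motif, len(m.group(0)) // k))
    return sorted(results)


# Hypothetical toy sequence with a (CCG)5 and an (AG)6 repeat.
toy = "TTA" + "CCG" * 5 + "TT" + "AG" * 6 + "TT"
ssrs = find_ssrs(toy)
```

The motif is reported in the phase in which it is first matched on the given strand. Grouping a motif with its reverse complement and its rotations, as the d(CCG/CGG)n notation implies, would need an extra normalization step that this sketch leaves out.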

9.
The nucleotide sequence of a 2,146 bp portion of the Anacystis nidulans (Synechococcus PCC6301) genome has been determined. This region contains an open reading frame (ORF) of 392 codons, whose predicted protein sequence shows partial homology to those of E. coli phoM and envZ. Hence ORF392 is suggested to be a sensory kinase gene in cyanobacteria.

10.
11.
Human T-cell lymphotropic virus type I (HTLV-I) double-spliced mRNA exhibits two GUG and two CUG codons upstream of, and in frame with, the sequences encoding the Rex and Tax regulatory proteins, respectively. To verify whether these GUG and CUG codons could be used as additional initiation codons of translation, two chimeric constructs were built for directing the synthesis of either Rex–CAT or Tax–CAT fusion proteins. In both cases, the CAT reporter sequence was inserted after the Tax AUG codon and in frame with either the Rex or Tax AUG codon. Under transient expression of these constructs, other proteins of higher molecular mass were synthesized in addition to the expected Rex–CAT and Tax–CAT proteins. The potential non-AUG initiation codons were exchanged for either an AUG codon or a non-initiation codon. This allowed us to demonstrate that the two GUG codons in frame with the Rex coding sequence, and only the second CUG in frame with the Tax coding sequence, were used as additional initiation codons. In HTLV-I infected cells, two additional Rex proteins and one additional Tax protein were detected, with molecular masses compatible with the use of the two GUG codons and the second CUG codon as additional initiation codons of translation. Comparison of the HTLV-I proviral DNA sequence with that of other HTLV-related retroviruses revealed a striking conservation of the three non-AUG initiation codons, strongly suggesting their use for the synthesis of additional Rex and Tax proteins.

12.
Nucleotide sequences around the boundaries of all open reading frames in the Escherichia coli whole genome were analyzed. Characteristic base biases were observed after the initiation codon and before the termination codon. We examined the effect of the base sequence after the initiation codon on the translation efficiency, by introducing mutations after the initiation codon of the E. coli dihydrofolate reductase (DHFR) gene, considering codon and base biases, and using in vitro and in vivo translation systems. In both assay systems, the two most frequent second codons, AAA and AAU, enhanced the translation efficiency compared with the wild type, whereas the effects of lower frequency codons were not significant. Experiments using 16S rRNA variants with mutations in the putative complementary sequence to the region downstream of the initiation codon showed that the translation efficiency of none of the DHFR mutants was affected. These results demonstrate that the statistically most frequent sequences for the second codon enhance translation efficiency, and this effect seems to be independent of base pairing between mRNA and 16S rRNA.
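
The statistical part of this analysis, ranking the codon that directly follows the initiation codon, can be sketched in a few lines. The function name `second_codon_counts` and the toy ORF list are hypothetical; the abstract does not say how the authors tallied codons, so this is only one plausible way to do it.

```python
from collections import Counter


def second_codon_counts(orfs):
    """Count the codon at positions 4-6 of each ORF (the 'second codon').

    ORFs are given as strings starting at the initiation codon. RNA input is
    accepted by mapping U to T. ORFs shorter than two codons are ignored.
    """
    counts = Counter()
    for orf in orfs:
        orf = orf.upper().replace("U", "T")
        if len(orf) >= 6:
            counts[orf[3:6]] += 1
    return counts


# Hypothetical toy input, not real E. coli data.
toy_orfs = ["ATGAAAGGC", "AUGAAAUUU", "GTGAATCCC", "ATG"]
counts = second_codon_counts(toy_orfs)
top = counts.most_common(1)
```

Run over all annotated ORFs of a genome, `most_common()` would give the frequency ranking against which results such as "AAA and AAU are the two most frequent second codons" can be checked. Note that AAU appears as AAT after the U-to-T mapping.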

13.
Ingolia NT, Lareau LF, Weissman JS. Cell, 2011, 147(4):789-802
The ability to sequence genomes has far outstripped approaches for deciphering the information they encode. Here we present a suite of techniques, based on ribosome profiling (the deep sequencing of ribosome-protected mRNA fragments), to provide genome-wide maps of protein synthesis as well as a pulse-chase strategy for determining rates of translation elongation. We exploit the propensity of harringtonine to cause ribosomes to accumulate at sites of translation initiation together with a machine learning algorithm to define protein products systematically. Analysis of translation in mouse embryonic stem cells reveals thousands of strong pause sites and unannotated translation products. These include amino-terminal extensions and truncations and upstream open reading frames with regulatory potential, initiated at both AUG and non-AUG codons, whose translation changes after differentiation. We also define a class of short, polycistronic ribosome-associated coding RNAs (sprcRNAs) that encode small proteins. Our studies reveal an unanticipated complexity to mammalian proteomes.

14.
The nucleotide sequence of a 27830-bp DNA segment in the 79°–81° region of the Bacillus subtilis genome has been determined. This region contains 29 complete ORFs including the sspE gene, which encodes a small acid-soluble spore protein gamma and is located at one end of our assigned region. A homology search for the products deduced from the 29 ORFs revealed that nine of them exhibit significant similarity to known proteins, e.g. proteins involved in an iron uptake system, a multidrug resistance protein, a chloramphenicol resistance protein, epoxide hydrolase, adenine glycosylase, and a glucose-1-dehydrogenase homolog.

15.
16.
Highly expressed plastid genes display codon adaptation, which is defined as a bias toward a set of codons which are complementary to abundant tRNAs. This type of adaptation is similar to what is observed in highly expressed Escherichia coli genes and is probably the result of selection to increase translation efficiency. In the current work, the codon adaptation of plastid genes is studied with regard to three specific features that have been observed in E. coli and which may influence translation efficiency. These features are (1) a relatively low codon adaptation at the 5′ end of highly expressed genes, (2) an influence of neighboring codons on codon usage at a particular site (codon context), and (3) a correlation between the level of codon adaptation of a gene and its amino acid content. All three features are found in plastid genes. First, highly expressed plastid genes have a noticeable decrease in codon adaptation over the first 10–20 codons. Second, for the twofold degenerate NNY codon groups, highly expressed genes have an overall bias toward the NNC codon, but this is not observed when the 3′ neighboring base is a G. At these sites highly expressed genes are biased toward NNT instead of NNC. Third, plastid genes that have higher codon adaptations also tend to have an increased usage of amino acids with a high G + C content at the first two codon positions and GNN codons in particular. The correlation between codon adaptation and amino acid content exists separately for both cytosolic and membrane proteins and is not related to any obvious functional property. It is suggested that at certain sites selection discriminates between nonsynonymous codons based on translational, not functional, differences, with the result that the amino acid sequence of highly expressed proteins is partially influenced by selection for increased translation efficiency. Received: 21 July 1999 / Accepted: 5 November 1999

17.
We have determined a 35-kb sequence of the groESL-gutR-cotA (45°–52°) region of the Bacillus subtilis genome. In addition to the groESL, gutRB and cotA genes reported previously, we have newly identified 24 ORFs including gutA and fruC genes, encoding glucitol permease and fructokinase, respectively. The inherent restriction/modification system genes, hsdMR and hsdMM, were mapped between groESL and gutRB, and we have identified two open reading frames (ORFs) encoding 5-methylcytosine forming DNA methyl transferase and an operon probably encoding a restriction enzyme complex. The unusual genome structure of few ORFs and lower GC content around the restriction/modification genes strongly suggests that the region originated from a bacteriophage integrated during evolution.

18.
The amphidiploid (AACC) nuclear genome of Brassica napus (oil-seed rape) contains c. 5 × 10^5 copies of a simple, highly repetitive DNA element; each repeat is 176 or 177 base pairs long and is defined by Hind III cutting sites. The diploid (AA) Brassica campestris (turnip) possesses a very similar repetitive DNA, the consensus sequence of which does not differ from that in B. napus. The 176/177 bp unit consists of three 59 bp sub-units, defined by vestigial EcoRII sites. Analysis of the distribution of variants from consensus in adjacent and non-adjacent units provides evidence for homogenization of sequences by the fixation of independent mutations and for tandem duplication of units. Within units, there is also evidence for inversion and tandem duplication of short (5–8 bp) motifs. Previously published data show that 176/177 base pair repetitive DNA elements, defined by Hind III cutting sites, are also present in Sinapis and Raphanus. There is a sequence homology between Brassica and Sinapis, and between Brassica and Raphanus, of 75%. Sequence homology between Raphanus and Sinapis is 73%. Key words: Repetitive DNA, Brassica, Cruciferae

19.
A reassessment of the translation initiation codon in vertebrates   Cited by: 13 (self-citations: 0; citations by others: 13)

20.