首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Complete DNA sequence of yeast chromosome II.   总被引:20,自引:2,他引:18       下载免费PDF全文
In the framework of the EU genome-sequencing programmes, the complete DNA sequence of the yeast Saccharomyces cerevisiae chromosome II (807 188 bp) has been determined. At present, this is the largest eukaryotic chromosome entirely sequenced. A total of 410 open reading frames (ORFs) were identified, covering 72% of the sequence. Similarity searches revealed that 124 ORFs (30%) correspond to genes of known function, 51 ORFs (12.5%) appear to be homologues of genes whose functions are known, 52 others (12.5%) have homologues the functions of which are not well defined and another 33 of the novel putative genes (8%) exhibit a degree of similarity which is insufficient to confidently assign function. Of the genes on chromosome II, 37-45% are thus of unpredicted function. Among the novel putative genes, we found several that are related to genes that perform differentiated functions in multicellular organisms of are involved in malignancy. In addition to a compact arrangement of potential protein coding sequences, the analysis of this chromosome confirmed general chromosome patterns but also revealed particular novel features of chromosomal organization. Alternating regional variations in average base composition correlate with variations in local gene density along chromosome II, as observed in chromosomes XI and III. We propose that functional ARS elements are preferably located in the AT-rich regions that have a spacing of approximately 110 kb. Similarly, the 13 tRNA genes and the three Ty elements of chromosome II are found in AT-rich regions. In chromosome II, the distribution of coding sequences between the two strands is biased, with a ratio of 1.3:1. An interesting aspect regarding the evolution of the eukaryotic genome is the finding that chromosome II has a high degree of internal genetic redundancy, amounting to 16% of the coding capacity.  相似文献   

2.
We searched the nucleotide sequence of budding yeast Saccharomycescerevisiae chromosome VI (270 kb) for candidate coding regions,using the computer program GenMark. One hundred and twenty-nineputative genes were identified, which is almost the same asthe number of ORFs on this chromosome. Nineteen new putativegenes were identified through the GenMark analysis. Most largeORFs were also correctly identified (87% of the predicted putativegenes identified by the GenMark (110 of 127) matched the reportedORFs). The new coding regions were mostly small but they weredistinguished from the more than 2000 ORFs identified by Genetyx.GenMark did not predict 17 ORFs that were over 300 bp long.As these ORFs include known genes, their sequence context maydiffer somewhat from that of typical yeast genes. These analysesrevealed the high potential of GenMark to identify putativegenes from numerous short ORFs and will produce informationon the likelihood of their being actual genes.  相似文献   

3.
4.
Identification of functional open reading frames in chloroplast genomes   总被引:7,自引:0,他引:7  
K H Wolfe  P M Sharp 《Gene》1988,66(2):215-222
We have used a rapid computer dot-matrix comparison method to identify all DNA regions which have been evolutionarily conserved between the completely sequenced chloroplast genomes of tobacco and a liverwort. Analysis of these regions reveals 74 homologous open reading frames (ORFs) which have been conserved as to length and amino acid sequence; these ORFs also have an excess of nucleotide substitutions at silent sites of codons. Since the nonfunctional parts of these genomes have become saturated with mutations and show no sequence similarity whatsoever, the homologous ORFs are almost certainly functional. A further four pairs of ORFs show homology limited to only a short part of their putative gene products. Amino acid sequence identities range between 50 and 99%; some chloroplast proteins are seen to be among the most slowly evolving of all known proteins. A search of the nucleotide and amino acid sequence databanks has revealed several previously unidentified genes in chloroplast sequences from other species, but no new homologies to prokaryotic genes.  相似文献   

5.
6.
Analysis of the mitochondrial DNA of a liverwort Marchantia polymorpha by electron microscopy and restriction endonuclease mapping indicated that the liverwort mitochondrial genome was a single circular molecule of about 184,400 base-pairs. We have determined the complete sequence of the liverwort mitochondrial DNA and detected 94 possible genes in the sequence of 186,608 base-pairs. These included genes for three species of ribosomal RNA, 29 genes for 27 species of transfer RNA and 30 open reading frames (ORFs) for functionally known proteins (16 ribosomal proteins, 3 subunits of H(+)-ATPase, 3 subunits of cytochrome c oxidase, apocytochrome b protein and 7 subunits of NADH ubiquinone oxidoreductase). Three ORFs showed similarity to ORFs of unknown function in the mitochondrial genomes of other organisms. Furthermore, 29 ORFs were predicted as possible genes by using the index of G + C content in first, second and third letters of codons (42.0 +/- 10.9%, 37.0 +/- 13.2% and 26.4 +/- 9.4%, respectively) obtained from the codon usages of identified liverwort genes. To date, 32 introns belonging to either group I or group II intron have been found in the coding regions of 17 genes including ribosomal RNA genes (rrn18 and rrn26), a transfer RNA gene (trnS) and a pseudogene (psi nad7). RNA editing was apparently lacking in liverwort mitochondria since the nucleotide sequences of the liverwort mitochondrial DNA were well-conserved at the DNA level.  相似文献   

7.
8.
The physical and genetic map of the Bradyrhizobium japonicum chromosome revealed that nitrogen fixation and nodulation genes are clustered. Because of the complex interactions between the bacterium and the plant, we expected this chromosomal sector to contain additional genes that are involved in the maintenance of an efficient symbiosis. Therefore, we determined the nucleotide sequence of a 410-kb region. The overall G+C nucleotide content was 59.1%. Using a minimum gene length of 150 nucleotides, 388 open reading frames (ORFs) were selected as coding regions. Thirty-five percent of the predicted proteins showed similarity to proteins of rhizobia. Sixteen percent were similar only to proteins of other bacteria. No database match was found for 29%. Repetitive DNA sequence-derived ORFs accounted for the rest. The sequenced region contained all nitrogen fixation genes and, apart from nodM, all nodulation genes that were known to exist in B. japonicum. We found several genes that seem to encode transport systems for ferric citrate, molybdate, or carbon sources. Some of them are preceded by -24/-12 promoter elements. A number of putative outer membrane proteins and cell wall-modifying enzymes as well as a type III secretion system might be involved in the interaction with the host.  相似文献   

9.
The coding sequences of genes in the yeast Saccharomyces cerevisiae show a preference for 25 of the 61 possible coding triplets. The degree of this biased codon usage in each gene is positively correlated to its expression level. Highly expressed genes use these 25 major codons almost exclusively. As an experimental approach to studying biased codon usage and its possible role in modulating gene expression, systematic codon replacements were carried out in the highly expressed PGK1 gene. The expression of phosphoglycerate kinase (PGK) was studied both on a high-copy-number plasmid and as a single copy gene integrated into the chromosome. Replacing an increasing number (up to 39% of all codons) of major codons with synonymous minor ones at the 5' end of the coding sequence caused a dramatic decline of the expression level. The PGK protein levels dropped 10-fold. The steady-state mRNA levels also declined, but to a lesser extent (threefold). Our data indicate that this reduction in mRNA levels was due to destabilization caused by impaired translation elongation at the minor codons. By preventing translation of the PGK mRNAs by the introduction of a stop codon 3' and adjacent to the start codon, the steady-state mRNA levels decreased dramatically. We conclude that efficient mRNA translation is required for maintaining mRNA stability in S. cerevisiae. These findings have important implications for the study of the expression of heterologous genes in yeast cells.  相似文献   

10.
Leptospira biflexa is a representative of an evolutionarily distinct group of eubacteria. In order to better understand the genetic organization and gene regulatory mechanisms of this species, we have chosen to study the genes required for tryptophan biosynthesis in this bacterium. The nucleotide sequence of the region of the L. biflexa serovar patoc chromosome encoding the trpE and trpG genes has been determined. Four open reading frames (ORFs) were identified in this region, but only three ORFs were translated into proteins when the cloned genes were introduced into Escherichia coli. Analysis of the predicted amino acid sequences of the proteins encoded by the ORFs allowed us to identify the trpE and trpG genes of L. biflexa. Enzyme assays confirmed the identity of these two ORFs. Anthranilate synthase from L. biflexa was found to be subject to feedback inhibition by tryptophan. Codon usage analysis showed that there was a bias in L. biflexa towards the use of codons rich in A and T, as would be expected from its G + C content of 37%. Comparison of the amino acid sequences of the trpE gene product and the trpG gene product with corresponding gene products from other bacteria showed regions of highly conserved sequence.  相似文献   

11.
Previous analyses of Saccharomyces cerevisiae chromosome I have suggested that the majority (greater than 75%) of single-copy essential genes on this chromosome are difficult or impossible to identify using temperature-sensitive (Ts-) lethal mutations. To investigate whether this situation reflects intrinsic difficulties in generating temperature-sensitive proteins or constraints on mutagenesis in yeast, we subjected three cloned essential genes from chromosome I to mutagenesis in an Escherichia coli mutator strain and screened for Ts- lethal mutations in yeast using the "plasmid-shuffle" technique. We failed to obtain Ts- lethal mutations in two of the genes (FUN12 and FUN20), while the third gene yielded such mutations, but only at a low frequency. DNA sequence analysis of these mutant alleles and of the corresponding wild-type region revealed that each mutation was a single substitution not in the previously identified gene FUN19, but in the adjacent, newly identified essential gene FUN53. FUN19 itself proved to be non-essential. These results suggest that many essential proteins encoded by genes on chromosome I cannot be rendered thermolabile by single mutations. However, the results obtained with FUN53 suggest that there may also be significant constraints on mutagenesis in yeast. The 5046 base-pair interval sequenced contains the complete FUN19, FUN53 and FUN20 coding regions, as well as a portion of the adjacent non-essential FUN21 coding region. In all, 68 to 75% of this interval is open reading frame. None of the four predicted products shows significant homologies to known proteins in the available databases.  相似文献   

12.
13.
14.
Bacteriophage sk1 is a small isometric-headed lytic phage belonging to the 936 species. It infects Lactococcus lactis , a commonly used dairy starter organism. Nucleotide sequence data analysis indicated that the sk1 genome is 28 451 nucleotides long and contains 54 open reading frames (ORFs) of 30 or more codons, interspersed with three large intergenic regions. The nucleotide sequence of several of the sk1 ORFs demonstrated significant levels of identity to genes (many encoding proteins of unknown function) in other lactococcal phages of both small isometric-headed and prolate-headed morphotype. Based on this identity and predicted peptide structures, sk1 genes for the terminase, major structural protein and DNA polymerase have been putatively identified. Genes encoding holin and lysin were also identified, subcloned into an Escherichia coli expression vector, and their function demonstrated in vivo . The sk1 origin of replication was located by identifying sk1 DNA fragments able to support the maintenance in L. lactis of a plasmid lacking a functional Gram-positive ori . The minimal fragment conferring replication origin function contained a number of direct repeats and 179 codons of ORF47. Although no similarity between phage sk1 and coliphage λ at the nucleotide or amino acid sequence level was observed, an alignment of the sk1 late region ORFs with the λ structural and packaging genes revealed a striking correspondence in both ORF length and isoelectric point of the ORF product. It is proposed that this correspondence is indicative of a strong conservation in gene order within these otherwise unrelated isometric-headed phages that can be used to predict the functions of the sk1 gene products.  相似文献   

15.
The nucleotide sequence of the entire genome of a filamentous cyanobacterium, Anabaena sp. strain PCC 7120, was determined. The genome of Anabaena consisted of a single chromosome (6,413,771 bp) and six plasmids, designated pCC7120alpha (408,101 bp), pCC7120beta (186,614 bp), pCC7120gamma (101,965 bp), pCC7120delta (55,414 bp), pCC7120epsilon (40,340 bp), and pCC7120zeta (5,584 bp). The chromosome bears 5368 potential protein-encoding genes, four sets of rRNA genes, 48 tRNA genes representing 42 tRNA species, and 4 genes for small structural RNAs. The predicted products of 45% of the potential protein-encoding genes showed sequence similarity to known and predicted proteins of known function, and 27% to translated products of hypothetical genes. The remaining 28% lacked significant similarity to genes for known and predicted proteins in the public DNA databases. More than 60 genes involved in various processes of heterocyst formation and nitrogen fixation were assigned to the chromosome based on their similarity to the reported genes. One hundred and ninety-five genes coding for components of two-component signal transduction systems, nearly 2.5 times as many as those in Synechocystis sp. PCC 6803, were identified on the chromosome. Only 37% of the Anabaena genes showed significant sequence similarity to those of Synechocystis, indicating a high degree of divergence of the gene information between the two cyanobacterial strains.  相似文献   

16.
17.
Two genes, MF alpha 1 and MF alpha 2, coding for the alpha-factor in yeast Saccharomyces cerevisiae were identified by in situ colony hybridization of synthetic probes to a yeast genomic library. The probes were designed on the basis of the known amino acid sequence of the tridecapeptide alpha-pheromone. The nucleotide sequence revealed that the two genes, though similar in their overall structure, differ from each other in several striking ways. MF alpha 1 gene contains 4 copies of the coding sequence for the alpha-factor, which are separated by 24 nucleotides encoding the octapeptide Lys-Arg-Glu-Ala-Glu(or Asp)-Ala-Glu-Ala. The first alpha-factor coding block is preceded by a sequence for the hexapeptide Lys-Arg-Glu-Ala and 83 additional amino acids. MF alpha 2 gene contains coding sequences for two copies of the alpha-factor that differ from each other and from alpha-factor encoded by MF alpha 1 gene by a Gln leads to Asn and a Lys leads to Arg substitution. The first copy of the alpha-factor is preceded by a sequence coding for 87 amino acids which ends with Lys-Arg-Glu-Ala-Val-Ala-Asp-Ala. The coding blocks of the two copies of the pheromone are separated by the sequence for Lys-Arg-Glu-Ala-Asn-Ala-Asp-Ala. Thus, the alpha-factor can be derived from 2 different precursor proteins of 165 and 120 amino acids containing, respectively, 4 and 2 copies of the pheromone.  相似文献   

18.
The complete sequence of the mitochondrial DNA (mtDNA) of the true slime mold Physarun polycephalum has been determined. The mtDNA is a circular 62,862-bp molecule with an A+T content of 74.1%. A search with the program BLAST X identified the protein-coding regions. The mitochondrial genome of P. polycephalum was predicted to contain genes coding for 12 known proteins [for three cytochrome c oxidase subunits, apocytochrome b, two F1Fo-ATPase subunits, five NADH dehydrogenase (nad) subunits, and one ribosomal protein], two rRNA genes, and five tRNA genes. However, the predicted ORFs are not all in the same frame, because mitochondrial RNA in P. polycephalum undergoes RNA editing to produce functional RNAs. The nucleotide sequence of an nad7 cDNA showed that 51 nucleotides were inserted at 46 sites in the mRNA. No guide RNA-like sequences were observed in the mtDNA of P. polycephalum. Comparison with reported Physarum mtDNA sequences suggested that sites of RNA editing vary among strains. In the Physarum mtDNA, 20 ORFs of over 300 nucleotides were found and ORFs 14 19 are transcribed.  相似文献   

19.
We have screened numerous different yeast species for the presence of sequences homologous to the intron of the mitochondrial 21S rRNA gene of Saccharomyces cerevisiae (intron r1) and found them in all Kluyveromyces species, some of the Saccharomyces species and none of the other yeasts tested. We have determined the nucleotide sequence of the r1-intron in K. thermotolerans and compared it with that of S. cerevisiae. The two introns are inserted at the same position within the 21S rRNA gene. They contain homologous internal open reading frames (ORFs) initiated at the same AUG codon which can be aligned over their entire length. Several silent multi-substitutions indicate that these intronic ORFs represent selectively conserved functional genes. Other intron segments, on the contrary, reveal short blocks of extensive homology separated by non-homologous stretches and/or additions-deletions. Comparison of our two yeast r1-introns with equivalent introns of N. crassa and A. nidulans mitochondria reveals that introns with very similar RNA secondary structures can accommodate different types of ORFs.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号