首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
The complete nucleotide sequence of the genome of a symbiotic bacterium Bradyrhizobium japonicum USDA110 was determined. The genome of B. japonicum was a single circular chromosome 9,105,828 bp in length with an average GC content of 64.1%. No plasmid was detected. The chromosome comprises 8317 potential protein-coding genes, one set of rRNA genes and 50 tRNA genes. Fifty-two percent of the potential protein genes showed sequence similarity to genes of known function and 30% to hypothetical genes. The remaining 18% had no apparent similarity to reported genes. Thirty-four percent of the B. japonicum genes showed significant sequence similarity to those of both Mesorhizobium loti and Sinorhizobium meliloti, while 23% were unique to this species. A presumptive symbiosis island 681 kb in length, which includes a 410-kb symbiotic region previously reported by G?ttfert et al., was identified. Six hundred fifty-five putative protein-coding genes were assigned in this region, and the functions of 301 genes, including those related to symbiotic nitrogen fixation and DNA transmission, were deduced. A total of 167 genes for transposases/104 copies of insertion sequences were identified in the genome. It was remarkable that 100 out of 167 transposase genes are located in the presumptive symbiotic island. DNA segments of 4 to 97 kb inserted into tRNA genes were found at 14 locations in the genome, which generates partial duplication of the target tRNA genes. These observations suggest plasticity of the B. japonicum genome, which is probably due to complex genome rearrangements such as horizontal transfer and insertion of various DNA elements, and to homologous recombination.  相似文献   

2.
For effective exploitation of the genome sequence information of Lotus microsymbiont, Mesorhizobium loti MAFF303099, to discover gene functions, we have constructed an ordered and mutually overlapping cosmid library using an IncP broad host-range vector. The library consisted of 480 clones to cover approximately 99.6% of the genome with average insert size and overlap of 26.9 and 11.1 kbp, respectively. The genome of M. loti consists of a single chromosome and two plasmids. The chromosome (7,036,071 bp) was covered 99.68% by 445 clones with four gaps, although two clones were unstable in E. coli. The larger plasmid pMLa (351,911 bp) was completely covered by 23 clones, while the smaller pMLb (208,315 bp) was covered 98.85% by 12 clones with two gaps. We have also made ancillary plasmids to facilitate the construction of deletion mutants using derivatives of the library clones. As a pilot experiment to uncover regions which contain novel symbiotic genes, 13 deletion mutants were constructed to lack in total 180.5 kbp of the genome. All the mutants formed apparently normal nodules and supported symbiotic nitrogen fixation, however, one mutant that lacked a 5.3 kbp chromosomal region, 4,551,930-4,557,222, did not produce normal exopolysaccharides as judged by fluorescence on medium containing Calcofluor. The results supported the effectiveness of the approach to detect gene functions.  相似文献   

3.
The nucleotide sequence of the entire genome of a filamentous cyanobacterium, Anabaena sp. strain PCC 7120, was determined. The genome of Anabaena consisted of a single chromosome (6,413,771 bp) and six plasmids, designated pCC7120alpha (408,101 bp), pCC7120beta (186,614 bp), pCC7120gamma (101,965 bp), pCC7120delta (55,414 bp), pCC7120epsilon (40,340 bp), and pCC7120zeta (5,584 bp). The chromosome bears 5368 potential protein-encoding genes, four sets of rRNA genes, 48 tRNA genes representing 42 tRNA species, and 4 genes for small structural RNAs. The predicted products of 45% of the potential protein-encoding genes showed sequence similarity to known and predicted proteins of known function, and 27% to translated products of hypothetical genes. The remaining 28% lacked significant similarity to genes for known and predicted proteins in the public DNA databases. More than 60 genes involved in various processes of heterocyst formation and nitrogen fixation were assigned to the chromosome based on their similarity to the reported genes. One hundred and ninety-five genes coding for components of two-component signal transduction systems, nearly 2.5 times as many as those in Synechocystis sp. PCC 6803, were identified on the chromosome. Only 37% of the Anabaena genes showed significant sequence similarity to those of Synechocystis, indicating a high degree of divergence of the gene information between the two cyanobacterial strains.  相似文献   

4.
The Mesorhizobium loti strain R7A symbiosis island is a 502-kb chromosomally integrated element which transfers to nonsymbiotic mesorhizobia in the environment, converting them to Lotus symbionts. It integrates into a phenylalanine tRNA gene in a process mediated by a P4-type integrase encoded at the left end of the element. We have determined the nucleotide sequence of the island and compared its deduced genetic complement with that reported for the 611-kb putative symbiosis island of M. loti strain MAFF303099. The two islands share 248 kb of DNA, with multiple deletions and insertions of up to 168 kb interrupting highly conserved colinear DNA regions in the two strains. The shared DNA regions contain all the genes likely to be required for Nod factor synthesis, nitrogen fixation, and island transfer. Transfer genes include a trb operon and a cluster of potential tra genes which are also present on the strain MAFF303099 plasmid pMLb. The island lacks plasmid replication genes, suggesting that it is a site-specific conjugative transposon. The R7A island encodes a type IV secretion system with strong similarity to the vir pilus from Agrobacterium tumefaciens that is deleted from MAFF303099, which in turn encodes a type III secretion system not found on the R7A island. The 414 genes on the R7A island also include putative regulatory genes, transport genes, and an array of metabolic genes. Most of the unique hypothetical genes on the R7A island are strain-specific and clustered, suggesting that they may represent other acquired genetic elements rather than symbiotically relevant DNA.  相似文献   

5.
Complete structure of the chloroplast genome of Arabidopsis thaliana.   总被引:7,自引:0,他引:7  
The complete nucleotide sequence of the chloroplast genome of Arabidopsis thaliana has been determined. The genome as a circular DNA composed of 154,478 bp containing a pair of inverted repeats of 26,264 bp, which are separated by small and large single copy regions of 17,780 bp and 84,170 bp, respectively. A total of 87 potential protein-coding genes including 8 genes duplicated in the inverted repeat regions, 4 ribosomal RNA genes and 37 tRNA genes (30 gene species) representing 20 amino acid species were assigned to the genome on the basis of similarity to the chloroplast genes previously reported for other species. The translated amino acid sequences from respective potential protein-coding genes showed 63.9% to 100% sequence similarity to those of the corresponding genes in the chloroplast genome of Nicotiana tabacum, indicating the occurrence of significant diversity in the chloroplast genes between two dicot plants. The sequence data and gene information are available on the World Wide Web database KAOS (Kazusa Arabidopsis data Opening Site) at http://www.kazusa.or.jp/arabi/.  相似文献   

6.
To deduce the entire sequence of the top arm of the Arabidopsis thaliana chromosome 3, the sequence determination was performed on a total of 90 P1, TAC and BAC clones chosen according to our sequencing strategy. Sequence features of the resulting 4,251,695 bp regions were analyzed with various computer programs for similarity search and gene modeling. As a result, a total of 941 potential protein-coding genes were identified. The average density of the genes identified was 1 gene per 4210 bp. Introns were observed in 73% of the genes, and the average number per gene and the average length of the introns were 3.6 and 159 bp, respectively. These sequence features are essentially identical to those of chromosomes 3 and 5 in our previous reports. The regions also contained 14 tRNA genes when searched by similarity to reported tRNA genes and the tRNA scan-SE program. The sequence data and information on the potential genes are available through the World Wide Web database KAOS (Kazusa Arabidopsis data Opening Site) at http://www.kazusa.or.jp/kaos/.  相似文献   

7.
The entire genome of the unicellular cyanobacterium Synechococcus elongatus PCC 6301 (formerly Anacystis nidulans Berkeley strain 6301) was sequenced. The genome consisted of a circular chromosome 2,696,255 bp long. A total of 2,525 potential protein-coding genes, two sets of rRNA genes, 45 tRNA genes representing 42 tRNA species, and several genes for small stable RNAs were assigned to the chromosome by similarity searches and computer predictions. The translated products of 56% of the potential protein-coding genes showed sequence similarities to experimentally identified and predicted proteins of known function, and the products of 35% of the genes showed sequence similarities to the translated products of hypothetical genes. The remaining 9% of genes lacked significant similarities to genes for predicted proteins in the public DNA databases. Some 139 genes coding for photosynthesis-related components were identified. Thirty-seven genes for two-component signal transduction systems were also identified. This is the smallest number of such genes identified in cyanobacteria, except for marine cyanobacteria, suggesting that only simple signal transduction systems are found in this strain. The gene arrangement and nucleotide sequence of Synechococcus elongatus PCC 6301 were nearly identical to those of a closely related strain Synechococcus elongatus PCC 7942, except for the presence of a 188.6 kb inversion. The sequences as well as the gene information shown in this paper are available in the Web database, CYORF (http://www.cyano.genome.jp/).  相似文献   

8.
Based on the physical map of Arabidopsis thaliana chromosome 3 previously constructed with CIC YAC, TAC, P1 and BAC clones (Sato, S. et al., DNA Res., 5, 163-168, 1998), a total of 60 P1 and TAC clones were sequenced, and the sequence features of the resulting 4,504,864 bp regions were analyzed by applying various computer programs for similarity search and gene modeling. As a result, a total of 1054 potential protein-coding genes were identified. The average density of the genes identified was 1 gene per 4066 bp. Introns were observed in 77% of the genes, and the average number per gene and the average length of the introns were 3.9 and 156 bp, respectively. These sequence features are essentially identical to those of chromosome 5 in our previous reports, but the gene density was slightly higher than that observed for chromosomes 2 and 4. The regions also contained 10 tRNA genes when searched by similarity to reported tRNA genes and the tRNA scan-SE program. The sequence data and information on the potential genes are available through the World Wide Web database KAOS (Kazusa Arabidopsis data Opening Site) at http://www.kazusa.or.jp/kaos/.  相似文献   

9.
The complete mitochondrial genome sequence of the cockscomb pearl mussel Cristaria plicata, which is an endangered species in South Korea, was sequenced. The circle genome (15,708 bp in size) consists of 13 protein-coding genes, 2 rRNA genes, and 22 tRNA genes. There were 26 noncoding regions (NCs) found throughout the mitogenome of C. plicata, ranging in size from 2 to 327 bp, and the two largest NC regions, NC286 and NC326, were found between ND5 and tRNA(Gln) (286 bp) and between tRNA(Glu) and ND2 (326 bp), respectively. The 13 mitochondrial protein-coding genes of a female individual of C. plicata collected from Korea (15,708 bp) were compared to those of the Chinese individual (15,712 bp) published before. The result showed that ND3 is the most conserved with 100% nucleotide similarity, and each of the other protein-coding genes has ca. 99%, respectively. The two largest NCs among 26 NCs have totally 98% nucleotide similarity between Korean and Chinese ones.  相似文献   

10.
11.
Leishmania parasites (order Kinetoplastida, family Trypanosomatidae) cause a spectrum of human diseases ranging from asymptomatic to lethal. The ~33.6 Mb genome is distributed among 36 chromosome pairs that range in size from ~0.3 to 2.8 Mb. The complete nucleotide sequence of Leishmania major Friedlin chromosome 1 revealed 79 protein-coding genes organized into two divergent polycistronic gene clusters with the mRNAs transcribed towards the telomeres. We report here the complete nucleotide sequence of chromosome 3 (384 518 bp) and an analysis revealing 95 putative protein-coding ORFs. The ORFs are primarily organized into two large convergent polycistronic gene clusters (i.e. transcribed from the telomeres). In addition, a single gene at the left end is transcribed divergently towards the telomere, and a tRNA gene separates the two convergent gene clusters. Numerous genes have been identified, including those for metabolic enzymes, kinases, transporters, ribosomal proteins, spliceosome components, helicases, an RNA-binding protein and a DNA primase subunit.  相似文献   

12.
In this series of projects sequencing the entire genome of Arabidopsis thaliana chromosome 5, non-redundant P1 and TAC clones have been sequenced according to the fine physical map, and as of May 7, 1999, the sequences of 16.2 Mb representing approximately 60% of chromosome 5 have been accumulated and released at our web site. In parallel, structural features of the sequenced regions have been analyzed by applying a variety of computer programs, and to date we have predicted a total of 2380 potential protein-coding genes in the 10,154,580 bp regions, which are covered by 142 P1 and TAC clones. In this paper, we newly analyzed the structural features of the 1,011,550 bp regions covered by additional 17 P1 and TAC clones, and predicted 298 protein-coding genes. The average density of the genes identified was 1 gene per 3394 bp. Introns were observed in 67% of the genes, and the average number per gene and the average length of the introns were 3.2 and 159 bp, respectively. The gene density became higher than the value estimated in the previously analyzed regions (1 gene per 4,267 bp), as the data in this paper were compiled based on a new standard of gene assignment including the computer-predicted hypothetical genes. The regions also contained 8 tRNA genes when searched by similarity to reported tRNA genes and the tRNA scan-SE program. The sequence data and information on the potential genes are available on the database KAOS (Kazusa Arabidopsis data Opening Site) at http://www.kazusa.or.jp/arabi/.  相似文献   

13.
We determined the nucleotide sequence of the entire genome of a diazotrophic endophyte, Azospirillum sp. B510. Strain B510 is an endophytic bacterium isolated from stems of rice plants (Oryza sativa cv. Nipponbare). The genome of B510 consisted of a single chromosome (3 311 395 bp) and six plasmids, designated as pAB510a (1 455 109 bp), pAB510b (723 779 bp), pAB510c (681 723 bp), pAB510d (628 837 bp), pAB510e (537 299 bp), and pAB510f (261 596 bp). The chromosome bears 2893 potential protein-encoding genes, two sets of rRNA gene clusters (rrns), and 45 tRNA genes representing 37 tRNA species. The genomes of the six plasmids contained a total of 3416 protein-encoding genes, seven sets of rrns, and 34 tRNAs representing 19 tRNA species. Eight genes for plasmid-specific tRNA species are located on either pAB510a or pAB510d. Two out of eight genomic islands are inserted in the plasmids, pAB510b and pAB510e, and one of the islands is inserted into trnfM-CAU in the rrn located on pAB510e. Genes other than the nif gene cluster that are involved in N2 fixation and are homologues of Bradyrhizobium japonicum USDA110 include fixABCX, fixNOQP, fixHIS, fixG, and fixLJK. Three putative plant hormone-related genes encoding tryptophan 2-monooxytenase (iaaM) and indole-3-acetaldehyde hydrolase (iaaH), which are involved in IAA biosynthesis, and ACC deaminase (acdS), which reduces ethylene levels, were identified. Multiple gene-clusters for tripartite ATP-independent periplasmic-transport systems and a diverse set of malic enzymes were identified, suggesting that B510 utilizes C4-dicarboxylate during its symbiotic relationship with the host plant.  相似文献   

14.
15.
The complete nucleotide sequence of the cucumber (C. sativus L. var. Borszczagowski) chloroplast genome has been determined. The genome is composed of 155,293 bp containing a pair of inverted repeats of 25,191 bp, which are separated by two single-copy regions, a small 18,222-bp one and a large 86,688-bp one. The chloroplast genome of cucumber contains 130 known genes, including 89 protein-coding genes, 8 ribosomal RNA genes (4 rRNA species), and 37 tRNA genes (30 tRNA species), with 18 of them located in the inverted repeat region. Of these genes, 16 contain one intron, and two genes and one ycf contain 2 introns. Twenty-one small inversions that form stem-loop structures, ranging from 18 to 49 bp, have been identified. Eight of them show similarity to those of other species, while eight seem to be cucumber specific. Detailed comparisons of ycf2 and ycf15, and the overall structure to other chloroplast genomes were performed.  相似文献   

16.
The nucleotide sequence of the complete genome of a cyanobacterium,Microcystis aeruginosa NIES-843, was determined. The genomeof M. aeruginosa is a single, circular chromosome of 5 842 795base pairs (bp) in length, with an average GC content of 42.3%.The chromosome comprises 6312 putative protein-encoding genes,two sets of rRNA genes, 42 tRNA genes representing 41 tRNA species,and genes for tmRNA, the B subunit of RNase P, SRP RNA, and6Sa RNA. Forty-five percent of the putative protein-encodingsequences showed sequence similarity to genes of known function,32% were similar to hypothetical genes, and the remaining 23%had no apparent similarity to reported genes. A total of 688kb of the genome, equivalent to 11.8% of the entire genome,were composed of both insertion sequences and miniature inverted-repeattransposable elements. This is indicative of a plasticity ofthe M. aeruginosa genome, through a mechanism that involveshomologous recombination mediated by repetitive DNA elements.In addition to known gene clusters related to the synthesisof microcystin and cyanopeptolin, novel gene clusters that maybe involved in the synthesis and modification of toxic smallpolypeptides were identified. Compared with other cyanobacteria,a relatively small number of genes for two component systemsand a large number of genes for restriction-modification systemswere notable characteristics of the M. aeruginosa genome.  相似文献   

17.
The tRNA-coding locus of the 8.2-kilobase pair (kbp) Eco RI fragments Eco G of Euglena gracilis Klebs, strain Z Pringsheim chloroplast DNA was chosen for detailed analysis. Two recombinant plasmids, pPG14, containing Eco G and the vector pMB9, and pEZC23, containing the chloroplast DNA fragment HindIII B cloned in pBR322 were employed for the study. The tRNA locus was mapped to an 0.8-kbp region of Eco G also present in HindIII B. The DNA sequence of a 1.6 kbp from HindIII B, containing the entire tRNA gene locus was determined. Four tRNA genes were identified from the DNA sequence. The gene organization is tRNAVal-16 bp spacer-tRNAAsn-3 bp spacer-tRNAArg-45 bp spacer-tRNALeu. The tRNALeu gene is of the opposite polarity as the other three genes. This is the first evidence of such a tRNA cluster for a chloroplast genome. Also evident from the DNA sequence, 132 bp from the 5'-end of the tRNALeu gene, is a putative gene or pseudogene for a chloroplast protein.  相似文献   

18.
19.
W D Reiter  P Palm    S Yeats 《Nucleic acids research》1989,17(5):1907-1914
The DNA sequences were determined at the boundaries of the integrated copy of the archaebacterial genetic element SSV1. A 44 bp sequence present as a single copy on the 15.5 kb circular SSV1 DNA flanked the integrated copy as a direct DNA sequence repeat, suggesting that SSV1 integration occurred by recombination between this 44 bp SSV1 sequence and an identical sequence on the bacterial chromosome. At the left attachment site, a region encompassing the 44 bp attachment core sequence and the 31 nucleotides upstream of it displayed all characteristics expected for an arginine tRNA gene. An analysis of published attachment site sequences of other systems revealed that tRNA genes also constitute the bacterial attachment site in the case of three temperate phages and two transmissible plasmids in eubacteria, indicating a widespread occurrence of tRNA genes as integration target sites. This finding may be important for the understanding of mechanisms and evolution of site-specific recombination.  相似文献   

20.
Bifidobacterium longum strains predominate in the colonic microbiota of breast-fed infants. Here we report the complete genome sequence of B. longum subsp. longum KACC 91563, isolated from feces of neonates. A single circular chromosome of 2,385,301 bp contains 1,980 protein-coding genes, 56 tRNA genes, and 3 rRNA operons.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号