Similar literature
 20 similar articles found (search time: 31 ms)
1.
The complete sequence of the genome of an aerobic hyper-thermophilic crenarchaeon, Aeropyrum pernix K1, which optimally grows at 95°C, has been determined by the whole genome shotgun method with some modifications. The entire length of the genome was 1,669,695 bp. The authenticity of the entire sequence was supported by restriction analysis of long PCR products, which were directly amplified from the genomic DNA. As the potential protein-coding regions, a total of 2,694 open reading frames (ORFs) were assigned. By similarity search against public databases, 633 (23.5%) of the ORFs were related to genes with putative function and 523 (19.4%) to the sequences registered but with unknown function. All the genes in the TCA cycle except for that of alpha-ketoglutarate dehydrogenase were included, and instead of the alpha-ketoglutarate dehydrogenase gene, the genes coding for the two subunits of 2-oxoacid:ferredoxin oxidoreductase were identified. The remaining 1,538 ORFs (57.1%) did not show any significant similarity to the sequences in the databases. Sequence comparison among the assigned ORFs suggested that a considerable number of ORFs were generated by sequence duplication. The RNA genes identified were a single 16S–23S rRNA operon, two 5S rRNA genes and 47 tRNA genes including 14 genes with intron structures. All the assigned ORFs and RNA coding regions occupied 89.12% of the whole genome. The data presented in this paper are available on the internet homepage (http://www.mild.nite.go.jp).
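The ORF category shares quoted in this abstract can be cross-checked with a few lines of arithmetic. The sketch below is only an illustration: the counts are taken from the abstract, while the function name `category_percentages` and the dictionary labels are hypothetical.

```python
# Illustrative check of the ORF category percentages quoted in the abstract.
# Counts come from the abstract; the function name and labels are hypothetical.

def category_percentages(counts):
    """Return each category's share of the total in percent, rounded to 1 decimal."""
    total = sum(counts.values())
    return {name: round(100 * n / total, 1) for name, n in counts.items()}

orf_counts = {
    "putative function": 633,
    "registered, unknown function": 523,
    "no significant similarity": 1538,
}

total_orfs = sum(orf_counts.values())      # should equal the 2,694 assigned ORFs
shares = category_percentages(orf_counts)  # should match 23.5%, 19.4%, 57.1%
print(total_orfs, shares)
```

The three categories sum to the 2,694 assigned ORFs, so the reported percentages are mutually consistent.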

2.
The complete sequence of the genome of a hyper-thermophilic archaebacterium, Pyrococcus horikoshii OT3, has been determined by assembling the sequences of the physical map-based contigs of fosmid clones and of long polymerase chain reaction (PCR) products which were used for gap-filling. The entire length of the genome was 1,738,505 bp. The authenticity of the entire genome sequence was supported by restriction analysis of long PCR products, which were directly amplified from the genomic DNA. As the potential protein-coding regions, a total of 2061 open reading frames (ORFs) were assigned, and by similarity search against public databases, 406 (19.7%) were related to genes with putative function and 453 (22.0%) to the sequences registered but with unknown function. The remaining 1202 ORFs (58.3%) did not show any significant similarity to the sequences in the databases. Sequence comparison among the assigned ORFs in the genome provided evidence that a considerable number of ORFs were generated by sequence duplication. By similarity search, 11 ORFs were assumed to contain the intein elements. The RNA genes identified were a single 16S-23S rRNA operon, two 5S rRNA genes and 46 tRNA genes including two with the intron structure. All the assigned ORFs and RNA coding regions occupied 91.25% of the whole genome. The data presented in this paper are available on the internet at http://www.nite.go.jp.

3.
The genes encoding the 5S ribosomal RNA (rRNA) for Leptonema illini strain 3055 were isolated and sequenced. The 5S RNA molecule encoded was 117 nucleotides long. The genome of strain 3055 contained two genes for 5S rRNA that were located close together. The nucleotide sequences of the Leptonema illini genes exhibited less similarity to the rRNA gene of Leptospira interrogans strain Moulton and also to those of typical eubacterial genes than did the rRNA genes of other leptospires. However, the overall secondary structure of the 5S rRNA encoded exhibited a strong similarity to that of typical eubacterial 5S rRNA. Southern hybridization of the 5S rRNA gene probe with the genomic DNA of strain 965, which is currently classified as Leptospira biflexa, showed the latter to have close similarity to that of strain 3055. The physical map of strain 965 was quite similar to that of strain 3055 and was greatly different from that of any other strains of L. biflexa. In the organization of 5S rRNA genes, strain 965 is sufficiently different from other members of the genus Leptospira to be regarded as a member of the genus Leptonema.

4.
The nucleotide sequence of the complete genome of a cyanobacterium, Microcystis aeruginosa NIES-843, was determined. The genome of M. aeruginosa is a single, circular chromosome of 5,842,795 base pairs (bp) in length, with an average GC content of 42.3%. The chromosome comprises 6312 putative protein-encoding genes, two sets of rRNA genes, 42 tRNA genes representing 41 tRNA species, and genes for tmRNA, the B subunit of RNase P, SRP RNA, and 6Sa RNA. Forty-five percent of the putative protein-encoding sequences showed sequence similarity to genes of known function, 32% were similar to hypothetical genes, and the remaining 23% had no apparent similarity to reported genes. A total of 688 kb of the genome, equivalent to 11.8% of the entire genome, were composed of both insertion sequences and miniature inverted-repeat transposable elements. This is indicative of a plasticity of the M. aeruginosa genome, through a mechanism that involves homologous recombination mediated by repetitive DNA elements. In addition to known gene clusters related to the synthesis of microcystin and cyanopeptolin, novel gene clusters that may be involved in the synthesis and modification of toxic small polypeptides were identified. Compared with other cyanobacteria, a relatively small number of genes for two-component systems and a large number of genes for restriction-modification systems were notable characteristics of the M. aeruginosa genome.

5.
The complete nucleotide sequence of the genome of a symbiotic bacterium Mesorhizobium loti strain MAFF303099 was determined. The genome of M. loti consisted of a single chromosome (7,036,071 bp) and two plasmids, designated as pMLa (351,911 bp) and pMLb (208,315 bp). The chromosome comprises 6752 potential protein-coding genes, two sets of rRNA genes and 50 tRNA genes representing 47 tRNA species. Fifty-four percent of the potential protein genes showed sequence similarity to genes of known function, 21% to hypothetical genes, and the remaining 25% had no apparent similarity to reported genes. A 611-kb DNA segment, a highly probable candidate of a symbiotic island, was identified, and 30 genes for nitrogen fixation and 24 genes for nodulation were assigned in this region. Codon usage analysis suggested that the symbiotic island as well as the plasmids originated and were transmitted from other genetic systems. The genomes of two plasmids, pMLa and pMLb, contained 320 and 209 potential protein-coding genes, respectively, for a variety of biological functions. These include genes for the ABC-transporter system, phosphate assimilation, two-component system, DNA replication and conjugation, but only one gene for nodulation was identified.

6.
We have determined a 180 kb contiguous sequence in the replication origin region of the Bacillus subtilis chromosome. Open reading frames (ORFs) in this region were unambiguously identified from the determined sequence, using criteria characteristic for the B. subtilis gene structure, i.e., starting with an ATG, GTG or TTG codon preceded by sequences complementary to the 3' end of the 16S rRNA. Four rRNA gene sets, 7 individual tRNA genes and 1 scRNA gene were identified, occupying 20 kb in total. In the remaining 160 kb region, 158 ORFs were identified, suggesting that 1 ORF is coded on average by 1 kb of DNA of the B. subtilis genome. Among the 158 ORFs, the functions of 48 ORFs were assigned and those of 11 ORFs are suggested through significant similarities to known proteins present in data banks. However, the functions of more than half of the ORFs (63%) remain to be determined.
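The "1 ORF per 1 kb" and "63%" figures in this abstract follow directly from the numbers it reports. A minimal sketch of that arithmetic is given below; the function name `orf_stats` and its parameters are hypothetical, and only the input values come from the abstract.

```python
# Illustrative arithmetic for the B. subtilis origin-region abstract.
# Inputs are taken from the abstract; the function itself is a hypothetical helper.

def orf_stats(region_kb, n_orfs, n_assigned, n_suggested):
    """Return (kb of DNA per ORF, ORFs of unknown function, percent unknown)."""
    kb_per_orf = region_kb / n_orfs
    n_unknown = n_orfs - n_assigned - n_suggested
    pct_unknown = round(100 * n_unknown / n_orfs)
    return kb_per_orf, n_unknown, pct_unknown

# 180 kb sequenced minus 20 kb of RNA genes leaves the 160 kb protein-coding region.
region_kb = 180 - 20
kb_per_orf, n_unknown, pct_unknown = orf_stats(region_kb, 158, 48, 11)
print(f"{kb_per_orf:.2f} kb per ORF; {n_unknown} ORFs ({pct_unknown}%) of unknown function")
```

Under these inputs, 99 of the 158 ORFs have neither an assigned nor a suggested function, which rounds to the 63% stated in the abstract.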

7.
Lancefield group C Streptococcus dysgalactiae causes infections in farmed fish. Here, the genome of S. dysgalactiae strain kdys0611, isolated from farmed amberjack (Seriola dumerili) was sequenced. The complete genome sequence of kdys0611 consists of a single chromosome and five plasmids. The chromosome is 2,142,780 bp long and has a GC content of 40%. It possesses 2061 coding sequences and 67 tRNA and 6 rRNA operons. One clustered regularly interspaced short palindromic repeat, 125 insertion sequences, and four predicted prophage elements were identified. Phylogenetic analysis based on 126 core genes suggested that the kdys0611 strain is more closely related to S. dysgalactiae subsp. dysgalactiae than to S. dysgalactiae subsp. equisimilis. The genome of kdys0611 harbors 87 genes with sequence similarity to putative virulence-associated genes identified in other bacteria, of which 57 exhibit amino acid identity (>52%) to genes of the S. dysgalactiae subsp. equisimilis GGS124 human clinical isolate. Four putative virulence genes, emm5 (FGCSD_0256), spg_2 (FGCSD_1961), skc (FGCSD_1012), and cna (FGCSD_0159), in kdys0611 did not show significant homology with any deposited S. dysgalactiae genes. The chromosomal sequence of kdys0611 has been deposited in GenBank under Accession No. AP018726. This is the first report of the complete genome sequence of S. dysgalactiae isolated from fish.

8.
The complete nucleotide sequence of the genome of a symbiotic bacterium Bradyrhizobium japonicum USDA110 was determined. The genome of B. japonicum was a single circular chromosome 9,105,828 bp in length with an average GC content of 64.1%. No plasmid was detected. The chromosome comprises 8317 potential protein-coding genes, one set of rRNA genes and 50 tRNA genes. Fifty-two percent of the potential protein genes showed sequence similarity to genes of known function and 30% to hypothetical genes. The remaining 18% had no apparent similarity to reported genes. Thirty-four percent of the B. japonicum genes showed significant sequence similarity to those of both Mesorhizobium loti and Sinorhizobium meliloti, while 23% were unique to this species. A presumptive symbiosis island 681 kb in length, which includes a 410-kb symbiotic region previously reported by Göttfert et al., was identified. Six hundred fifty-five putative protein-coding genes were assigned in this region, and the functions of 301 genes, including those related to symbiotic nitrogen fixation and DNA transmission, were deduced. A total of 167 genes for transposases/104 copies of insertion sequences were identified in the genome. It was remarkable that 100 out of 167 transposase genes are located in the presumptive symbiotic island. DNA segments of 4 to 97 kb inserted into tRNA genes were found at 14 locations in the genome, which generates partial duplication of the target tRNA genes. These observations suggest plasticity of the B. japonicum genome, which is probably due to complex genome rearrangements such as horizontal transfer and insertion of various DNA elements, and to homologous recombination.

9.
We have sequenced the long unique region (LUR) and characterized the terminal repeats of the genome of a rhesus rhadinovirus (RRV), strain 17577. The LUR as sequenced is 131,364 bp in length, with a G+C content of 52.2% and a CpG ratio of 1.11. The genome codes for 79 open reading frames (ORFs), with 67 of these ORFs similar to genes found in both Kaposi's sarcoma-associated herpesvirus (KSHV) (formal name, human herpesvirus 8) and herpesvirus saimiri. Eight of the 12 unique genes show similarity to genes found in KSHV, including genes for viral interleukin-6, viral macrophage inflammatory protein, and a family of viral interferon regulatory factors (vIRFs). Genomic organization is essentially colinear with KSHV, the primary differences being the number of cytokine and IRF genes and the location of the gene for dihydrofolate reductase. Highly repetitive sequences are located in positions corresponding to repetitive sequences found in KSHV. Phylogenetic analysis of several ORFs supports the similarity between RRV and KSHV. Overall, the sequence, structural, and phylogenetic data combine to provide strong evidence that RRV 17577 is the rhesus macaque homolog of KSHV.

10.
The genomic DNA fragment which contains ribosomal RNA (rRNA) genes for Treponema phagedenis was cloned into bacteriophage vector lambda EMBL3. A restriction map of the fragment was constructed and the organization of the rRNA genes was determined. The fragment contained at least one copy of the 16S, 23S and 5S sequences and the genes are arranged in the order 16S-23S-5S. Southern hybridization using radiolabeled rRNA gene probes to genomic DNA from T. phagedenis strain Reiter and T. pallidum strain Nichols showed that these organisms have two radioactive fragments which hybridize to the probes in their genome. These results suggest that both pathogenic and non-pathogenic strains of Treponema may carry at least two sets of rRNA genes on their chromosomes.

11.
Darwin's paradigm holds that the diversity of present-day organisms has arisen via a process of genetic descent with modification, as on a bifurcating tree. Evidence is accumulating that genes are sometimes transferred not along lineages but rather across lineages. To the extent that this is so, Darwin's paradigm can apply only imperfectly to genomes, potentially complicating or perhaps undermining attempts to reconstruct historical relationships among genomes (i.e., a genome tree). Whether most genes in a genome have arisen via treelike (vertical) descent or by lateral transfer across lineages can be tested if enough complete genome sequences are used. We define a phylogenetically discordant sequence (PDS) as an open reading frame (ORF) that exhibits patterns of similarity relationships statistically distinguishable from those of most other ORFs in the same genome. PDSs represent between 6.0 and 16.8% (mean, 10.8%) of the analyzable ORFs in the genomes of 28 bacteria, eight archaea, and one eukaryote (Saccharomyces cerevisiae). In this study we developed and assessed a distance-based approach, based on mean pairwise sequence similarity, for generating genome trees. Exclusion of PDSs improved bootstrap support for basal nodes but altered few topological features, indicating that there is little systematic bias among PDSs. Many but not all features of the genome tree from which PDSs were excluded are consistent with the 16S rRNA tree.

12.
Comparative 16S rRNA gene sequence and genomic DNA reassociation analyses were used to assess the phylogenetic relationships of Methanobrevibacter fecal isolates. The 16S rRNA gene sequences of Methanobrevibacter smithii strain PS and the human fecal isolates B181 and ALI were essentially identical, and their genomic DNA reassociated at values greater than 94%. The analysis of 16S rRNA sequences of the horse, pig, cow, rat, and goose fecal isolates confirmed that they are members of the genus Methanobrevibacter. They had a high degree of sequence similarity (97–98%) with the 16S rRNA gene of M. smithii, indicating that they share a common line of descent. The 16S rRNA genes of the horse and pig isolates had 99.3% sequence similarity. Sequence analysis of the 16S rRNA gene of the sheep fecal isolate showed that it formed a separate line of descent in the genus Methanobrevibacter. Genomic DNA reassociation studies indicate that the horse, pig, cow, and goose fecal isolates represent at least three new species. The horse and pig isolates were the only animal isolates that had > 70% genomic DNA reassociation and represent strains of a single species. The cow, goose, and sheep isolates had little or no genomic DNA reassociation with M. smithii or with each other. The relationship of the rat isolate to the other animal isolates was not determined. An evaluation of the relationship of 16S rRNA gene sequence similarity and genomic DNA reassociation of Methanobrevibacter and other methanogenic archaea indicated that genomic DNA reassociation studies are necessary to establish that two methanogenic organisms belong to the same species. Received: 17 November 1997 / Accepted: 16 January 1998

13.
To explore the mitochondrial genes of the Cruciferae family, the mitochondrial genome of Raphanus sativus (sat) was sequenced and annotated. The circular mitochondrial genome of sat is 239,723 bp and includes 33 protein-coding genes, three rRNA genes and 17 tRNA genes. The mitochondrial genome also contains a pair of large repeat sequences 5.9 kb in length, which may mediate genome reorganization into two sub-genomic circles, with predicted sizes of 124.8 kb and 115.0 kb, respectively. Furthermore, gene evolution of mitochondrial genomes within the Cruciferae family was analyzed using the sat mitochondrial type (mitotype), together with six other reported mitotypes. The cruciferous mitochondrial genomes have maintained almost the same set of functional genes. Compared with Cycas taitungensis (a representative gymnosperm), the mitochondrial genomes of the Cruciferae have lost nine protein-coding genes and seven mitochondrial-like tRNA genes, but acquired six chloroplast-like tRNAs. Among the Cruciferae, to maintain the same set of genes that are necessary for mitochondrial function, the exons of the genes have changed at the lowest rates, as indicated by the numbers of single nucleotide polymorphisms. The open reading frames (ORFs) of unknown function in the cruciferous genomes are not conserved. Evolutionary events, such as mutations, genome reorganizations and sequence insertions or deletions (indels), have resulted in the non-conserved ORFs in the cruciferous mitochondrial genomes, which are becoming significantly different among mitotypes. This work represents the first phylogenetic explanation of the evolution of genes of known function in the Cruciferae family. It revealed significant variation in ORFs and the causes of such variation.

14.
15.
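Wang XC, Sun XY, Sun QQ, Zhang DX, Hu J, Yang Q, Hao JS. Zoological Research (《动物学研究》) 2011, 32(5): 465-475
The complete mitochondrial genome of Argyreus hyperbius (Lepidoptera: Nymphalidae) was sequenced and given a preliminary analysis. The mitogenome is 15,156 bp long and contains 13 protein-coding genes, 22 tRNA genes, 2 rRNA genes and one non-coding A+T-rich region; the gene order is the same as in other lepidopterans. The nucleotide composition and codon usage of the complete sequence show a marked A+T bias (80.8%) and a slight AT skew (−0.019). The genome contains 11 intergenic spacers of 2 to 52 bp, 96 bp in total, and 14 gene overlaps of 1 to 8 bp, 34 bp in total. Except for COI, which uses CGA as its start codon, the other 12 of the 13 protein-coding genes start with ATN. COI and COII end with a single T as the stop codon, whereas the other 11 protein-coding genes end with TAA. All tRNA genes show the typical cloverleaf structure except tRNASer(AGN), which lacks the DHU arm. The intergenic spacer between tRNA(AGN) and ND1 contains an ATACTAA motif that is conserved in Lepidoptera. The A+T-rich region has no large multi-copy repeats but contains several small repetitive structures: a 20 bp poly-T stretch downstream of the ATAGA motif, an (AT)9 repeat after the ATTTA motif, and a 5 bp poly-A stretch upstream of tRNAMet. The mitochondrial genome features of A. hyperbius revealed in this study contribute data for understanding the genetic diversity of the Nymphalidae and are of value for research on the conservation biology, population genetics, phylogeography and evolution of this species.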

16.
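Cloning and sequence analysis of the complete genome of porcine circovirus type 2 strain Yu-A (豫A)   Cited by: 13 (self-citations: 0, citations by others: 13)
With reference to complete genome sequences of porcine circovirus type 2 (PCV-2) published abroad, a pair of PCV-2-specific primers was designed. PK-15 cells were infected with PCV-2 strain Yu-A, isolated in the authors' laboratory, and replicative-form PCV-2 genomic DNA was extracted from the cells and used as the template for PCR amplification. The PCR product was recovered and used to construct the recombinant sequencing plasmid T-PCV-2. Sequencing showed that the complete genome of PCV-2 strain Yu-A is 1767 bp long, with nucleotide homology of up to 97% to foreign PCV-2 isolates deposited in GenBank. Sequence analysis showed that the replicative-form genome of strain Yu-A contains 10 open reading frames, of which ORF1 and ORF2 are the two major ones, encoding 314 and 234 amino acids, respectively. The amino acid sequence homology between strain Yu-A and PCV-1 is 85% for ORF1 and 66% for ORF2. Compared with other PCV-2 strains, amino acid homology is above 98% for ORF1 in all cases and 92% to 97% for ORF2.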

17.
Phenotypically, Photobacterium damselae subsp. piscicida and P. damselae subsp. damselae are easily distinguished. However, their 16S rRNA gene sequences are identical, and attempts to discriminate these two subspecies by molecular tools are hampered by their high level of DNA-DNA similarity. The 16S-23S rRNA internal transcribed spacers (ITS) were sequenced in two strains of Photobacterium damselae subsp. piscicida and two strains of P. damselae subsp. damselae to determine the level of molecular diversity in this DNA region. A total of 17 different ITS variants, ranging from 803 to 296 bp were found, some of which were subspecies or strain specific. The largest ITS contained four tRNA genes (tDNAs) coding for tRNA(Glu(UUC)), tRNA(Lys(UUU)), tRNA(Val(UAC)), and tRNA(Ala(GGC)). Five amplicons contained tRNA(Glu(UUC)) combined with two additional tRNA genes, including tRNA(Lys(UUU)), tRNA(Val(UAC)), or tRNA(Ala(UGC)). Five amplicons contained tRNA(Ile(GAU)) and tRNA(Ala(UGC)). Two amplicons contained tRNA(Glu(UUC)) and tRNA(Ala(UGC)). Two different isoacceptor tRNA(Ala) genes (GGC and UGC anticodons) were found. The five smallest amplicons contained no tRNA genes. The tRNA-gene combinations tRNA(Glu(UUC))-tRNA(Val(UAC))-tRNA(Ala(UGC)) and tRNA(Glu(UUC))-tRNA(Ala(UGC)) have not been previously reported in bacterial ITS regions. The number of copies of the ribosomal operon (rrn) in the P. damselae chromosome ranged from at least 9 to 12. For ITS variants coexisting in two strains of different subspecies or in strains of the same subspecies, nucleotide substitution percentages ranged from 0 to 2%. The main source of variation between ITS variants was due to different combinations of DNA sequence blocks, constituting a mosaic-like structure.

18.
The genome sequence of the lactic acid-producing bacterium Enterococcus faecium strain 8S3, isolated from bryndza, a traditional Slovak cheese produced from unpasteurised ewe milk, provides new genetic data with biotechnological potential (citrate metabolism, proteases, bacteriocin production). The genome sequence consists of 2.8 Mbp, with a mean G + C content of 38.2%, and shows high similarity to other E. faecium genome sequences. A total of 2,833 coding sequences, including 62 structural RNAs (3 rRNA and 59 tRNA), were predicted. Comparative genomic data indicate that prophages and bacteriophage remnants are the main source of diversity among E. faecium genomes.

19.
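[Background] Bacillus subtilis N2-10 is a Gram-positive bacterium with strong antimicrobial activity that produces cellulase and several other hydrolases, and it has considerable potential for use in fermented feed. [Objective] To obtain the whole-genome sequence of B. subtilis N2-10, further characterize the genes for the synthesis of secondary metabolites, and analyze the differences between strain N2-10 and type strains by comparative genomics, providing a theoretical basis for elucidating the antimicrobial and probiotic mechanisms of N2-10. [Methods] The whole genome of strain N2-10 was sequenced on the second-generation Illumina NovaSeq platform combined with the third-generation PacBio Sequel platform. The sequencing data were used for genome assembly, gene prediction and functional annotation, and comparative genomics was used to analyze the differences between N2-10 and other strains. [Results] The genome of strain N2-10 is 4,036,899 bp in size with a GC content of 43.88%. It encodes 4,163 coding genes with a total length of 3,594,369 bp, so that coding regions account for 89.04% of the genome. It contains 85 tRNAs, 10 5S rRNAs, 10 16S rRNAs and 10 23S rRNAs, as well as 2 CRISPR-Cas, 1 prophage and 6 genomic islands; in GO (gene ontology), COG (clusters of orthologous groups of...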
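The coding fraction reported for strain N2-10 can be reproduced from the genome size and total coding length given in the abstract. The sketch below is illustrative only: the helper name `coding_summary` is hypothetical, and the mean gene length it derives is not a figure stated in the abstract.

```python
# Illustrative check of the coding density reported for B. subtilis N2-10.
# The three inputs come from the abstract; the helper name is hypothetical.

def coding_summary(genome_bp, coding_bp, n_genes):
    """Return (coding density in percent, mean coding-gene length in bp)."""
    density_pct = round(100 * coding_bp / genome_bp, 2)
    mean_gene_bp = coding_bp / n_genes  # derived value, not stated in the abstract
    return density_pct, mean_gene_bp

density_pct, mean_gene_bp = coding_summary(4_036_899, 3_594_369, 4_163)
print(f"coding density: {density_pct}%; mean gene length: {mean_gene_bp:.0f} bp")
```

The computed density agrees with the 89.04% given in the abstract.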

20.
The harvest mouse (Micromys minutus) has a very wide distribution range across Asia and Europe. However, the phylogenetic relationship of M. minutus is still uncertain. In this study, we determined the complete mitochondrial (mt) genome sequence of M. minutus and used complete mitochondrial genome sequences to construct a phylogenetic tree of Muroidea. The genome is 16,232 bp in length and has a base composition of 33.6% A, 29.1% T, 24.8% C, and 12.5% G. The mitogenome structure is similar to that of typical vertebrate and other rodent mitochondrial genomes, and includes 13 protein-coding genes, 2 rRNA genes (12S rRNA and 16S rRNA), 22 tRNA genes, and 1 control region. We suggest a new initiation codon for ND5 (NADH dehydrogenase subunit 5), which has never been reported in vertebrate mitochondrial genomes. The ML and BI phylogenetic trees, based on the combination of the 12 protein-coding genes, strongly supported the genus Micromys as an early offshoot within the Muridae (BI = 1.00, ML = 100).
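The base composition in this abstract is enough to derive the A+T content and the strand skews commonly reported for mitogenomes. The sketch below assumes the usual definitions, AT skew = (A − T)/(A + T) and GC skew = (G − C)/(G + C); the abstract itself does not report these values, and the function name `composition_stats` is hypothetical.

```python
# Illustrative composition statistics for the M. minutus mitogenome.
# Base percentages come from the abstract; the skew definitions are the
# commonly used ones and are an assumption here, not taken from the abstract.

def composition_stats(a, t, c, g):
    """Return (A+T content in percent, AT skew, GC skew) from base percentages."""
    at_content = a + t
    at_skew = (a - t) / (a + t)
    gc_skew = (g - c) / (g + c)
    return at_content, at_skew, gc_skew

at_content, at_skew, gc_skew = composition_stats(a=33.6, t=29.1, c=24.8, g=12.5)
print(f"A+T = {at_content:.1f}%, AT skew = {at_skew:.3f}, GC skew = {gc_skew:.3f}")
```

A positive AT skew together with a negative GC skew means A and C outnumber T and G, respectively, on the reported strand.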
