共查询到20条相似文献,搜索用时 109 毫秒
1.
Zhaohui Xiong Xudong Tang Fan Yang Xiaobing Zhang Jian Yang Lihong Chen Huan Nie Yongliang Yan Yan Jiang Jing Wang Ying Xue Xingye Xu Yafang Zhu Jie Dong Lizhe An Xunling Wang Qi Jin 《中国科学:生命科学英文版》2006,49(2):141-148
We determined and analyzed the Shigella flexneri serotype 5 (pSF5) and S. dysenteriae serotype 1 (pSD1) virulence plasmid genomes. The total length of pSF5 is 136513 bp, including 165 open reading frames (ORFs).
Of these ORFs, 133 were identified and 32 of those had no significant homology to proteins with known functions. The length
of pSD1 is 182545 bp, including 224 ORFs, of which we identified 181. The remaining 43 ORFs were not significantly homologous
to proteins with known functions. The insertion sequence (IS) elements are 53787 bp in pSF5, and 49616 bp in pSD1, which represents
39.4% and 27.1% of the genome, respectively. There are 22 IS element types in pSF5 and pSD1, among which we report ISEc8 and ISSbo6 for the first time in the Shigella virulence plasmid. Compared to pCP301, there are a large number of deleted genes and gene inversions in both pSF5 and pSD1.
The ipa-mxi-spa locus in pSF5 is completely absent, and the genes related to the O-antigen biosynthesis are partially missing.
In contrast, the above genes in pSD1 are integral, with the exception of virF. The whole genome analysis of the two plasmids shows that the loss of genes related to gene invasion or regulation also obliterates
the ability of pPF5 and pSD1 to bind Congo red (Crb). Whether these genes determine the Crb function requires continued investigation.
These authors contributed equally to this work. 相似文献
2.
The sequence of plasmid pXF51 from the plant pathogen Xylella fastidiosa, the causal agent of citrus variegated chlorosis, has been analyzed. This plasmid codes for 65 open reading frames (ORFs), organized into four main regions, containing genes related to replication, mobilization, and conjugative transfer. Twenty-five ORFs have no counterparts in the public sequence databases, and 7 are similar to conserved hypothetical proteins from other bacteria. A pXF51 incompatibility group has not been determined, as we could not find a typical replication origin. One cluster of conjugation-related genes (trb) seems to be incomplete in pXF51, and a copy of this sequence is found in the chromosome, suggesting it was generated by a duplication event. A second cluster (tra) contains all genes necessary for conjugation transfer to occur, showing a conserved organization with other conjugative plasmids. An identifiable origin of transfer similar to oriT from IncP plasmids is found adjacent to genes encoding two mobilization proteins. None of the ORFs with putative assigned function could be predicted as having a role in pathogenesis, except for a virulence-associated protein D homolog. These results indicate that even though pXF51 appears not to have a direct role in Xylella pathogenesis, it is a conjugative plasmid that could be important for lateral gene transfer in this bacterium. This property may be of great importance for future development of transformation techniques in X. fastidiosa. 相似文献
3.
Arnoud J. Kal Ewald H. Hettema Marlene van den Berg Marian Groot Koerkamp Lodewijk van Ijlst Ben Distel Henk F. Tabak 《Cell biochemistry and biophysics》2000,32(1-3):1-8
The biogenesis of peroxisomes involves the synthesis of new proteins that after, completion of translation, are targeted to
the organelle by virtue of peroxisomal targeting signals (PTS). Two types of PTSs have been well characterized for import
of matrix proteins (PTS1 and PTS2). Induction of the genes encoding these matrix proteins takes place in oleate-containing
medium and is mediated via an oleate response element (ORE) present in the region preceding these genes. The authors have
searched the yeast genome for OREs preceding open reading frames (ORFs), and for ORFs that contain either a PTS1 or PTS2.
Of the ORFs containing an ORE, as well as either a PTS1 or a PTS2, many were known to encode bona fide peroxisomal matrix proteins. In addition, candidate genes were identified as encoding putative new peroxisomal proteins.
For one case, subcellular location studies validated the in silicio prediction. This gene encodes a new peroxisomal thioesterase. 相似文献
4.
A cryptic plasmid of the wall-less plant pathogenic mollicute, Spiroplasma kunkelii CR2-3X, was cloned and its sequence analyzed. The 14,615 bp plasmid, designated pSKU146, has a nucleotide content of 28 mol% G + C, and contains 18 potential protein-coding regions (open reading frames, ORFs), of which six encode proteins that exhibit similarity to virulence-associated proteins involved in cell-to-cell adhesion or conjugal DNA transfer. One ORF encodes a 96 kDa protein, SkARP1, that is highly similar to SARP1 adhesin involved in attachment of Spiroplasma citri to insect vector gut membrane. Five ORFs encode proteins similar to TraE and Mob in walled bacteria, and to ORFs found in the integrative, conjugative element (ICEF) of Mycoplasma fermentans, respectively. Presence of domains similar to proteins of the Type IV secretion system in pathogenic bacteria suggests that spiroplasma possesses a related translocation system. Plasmid pSKU146 also contains two identical oriT regions each containing a nick sequence characteristic of the IncP conjugative plasmid family, as well as a 58 bp palindromic sequence, palSK1. Features in pSKU146 suggest that the plasmid functions as a mobile genetic element in conjugative transmission of spiroplasma pathogenicity-related genes. 相似文献
5.
6.
Kira S Makarova Alexander V Sorokin Pavel S Novichkov Yuri I Wolf Eugene V Koonin 《Biology direct》2007,2(1):33-20
Background
An evolutionary classification of genes from sequenced genomes that distinguishes between orthologs and paralogs is indispensable for genome annotation and evolutionary reconstruction. Shortly after multiple genome sequences of bacteria, archaea, and unicellular eukaryotes became available, an attempt on such a classification was implemented in Clusters of Orthologous Groups of proteins (COGs). Rapid accumulation of genome sequences creates opportunities for refining COGs but also represents a challenge because of error amplification. One of the practical strategies involves construction of refined COGs for phylogenetically compact subsets of genomes. 相似文献7.
Seishirō Aoki Akiyoshi Kawaoka Masami Sekine Takanari Ichikawa Tomomichi Fujita Atsuhiko Shinmyō Kunihiko Syōno 《Molecular & general genetics : MGG》1994,243(6):706-710
A region homologous to the TL-DNA of Agrobacterium rhizogenes was previously detected in the genome of untransformed Nicotiana glauca and designated cellular T-DNA (cT-DNA). Subsequently, part of this region was sequenced and two genes, which corresponded to rolB and rolC and were named NgrolB and NgrolC, were found. We have now sequenced a region of the cT-DNA other than the region that includes NgrolB and C and we have found two other open reading frames (ORFs), NgORF13 and NgORF14. These ORFs correspond to ORFs 13 and 14 of the TL-DNA of A. rhizogenes and exhibit a high degree of homology to these ORFs, without having a nonsense codon. We have not found any sequence homologous to rolD (ORF15). The two genes, NgORF13 and 14, as well as the NgrolB and C genes, are expressed in genetic tumors of hybrids between N. glauca and N. langsdorffii but not in leaf tissues of the hybrid. 相似文献
8.
The nucleotide sequence of a 27830-bp DNA segment in the 79°81°.region of the Bacillus subtilis genome has been determined.This region contains 29 complete ORFs including the sspE gene,which encodes a small acid-soluble spore protein gamma and locateson the one side terminal of our assigned region. A homologysearch for the products deduced from the 29 ORFs revealed thatnine of them exhibit significant similarity to known proteins,e.g. proteins involved in an iron uptake system, a multidrugresistance protein, a chloramphenicol resistance protein, epoxidehydrolase, adenine glycosylase, and a glucose-1-dehydrogenasehomolog. 相似文献
9.
Takami H Nakasone K Ogasawara N Hirama C Nakamura Y Masui N Fuji F Takaki Y Inoue A Horikoshi K 《Extremophiles : life under extreme conditions》1999,3(1):29-34
The nucleotide sequences of three independent fragments (designated no. 3, 4, and 9; each 15–20 kb in size) of the genome
of alkaliphilic Bacillus sp. C-125 cloned in a λ phage vector have been determined. Thirteen putative open reading frames (ORFs) were identified in
sequenced fragment no. 3 and 11 ORFs were identified in no. 4. Twenty ORFs were also identified in fragment no. 9. All putative
ORFs were analyzed in comparison with the BSORF database and non-redundant protein databases. The functions of 5 ORFs in fragment
no. 3 and 3 ORFs in fragment no. 4 were suggested by their significant similarities to known proteins in the database. Among
the 20 ORFs in fragment no. 9, the functions of 11 ORFs were similarly suggested. Most of the annotated ORFs in the DNA fragments
of the genome of alkaliphilic Bacillus sp. C-125 were conserved in the Bacillus subtilis genome. The organization of ORFs in the genome of strain C-125 was found to differ from the order of genes in the chromosome
of B. subtilis, although some gene clusters (ydh, yqi, yer, and yts) were conserved as operon units the same as in B. subtilis.
Received: April 17, 1998 / Accepted: June 23, 1998 相似文献
10.
Stuknyte M Guglielmetti S Mora D Kuisiene N Parini C Citavicius D 《Extremophiles : life under extreme conditions》2008,12(3):415-429
The complete nucleotide sequence (62.8 kb) of pGS18, the largest sequenced plasmid to date from the species Geobacillus stearothermophilus, was determined. Computational analysis of sequence data revealed 65 putative open reading frames (ORFs); 38 were carried
on one strand and 27 were carried on the other. These ORFs comprised 84.1% of the pGS18 sequence. Twenty-five ORFs (38.4%)
were assigned to putative functions; four ORFs (6.2%) were annotated as pseudogenes. The amino acid sequences obtained from
29 ORFs (44.6%) had the highest similarity to hypothetical proteins of the other microorganisms, and seven (10.8%) had no
significant similarity to any genes present in the current open databases. Plasmid replication region, strongly resembling
that of the theta-type replicon, and genes encoding three different plasmid maintenance systems were identified, and a putative
discontinuous transfer region was localized. In addition, we also found several mobile genetic elements and genes, responsible
for DNA repair, distributed along the whole sequence of pGS18. The alignment of pGS18 with two other large indigenous plasmids
of the genus Geobacillus highlighted the presence of well-conserved segments and has provided a framework that can be exploited to formulate hypotheses
concerning the molecular evolution of these three plasmids. 相似文献
11.
Yamashiro K Yokobori S Oshima T Yamagishi A 《Extremophiles : life under extreme conditions》2006,10(4):327-335
Thermoplasma acidophilum is a thermoacidophilic archaeon that grows optimally at pH1.8 and 56°C and has no cell wall. Plasmid pTA1 was found in some strains of the species. We sequenced plasmid pTA1 and analyzed the open reading frames (ORFs). pTA1 was found to be a circular DNA molecule of 15,723 bp. Eighteen ORFs were found; none of the gene products except ORF1 had sequence similarity to known proteins. ORF1 showed similarity to Cdc6, which is involved in genome-replication initiation in Eukarya and Archaea. T. acidophilum has two Cdc6 homologues in the genome. The homologue found in pTA1 is most similar to Tvo3, one of the three Cdc6 homologues found in the genome of Thermoplasma volcanium, among all of the Cdc6 family proteins. The phylogenetic analysis suggested that plasmid pTA1 is possibly originated from the chromosomal DNA of Thermoplasma. 相似文献
12.
Meng-Ze Du Feng-Biao Guo Yue-Yun Chen 《Journal of biomolecular structure & dynamics》2013,31(2):391-401
Abstract In this paper, we re-annotated the genome of Pyrobaculum aerophilum str. IM2, particularly for hypothetical ORFs. The annotation process includes three parts. Firstly and most importantly, 23 new genes, which were missed in the original annotation, are found by combining similarity search and the ab initio gene finding approaches. Among these new genes, five have significant similarities with function-known genes and the rest have significant similarities with hypothetical ORFs contained in other genomes. Secondly, the coding potentials of the 1645 hypothetical ORFs are re-predicted by using 33 Z curve variables combined with Fisher linear discrimination method. With the accuracy being 99.68%, 25 originally annotated hypothetical ORFs are recognized as non-coding by our method. Thirdly, 80 hypothetical ORFs are assigned with potential functions by using similarity search with BLAST program. Re-annotation of the genome will benefit related researches on this hyperthermophilic crenarchaeon. Also, the re-annotation procedure could be taken as a reference for other archaeal genomes. Details of the revised annotation are freely available at http://cobi.uestc.edu.cn/resource/paero/ 相似文献
13.
Satoh M Kubo T Nishizawa S Estiati A Itchoda N Mikami T 《Molecular genetics and genomics : MGG》2004,272(3):247-256
The complete nucleotide sequence (501,020 bp) of the mitochondrial genome from cytoplasmic male-sterile (CMS) sugar beet was determined. This enabled us to compare the sequence with that previously published for the mitochondrial genome of normal, male-fertile sugar beet. The comparison revealed that the two genomes have the same complement of genes of known function. The rRNA and tRNA genes encoded in the CMS mitochondrial genome share 100% sequence identity with their respective counterparts in the normal genome. We found a total of 24 single nucleotide substitutions in 11 protein genes encoded by the CMS mitochondrial genome. However, none of these seems to be responsible for male sterility. In addition, several other ORFs were found to be actively transcribed in sugar beet mitochondria. Among these, Norf246 was observed to be present in the normal mitochondrial genome but absent from the CMS genome. However, it seems unlikely that the loss of Norf246 is causally related to the expression of CMS, because previous studies on mitochondrial translation products failed to detect the product of this ORF. Conversely, the CMS genome contains four transcribed ORFs (Satp6presequence, Scox2-2 , Sorf324 and Sorf119) which are missing from the normal genome. These ORFs, which are potential candidates for CMS genes, were shown to be generated by mitochondrial genome rearrangements.Electronic Supplementary Material Supplementary material is available in the online version of this article at Communicated by R. Hagemann 相似文献
14.
A new plasmid designated pAsa6 from an Aeromonas salmonicida subsp. salmonicida strain isolated from diseased turbot has been characterized. pAsa6 consists of 18536 bp, has a G+C content of 53.8% and encodes 20 predicted open-reading frames (ORFs). Eight ORFs showed homology to transposases, of which six are complete and two are partial IS sequences. Two ORFs showed homology to replication proteins, and six ORFs showed homology to hypothetical proteins. Two ORFs are truncated homologs of putative A. salmonicida sulfatases. Two genes, aopH and sycH encode homologs of an effector protein for which a role in fish colonization by A. salmonicida has been previously reported, and its chaperone, respectively. The results of filter conjugation experiments suggested that pAsa6 is not mobilizable, as it failed to be conjugally-transferred to several species of marine bacteria tested. All the ORFs of pAsa6 with the exception of four copies of a IS1 transposase gene, have a counterpart in the recently sequenced 155-kb A. salmonicida plasmid pAsa5, suggesting either that pAsa6 is a derivative of pAsa5, or that pAsa5 is the result of the fusion of a pAsa6-like plasmid and a larger plasmid of ca. 135-kb. The pAsa6-encoded repA and aopH genes could be PCR-amplified from strains lacking pAsa6, suggesting presence of a large, possibly pAsa5-like plasmid that was not detected on agarose gels, or the existence of chromosome-integrated plasmid sequences. This study demonstrates that genomic locations for the aopH gene different to pAsa5 or pAsa5-like plasmids exist in A. salmonicida. 相似文献
15.
Buchnera aphidicola is a prokaryotic endosymbiont of the aphid Schizaphis graminum. From past and present nucleotide sequence analyses of the B. aphidicola genome, we have assembled a 34.7-kilobase (kb) DNA segment. This segment contains genes coding for 32 open reading frames
(ORFs), which corresponded to 89.9% of the DNA. All of these ORFs could be identified with homologous regions of the Escherichia coli genome. The order of the genes with established functions was groELS–trmE–rnpA–rpmH–dnaA–dnaN–gyrB–atpCDGAHFEB–gidA–fdx–hscA– hscB–nifS–ilvDC–rep–trxA–rho. The order of genes in small DNA fragments was conserved in both B. aphidicola and E. coli. Most of these fragments were in approximately the same region of the E. coli genome. The latter organism, however, contained many additional inserted genes within and between the fragments. The results
of the B. aphidicola genome analyses indicate that the endosymbiont has many properties of free-living bacteria.
Received: 15 August 1997 / Accepted: 29 August 1997 相似文献
16.
Comparative proteome analysis reveals pathogen specific outer membrane proteins of Leptospira
下载免费PDF全文
![点击此处可从《Proteins》网站下载免费的PDF全文](/ch/ext_images/free.gif)
Aarti Rana Rahul Brahma Yusuf Akhter Madathiparambil Gopalakrishnan Madanan 《Proteins》2018,86(7):712-722
Proteomes of pathogenic Leptospira interrogans and L. borgpetersenii and the saprophytic L. biflexa were filtered through computational tools to identify Outer Membrane Proteins (OMPs) that satisfy the required biophysical parameters for their presence on the outer membrane. A total of 133, 130, and 144 OMPs were identified in L. interrogans, L. borgpetersenii, and L. biflexa, respectively, which forms approximately 4% of proteomes. A holistic analysis of transporting and pathogenic characteristics of OMPs together with Clusters of Orthologous Groups (COGs) among the OMPs and their distribution across 3 species was made and put forward a set of 21 candidate OMPs specific to pathogenic leptospires. It is also found that proteins homologous to the candidate OMPs were also present in other pathogenic species of leptospires. Six OMPs from L. interrogans and 2 from L. borgpetersenii observed to have similar COGs while those were not found in any intermediate or saprophytic forms. These OMPs appears to have role in infection and pathogenesis and useful for anti‐leptospiral strategies. 相似文献
17.
Feng-Biao Guo 《Journal of biomolecular structure & dynamics》2013,31(2):127-133
Abstract The distribution patterns of bases of DNA fragments in different regions in P. aeruginosa genome are analyzed in this paper. It's shown that 5565 protein-coding genes, 17315 non- coding ORFs, and 1104 intergenic sequences are located into seven clusters based on their base frequencies. Almost all the protein-coding genes are contained in one of the seven clusters. The significant difference of base frequencies among three codon positions in high GC genome, which arouse the division between the distribution patterns of bases of six reading frames of protein-coding genes, is responsible for the appearance of the clustering phenomenon. In the light of the clustering phenomenon, the author supposes that the anitisense strand ORFs, particularly those corresponding to Frame 2′ and Frame 3′, may not code for proteins in P. aeruginosa genome. 相似文献
18.
Borriss M Lombardot T Glöckner FO Becher D Albrecht D Schweder T 《Extremophiles : life under extreme conditions》2007,11(1):95-104
Virion DNA of bacteriophage 11b (Φ11b), which infects a psychrophilic Flavobacterium isolate from Arctic sea-ice, was determined to consist of 36,012 bp. With 30.6% its GC content corresponds to that of host-genus
species and is the lowest of all phages of Gram-negative bacteria sequenced so far. Similarities of several of 65 predicted
ORFs, genome organization and phylogeny suggest an affiliation to ‘mesophilic’ nonmarine siphoviruses, e.g. to bacteriophages
SPP1 and HK97. Early genes presumably encode an essential recombination factor (ERF), a single strand binding (SSB) protein,
an endonuclease, and a DNA methylase. The late gene segment is likely to contain a terminase, portal, minor head, protease
and a major capsid gene. Five ORFs exhibited similarities to Bacteroidetes species and seem to reflect the host specificity of the phage. Among PAGE-separated virion proteins that were identified
by MALDI-ToF mass spectrometry are the portal, the major capsid, and a putative conserved tail protein. The Φ11b genome is
the first to be described of a cultivated virus infecting a psychrophilic host as well as a Bacteroidetes bacterium.
Electronic Supplementary Material Supplementary material is available to authorised users in the online version of this article at . 相似文献
19.
Gene recognition based on nucleotide distribution of ORFs in a hyper-thermophilic crenarchaeon, Aeropyrum pernix K1. 总被引:1,自引:0,他引:1
The 2694 ORFs originally annotated as potential genes in the genome of Aeropyrum pernix can be categorized into three clusters (A, B, C), according to their nucleotide composition at three codon positions. Coding potential was found to be responsible for the phenomenon of three clusters in a 9-dimensional space derived from the nucleotide composition of ORFs: ORFs assigned to cluster A are coding ones, while those assigned to clusters B and C are non-coding ORFs. A "codingness" index called the AZ score is defined based on a clustering method used to recognize protein-coding genes in the A. pernix genome. The criterion for a coding or non-coding ORF is based on the AZ score. ORFs with AZ > 0 or AZ < 0 are coding or non-coding, respectively. Consequently, 620 out of 632 ORFs with putative functions based on the original annotation are contained in cluster A, which have positive AZ scores. In addition, all 29 ORFs encoding putative or conserved proteins newly added in RefSeq annotation also have positive AZ scores. Accordingly, the number of re-recognized protein-coding genes in the A. pernix genome is 1610, which is significantly less than 2694 in the original annotation and also much less than 1841 in the RefSeq annotation curated by NCBI staff. Annotation information of re-recognized genes and their AZ scores are available at: http://tubic.tju.edu.cn/Aper/. 相似文献
20.
Natale DA Shankavaram UT Galperin MY Wolf YI Aravind L Koonin EV 《Genome biology》2000,1(5):research0009.1-research000919