Similar articles
 Found 20 similar articles (search time: 31 ms)
1.
Sphingomonas sp. strain ATCC 31555 produces an anionic heteropolysaccharide, welan gum, which shows excellent stability and viscosity retention even at high temperatures. Here we present a 4.0-Mb assembly of its genome sequence. We have annotated 10 coding sequences (CDSs) responsible for welan gum biosynthesis and 55 CDSs related to monosaccharide metabolism.

2.
3.

Background

The Coccolithoviridae is a recently discovered family of viruses that infect the marine coccolithophorid Emiliania huxleyi. Following on from the sequencing of the type strain EhV-86, we have sequenced a second strain, EhV-163.

Results

We have sequenced approximately 80% of the EhV-163 genome, equating to more than 200 full length CDSs. Conserved and variable CDSs and a gene replacement have been identified in the EhV-86 and EhV-163 genomes.

Conclusion

The sequencing of EhV-163 has provided a wealth of information that will aid the reannotation of the EhV-86 genome, and it has identified a gene insertion in EhV-163.

4.
Diseases caused by phytopathogenic microorganisms account for enormous losses for agribusiness. Although Bacillus species are recognized as antimicrobial producers and some may provide benefits to plants, the association between Bacillus toyonensis and plants has not been studied. In this study, the whole-genome sequenced endophytic B. toyonensis BAC3151, which has demonstrated antimicrobial activity and quorum sensing inhibition of phytopathogenic bacteria, was investigated for its potential to produce compounds for biocontrol of plant pathogens. Four whole-genome sequenced B. toyonensis strains shared 3811 protein-coding DNA sequences (CDSs), while strain-specific CDSs, such as biosynthetic gene clusters of antimicrobials, were associated with specific chromosomal regions and mobile genetic elements of the strains. B. toyonensis strains had a higher frequency of putative bacteriocin gene clusters than Bacillus species traditionally used for the production of antimicrobials. In addition, gene clusters potentially involved in the production of novel bacteriocins were found in BAC3151, as well as biosynthetic genes of several other compounds, including non-ribosomal peptides, N-acyl homoserine lactonase and chitinases, revealing a genetic repertoire for antimicrobial synthesis greater than that of other Bacillus strains that have demonstrated effective activity against phytopathogens. This study showed for the first time that B. toyonensis has the potential to produce various antimicrobials, and the analyses performed indicated that the endophytic strain BAC3151 can be useful for the development of new strategies to control microbial diseases in plants that are responsible for extensive damage to agricultural crops.

5.
The gene-finding programs developed so far have not paid much attention to the detection of short protein coding regions (CDSs). However, the detection of short CDSs is important for the study of photosynthesis. We utilized GeneHacker, a gene-finding program based on the hidden Markov model (HMM), to detect short CDSs (from 90 to 300 bases) in a 1.0-Mb contiguous sequence of cyanobacterium Synechocystis sp. strain PCC6803, which carries a complete set of genes for oxygenic photosynthesis. GeneHacker differs from other gene-finding programs based on the HMM in that it utilizes di-codon statistics as well. GeneHacker successfully detected seven out of the eight short CDSs annotated in this sequence and was clearly superior to GeneMark in this range of length. GeneHacker detected 94 potentially new CDSs, 9 of which have counterparts in the genetic databases. Four of the nine CDSs were less than 150 bases and were photosynthesis-related genes. The results show the effectiveness of GeneHacker in detecting very short CDSs corresponding to genes.
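To make the 90 to 300 base length range concrete, the sketch below scans the forward strand of a sequence for open reading frames (ATG through the next in-frame stop codon) whose length falls in that window. This is only a naive ORF scan written for illustration. It is not GeneHacker's HMM and uses no di-codon statistics, and the function name `find_short_orfs` and its parameters are assumptions, not part of any published tool.

```python
# Naive forward-strand ORF scan (illustrative only; NOT the GeneHacker HMM).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_short_orfs(seq, min_len=90, max_len=300):
    """Return (start, end) index pairs of forward-strand ORFs.

    An ORF runs from an ATG to the first in-frame stop codon.
    Length is counted in bases and includes both the start and stop codon.
    Only ORFs with min_len <= length <= max_len are reported.
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None  # index of the currently open ATG, if any
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                length = i + 3 - start
                if min_len <= length <= max_len:
                    orfs.append((start, i + 3))
                start = None  # close this ORF and look for the next ATG
    return orfs


if __name__ == "__main__":
    # Toy sequence: a 96-base ORF (ATG + 30 codons + TAA) with short flanks.
    toy = "CC" + "ATG" + "GCT" * 30 + "TAA" + "GG"
    print(find_short_orfs(toy))
```

A real gene finder would also scan the reverse strand and score candidates statistically; a length filter alone reports many spurious ORFs, which is the problem an HMM-based approach is meant to address.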

6.
Mycoplasma pulmonis is a wall-less eubacterium belonging to the Mollicutes (trivial name, mycoplasmas) and responsible for murine respiratory diseases. The genome of strain UAB CTIP is composed of a single circular 963 879 bp chromosome with a G + C content of 26.6 mol%, i.e. the lowest reported among bacteria apart from Ureaplasma urealyticum. This genome contains 782 putative coding sequences (CDSs) covering 91.4% of its length; a function could be assigned to 486 CDSs, whilst 92 matched the gene sequences of hypothetical proteins, leaving 204 CDSs without a significant database match. The genome contains a single set of rRNA genes and only 29 tRNA genes. The replication origin oriC was localized by sequence analysis and by using the G + C skew method. Sequence polymorphisms within stretches of repeated nucleotides generate phase-variable protein antigens, whilst a recombinase gene is likely to catalyse the site-specific DNA inversions in major M. pulmonis surface antigens. Furthermore, a hemolysin, secreted nucleases and a glyco-protease are predicted virulence factors. Surprisingly, several of the genes previously reported to be essential for a self-replicating minimal cell are missing in the M. pulmonis genome, although it is larger than the other mycoplasma genomes fully sequenced to date.
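The two sequence statistics mentioned in this abstract, G + C content and G + C skew, can be sketched as follows. This is a minimal illustration under stated assumptions: the function names, the non-overlapping window scheme and the window size are choices made here, and the abstract does not say which variant of the skew method the authors used.

```python
def gc_content(seq):
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return sum(1 for base in seq if base in "GC") / len(seq)


def cumulative_gc_skew(seq, window=1000):
    """Cumulative sum of (G - C) / (G + C) over non-overlapping windows.

    Returns one running total per window. Windows with no G or C
    contribute zero to the running total.
    """
    seq = seq.upper()
    total = 0.0
    curve = []
    for start in range(0, len(seq), window):
        chunk = seq[start:start + window]
        g = chunk.count("G")
        c = chunk.count("C")
        if g + c:
            total += (g - c) / (g + c)
        curve.append(total)
    return curve


if __name__ == "__main__":
    # Toy sequence whose skew flips sign twice.
    toy = "GGGG" + "CCCC" + "CCCC" + "GGGG"
    curve = cumulative_gc_skew(toy, window=4)
    print(gc_content(toy), curve, curve.index(min(curve)))
```

In many bacterial chromosomes the minimum of the cumulative curve is taken as an estimate of the replication origin and the maximum as an estimate of the terminus, which is why the curve rather than the raw per-window skew is usually plotted. For a circular chromosome the result depends on where the linear sequence record starts, so the curve is typically inspected rather than trusted blindly.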

7.
The complete genomic sequence of an intracellular bacterial pathogen, Mycoplasma penetrans HF-2 strain, was determined. The HF-2 genome consists of a 1 358 633 bp single circular chromosome containing 1038 predicted coding sequences (CDSs), one set of rRNA genes and 30 tRNA genes. Among the 1038 CDSs, 264 predicted proteins are common to the Mycoplasmataceae sequenced thus far and 463 are M. penetrans specific. The genome contains the two-component system but lacks the essential cellular gene for uridine kinase. The relatively large genome of M. penetrans HF-2 among mycoplasma species may be accounted for by both its rich core proteome and the presence of a number of paralog families corresponding to 25.4% of all CDSs. The largest paralog family is the p35 family, which encodes surface lipoproteins including the major antigen, P35. A total of 44 genes for p35 and p35 homologs were identified and 30 of them form one large cluster in the chromosome. The genetic tree of p35 paralogs suggests the occurrence of dynamic chromosomal rearrangement in paralog formation during evolution. Thus, M. penetrans HF-2 may have acquired diverse repertoires of antigenic variation-related genes to allow its persistent infection in humans.

8.
The gram-negative anaerobic bacterium Porphyromonas gingivalis is a major causative agent of chronic periodontitis. Porphyromonas gingivalis strains have been classified into virulent and less-virulent strains by mouse subcutaneous soft tissue abscess model analysis. Here, we present the whole genome sequence of P. gingivalis ATCC 33277, which is classified as a less-virulent strain. We identified 2090 protein-coding sequences (CDSs), 4 RNA operons, and 53 tRNA genes in the ATCC 33277 genome. By genomic comparison with the virulent strain W83, we identified 461 ATCC 33277-specific and 415 W83-specific CDSs. Extensive genomic rearrangements were observed between the two strains: 175 regions in which genomic rearrangements have occurred were identified. Thirty-five of those genomic rearrangements were inversions or translocations and 140 were simple insertions, deletions, or replacements. Both strains contained large numbers of mobile elements, such as insertion sequences, miniature inverted-repeat transposable elements (MITEs), and conjugative transposons, which are frequently associated with genomic rearrangements. These findings indicate that the mobile genetic elements have been deeply involved in the extensive genome rearrangement of P. gingivalis and the occurrence of many of the strain-specific CDSs. We also describe here a unique feature of MITE400, which we renamed MITEPgRS (MITE of P. gingivalis with Repeating Sequences).
Key words: Porphyromonas gingivalis, whole genome sequence, genome rearrangement, conjugative transposon, MITE

9.
F Tao  X Wang  C Ma  C Yang  H Tang  Z Gai  P Xu 《Journal of bacteriology》2012,194(17):4755-4756
Xanthomonas campestris JX, a soil bacterium, is an industrially productive strain for xanthan gum. Here we present a 5.0-Mb assembly of its genome sequence. We have annotated 12 coding sequences (CDSs) responsible for xanthan gum biosynthesis, 346 CDSs involved in carbohydrate metabolism, and 69 CDSs related to virulence, defense, and plant disease.

10.

Background

Complete genome annotation is a necessary tool as Anopheles gambiae researchers probe the biology of this potent malaria vector.

Results

We reannotate the A. gambiae genome by synthesizing comparative and ab initio sets of predicted coding sequences (CDSs) into a single set using an exon-gene-union algorithm followed by an open-reading-frame-selection algorithm. The reannotation predicts 20,970 CDSs supported by at least two lines of evidence, and it lowers the proportion of CDSs lacking start and/or stop codons to only approximately 4%. The reannotated CDS set includes a set of 4,681 novel CDSs not represented in the Ensembl annotation but with EST support, and another set of 4,031 Ensembl-supported genes that undergo major structural and, therefore, probably functional changes in the reannotated set. The quality and accuracy of the reannotation were assessed by comparison with end sequences from 20,249 full-length cDNA clones, and evaluation of mass spectrometry peptide hit rates from an A. gambiae shotgun proteomic dataset confirms that the reannotated CDSs offer a high quality protein database for proteomics. We provide a functional proteomics annotation, ReAnoXcel, obtained by analysis of the new CDSs through the AnoXcel pipeline, which allows functional comparisons of the CDS sets within the same bioinformatic platform. CDS data are available for download.

Conclusion

Comprehensive A. gambiae genome reannotation is achieved through a combination of comparative and ab initio gene prediction algorithms.

11.
Erwinia amylovora causes the economically important disease fire blight that affects rosaceous plants, especially pear and apple. Here we report the complete genome sequence and annotation of strain ATCC 49946. The analysis of the sequence and its comparison with sequenced genomes of closely related enterobacteria revealed signs of pathoadaptation to rosaceous hosts.

Erwinia amylovora, a plant-associated member of the Enterobacteriaceae, causes fire blight, a devastating disease of rosaceous plants, especially pear and apple (6). The complete genome of Ea273 (ATCC 49946), a virulent strain isolated from an infected apple tree in New York State, was sequenced. Total DNA was extracted and prepared in pMAQ1 shotgun libraries. The complete shotgun sequence was obtained by using dye terminator chemistry in ABI 3730 automated sequencers and contains 88,457 reads (11.12-fold coverage), yielding a theoretical coverage of the genome of 99.99%. The sequence was assembled, finished, and annotated as described previously (1, 5), using Artemis (4) to collate data and facilitate annotation.

The genome of E. amylovora consists of a circular chromosome of 3,805,874 bp and two plasmids, AMYP1 (28,243 bp) and AMYP2 (71,487 bp). Coding regions in the chromosome account for 85.1% of the total sequence, with 3,483 identified coding sequences (CDSs). Two hundred fifty-four (7%) of the CDSs do not have any matches in current NCBI databases; 114 (3.3%) correspond to conserved hypothetical proteins. Forty-nine CDSs (1.4%) are similar to genes from mobile elements such as integrases, transposases, and bacteriophages, and 110 CDSs (3.2%) were classified as pseudogenes due to interruptions or truncations of the CDSs. The remaining 2,956 annotated CDSs include, among other categories, genes involved in biosynthesis of the cellular envelope and modifications of surface proteins (299 CDSs [11%]) and genes involved in signal transduction and regulation (228 CDSs [8%]). Seven rRNA operons and 78 tRNA sequences were identified in the chromosome; two new clusters were identified (AMY1550-1575 and AMY2648-2676) that resemble the T3SS-encoding SSR-1 island of Sodalis glossinidius (2), and four clusters that contain genes for biosynthesis of flagella, which based on their location might be regulated independently.

The smaller plasmid, AMYP1, had been reported as pEA29 (3); its sequence is nearly identical to the one reported here. The larger plasmid, AMYP2, renamed pEA72 for consistency in nomenclature, contains 87 predicted CDSs, with two predicted mobile-element-related CDSs and one pseudogene. Among the CDSs with annotated functions is a cluster of genes (AMYP2_49 to AMYP2_62) that encode a putative type IV fimbrial system (pil genes).

The genome of E. amylovora is only 3.8 Mb long, whereas most free-living enterobacteria, including plant pathogens, have genomes of 4.5 Mb to 5.5 Mb. Comparison of the genome of Ea273 with the sequenced genomes of 15 closely related enterobacteria identified 21 lineage-specific regions, which might be considered genomic islands. E. amylovora has many more predicted pseudogenes relative to other enterobacteria with similar lifestyles. Given its size and the preponderance of pseudogenes, genome reduction may have occurred via mutational inactivation and subsequent deletion with the following consequences: E. amylovora has fewer genes involved in anaerobic respiration and fermentation than are found in typical related enterobacteria; this likely results in a reduced capacity to live in anaerobic environments.

The genome sequence of E. amylovora has revealed clear signs of pathoadaptation to the rosaceous plant environment. For example, T3SS-related proteins are present that are more similar to proteins of other plant pathogens than to proteins of closely related enterobacteria. These include type III effectors, homologous to those of plant-pathogenic pseudomonads, which confer virulence to E. amylovora in plants, and a sorbitol-metabolizing cluster that may confer a competitive advantage for survival in rosaceous plants. The reduced genome size and erosion or loss of genes involved in anaerobic respiration and nitrate assimilation are remarkable, relative to other plant- and animal-pathogenic members of the Enterobacteriaceae.
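The relation between the 11.12-fold read coverage and the 99.99% theoretical genome coverage quoted in the E. amylovora report can be checked with the standard Lander-Waterman approximation, in which the expected fraction of bases covered at least once is 1 - exp(-c) for fold coverage c. The sketch below is an assumption about how such a figure can be derived. The report does not state which formula its authors used, and the function names are hypothetical.

```python
import math


def fold_coverage(n_reads, mean_read_len, genome_len):
    """Average number of reads covering each base."""
    return n_reads * mean_read_len / genome_len


def expected_fraction_covered(coverage):
    """Lander-Waterman estimate of the fraction of bases hit by >= 1 read.

    Assumes reads land uniformly at random and ignores the minimum
    overlap needed for assembly.
    """
    return 1.0 - math.exp(-coverage)


if __name__ == "__main__":
    # 11.12-fold coverage is the value reported in the abstract.
    frac = expected_fraction_covered(11.12)
    print(f"{frac:.6%}")
```

Under this model, 11.12-fold coverage leaves an expected uncovered fraction of roughly 1.5 in 100,000 bases, which is consistent with the reported 99.99% once truncated to two decimal places.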

12.

Background

Xanthomonas campestris pathovar campestris (Xcc) is the causal agent of black rot disease of crucifers worldwide. The molecular genetic diversity and host specificity of Xcc are poorly understood.

Results

We constructed a microarray based on the complete genome sequence of Xcc strain 8004 and investigated the genetic diversity and host specificity of Xcc by array-based comparative genome hybridization analyses of 18 virulent strains. The results demonstrate that a genetic core comprising 3,405 of the 4,186 coding sequences (CDSs) spotted on the array is conserved and that a flexible gene pool with 730 CDSs is absent/highly divergent (AHD). The results also revealed that 258 of the 304 proved/presumed pathogenicity genes are conserved and 46 are AHD. The conserved pathogenicity genes include mainly the genes involved in type I, II and III secretion systems, the quorum sensing system, extracellular enzymes and polysaccharide production, as well as many other proved pathogenicity genes, while the AHD CDSs contain the genes encoding the type IV secretion system (T4SS) and type III effectors. An Xcc T4SS-deletion mutant displayed the same virulence as the wild type. Furthermore, three avirulence genes (avrXccC, avrXccE1 and avrBs1) were identified. avrXccC and avrXccE1 conferred avirulence on the hosts mustard cultivar Guangtou and Chinese cabbage cultivar Zhongbai-83, respectively, and avrBs1 conferred hypersensitive response on the nonhost pepper ECW10R.

Conclusion

About 80% of the Xcc CDSs, including 258 proved/presumed pathogenicity genes, are conserved in different strains. Xcc T4SS is not involved in pathogenicity. An efficient strategy to identify avr genes determining host specificity from the AHD genes was developed.

13.
Tao F  Tang H  Gai Z  Su F  Wang X  He X  Xu P 《Journal of bacteriology》2011,193(24):7011-7012
Pseudomonas putida Idaho is an organic-solvent-tolerant strain which can degrade and adapt to high concentrations of organic solvents. Here, we announce its first draft genome sequence (6,363,067 bp). We annotated 192 coding sequences (CDSs) responsible for aromatic compound metabolism, 40 CDSs involved in phospholipid synthesis, and 212 CDSs related to stress response.

14.

Background

Laribacter hongkongensis is associated with community-acquired gastroenteritis and traveler's diarrhea. In this study, we performed an in-depth annotation of the genes in its genome related to the various steps in the infective process, drug resistance and mobile genetic elements.

Results

For acid and bile resistance, L. hongkongensis possessed a urease gene cassette, two arc gene clusters and bile salt efflux systems. For intestinal colonization, it possessed a putative adhesin of the autotransporter family homologous to those of diffusely adherent Escherichia coli (E. coli) and enterotoxigenic E. coli. To evade host defense, it possessed superoxide dismutase and catalases. For lipopolysaccharide biosynthesis, it possessed the same set of genes that encode enzymes for synthesizing lipid A, two Kdo units and heptose units as E. coli, but different genes for its symmetrical acylation pattern, and nine genes for polysaccharide side chain biosynthesis. It contained a number of CDSs that encode putative cell surface-acting (RTX toxin and hemolysins) and intracellular cytotoxins (patatin-like proteins) and enzymes for invasion (outer membrane phospholipase A). It contained a broad variety of antibiotic resistance-related genes, including genes related to β-lactam (n = 10) and multidrug efflux (n = 54). It also contained eight prophages, 17 other phage-related CDSs and 26 CDSs for transposases.

Conclusions

The L. hongkongensis genome possessed genes for acid and bile resistance, intestinal mucosa colonization, evasion of host defense, and cytotoxicity and invasion. A broad variety of antibiotic resistance or multidrug resistance genes, a high number of prophages, other phage-related CDSs and CDSs for transposases were also identified.

15.
We present an annotation pipeline that accurately predicts exon–intron structures and protein-coding sequences (CDSs) on the basis of full-length cDNAs (FLcDNAs). This annotation pipeline was used to identify genes in 10 plant genomes. In particular, we show that interspecies mapping of FLcDNAs to genomes is of great value in fully utilizing FLcDNA resources whose availability is limited to several species. Because low sequence conservation at 5′- and 3′-ends of FLcDNAs between different species tends to result in truncated CDSs, we developed an improved algorithm to identify complete CDSs by the extension of both ends of truncated CDSs. Interspecies mapping of 71 801 monocot FLcDNAs to the Oryza sativa genome led to the detection of 22 142 protein-coding regions. Moreover, in comparing two mapping programs and three ab initio prediction programs, we found that our pipeline was more capable of identifying complete CDSs. As demonstrated by monocot interspecies mapping, in which nucleotide identity between FLcDNAs and the genome was ∼80%, the resultant inferred CDSs were sufficiently accurate. Finally, we applied both inter- and intraspecies mapping to 10 monocot and dicot genomes and identified genes in 210 551 loci. Interspecies mapping of FLcDNAs is expected to effectively predict genes and CDSs in newly sequenced genomes.

16.
Burkholderia sp. strain SJ98 exhibits chemotactic activity towards nitroaromatic and chloronitroaromatic compounds. Recently our group published a draft genome of strain SJ98. In this study, we further sequence and annotate the genome of strain SJ98 to exploit the potential of this bacterium. We specifically annotate its chemotaxis genes and methyl-accepting chemotaxis proteins. The genome of Burkholderia sp. SJ98 was annotated using the PGAAP pipeline, which predicted 7,268 CDSs, 52 tRNAs and 3 rRNAs. Our analysis based on phylogenetics and comparative genomics suggests that Burkholderia sp. YI23 is the closest neighbor of strain SJ98. The genes involved in the chemotaxis of strain SJ98 were compared with genes of closely related Burkholderia strains (i.e. YI23, CCGE 1001, CCGE 1002, CCGE 1003) and with the well-characterized bacterium E. coli K12. It was found that strain SJ98 has 37 che genes, including 19 methyl-accepting chemotaxis proteins that are involved in sensing different attractants. Chemotaxis genes were found in a cluster along with the flagellar motor proteins. We also developed a web resource that provides comprehensive information on strain SJ98, including all analysis data (http://crdd.osdd.net/raghava/genomesrs/burkholderia/).

17.
Comparative genomics have revealed massive horizontal gene transfer (HGT) between Mycoplasma species sharing common ruminant hosts. Further results pointed toward an integrative conjugative element (ICE) as an important contributor of HGT in the small-ruminant-pathogen Mycoplasma agalactiae. To estimate the prevalence of ICEs in ruminant mycoplasmas, we surveyed their occurrence in a collection of 166 field strains representing 4 (sub)species that are recognized as major pathogens. Based on available sequenced genomes, we first defined the conserved, minimal ICE backbone as composed of 4 coding sequences (CDSs) that are evenly distributed and predicted to be essential for ICE chromosomal integration-excision and horizontal transfer. Screening of the strain collection revealed that these 4 CDSs are well represented in ruminant Mycoplasma species, suggesting widespread occurrence of ICEs. Yet their prevalence varies within and among species, with no correlation found with the individual strain history. Extrachromosomal ICE forms were also often detected, suggesting that ICEs are able to circularize in all species, a first and essential step in ICE horizontal transfer. Examination of the junction of the circular forms and comparative sequence analysis of conserved CDSs clearly pointed toward two types of ICE, the hominis and spiroplasma types, most likely differing in their mechanism of excision-integration. Overall, our data indicate the occurrence and maintenance of functional ICEs in a large number of field isolates of ruminant mycoplasmas. These may contribute to genome plasticity and gene exchanges and, presumably, to the emergence of diverse genotypes within pathogenic mycoplasmas of veterinary importance.

18.
19.
Presented here is the complete genome sequence of Thiomicrospira crunogena XCL-2, representative of ubiquitous chemolithoautotrophic sulfur-oxidizing bacteria isolated from deep-sea hydrothermal vents. This gammaproteobacterium has a single chromosome (2,427,734 base pairs), and its genome illustrates many of the adaptations that have enabled it to thrive at vents globally. It has 14 methyl-accepting chemotaxis protein genes, including four that may assist in positioning it in the redoxcline. A relative abundance of coding sequences (CDSs) encoding regulatory proteins likely controls the expression of genes encoding carboxysomes, multiple dissolved inorganic nitrogen and phosphate transporters, as well as a phosphonate operon, which provide this species with a variety of options for acquiring these substrates from the environment. Thiom. crunogena XCL-2 is unusual among obligate sulfur-oxidizing bacteria in relying on the Sox system for the oxidation of reduced sulfur compounds. The genome has characteristics consistent with an obligately chemolithoautotrophic lifestyle, including few transporters predicted to have organic allocrits, and Calvin-Benson-Bassham cycle CDSs scattered throughout the genome.

20.
To investigate the feasibility of conducting a genomic-scale protein labeling and localization study in Escherichia coli, a representative subset of 23 coding DNA sequences (CDSs) was selected for chromosomal tagging with one or more fluorescent protein genes (EGFP, EYFP, mRFP1, DsRed2). We used λ-Red recombination to precisely and efficiently position PCR-generated DNA targeting cassettes containing a fluorescent protein gene and an antibiotic resistance marker, at the C-termini of the CDSs of interest, creating in-frame fusions under the control of their native promoters. We incorporated cre/loxP and flpe/frt technology to enable multiple rounds of chromosomal tagging events to be performed sequentially with minimal disruption to the target locus, thus allowing sets of proteins to be co-localized within the cell. The visualization of labeled proteins in live E. coli cells using fluorescence microscopy revealed a striking variety of distributions including: membrane and nucleoid association, polar foci and diffuse cytoplasmic localization. Fifty of the fifty-two independent targeting experiments performed were successful, and 21 of the 23 selected CDSs could be fluorescently visualized. Our results show that E. coli has an organized and dynamic proteome, and demonstrate that this approach is applicable for tagging and (co-) localizing CDSs on a genome-wide scale.
