首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
PlasmoDB (http://PlasmoDB.org) is the official database of the Plasmodium falciparum genome sequencing consortium. This resource incorporates finished and draft genome sequence data and annotation emerging from Plasmodium sequencing projects. PlasmoDB currently houses information from five parasite species and provides tools for cross-species comparisons. Sequence information is also integrated with other genomic-scale data emerging from the Plasmodium research community, including gene expression analysis from EST, SAGE and microarray projects. The relational schemas used to build PlasmoDB [Genomics Unified Schema (GUS) and RNA Abundance Database (RAD)] employ a highly structured format to accommodate the diverse data types generated by sequence and expression projects. A variety of tools allow researchers to formulate complex, biologically based queries of the database. A version of the database is also available on CD-ROM (Plasmodium GenePlot), facilitating access to the data in situations where Internet access is difficult (e.g. by malaria researchers working in the field). The goal of PlasmoDB is to enhance utilization of the vast quantities of data emerging from genome-scale projects by the global malaria research community.  相似文献   

2.
PlasmoDB (http://PlasmoDB.org) is the official database of the Plasmodium falciparum genome sequencing consortium. This resource incorporates the recently completed P. falciparum genome sequence and annotation, as well as draft sequence and annotation emerging from other Plasmodium sequencing projects. PlasmoDB currently houses information from five parasite species and provides tools for intra- and inter-species comparisons. Sequence information is integrated with other genomic-scale data emerging from the Plasmodium research community, including gene expression analysis from EST, SAGE and microarray projects and proteomics studies. The relational schema used to build PlasmoDB, GUS (Genomics Unified Schema) employs a highly structured format to accommodate the diverse data types generated by sequence and expression projects. A variety of tools allow researchers to formulate complex, biologically-based, queries of the database. A stand-alone version of the database is also available on CD-ROM (P. falciparum GenePlot), facilitating access to the data in situations where internet access is difficult (e.g. by malaria researchers working in the field). The goal of PlasmoDB is to facilitate utilization of the vast quantities of genomic-scale data produced by the global malaria research community. The software used to develop PlasmoDB has been used to create a second Apicomplexan parasite genome database, ToxoDB (http://ToxoDB.org).  相似文献   

3.
4.
5.
6.
7.
Plasmodium falciparum DNA, isolated from the merozoite stage, was cleaved with HindIII and cloned in pBR322 and lambda L47.1 vectors. Plasmid clones containing 13.4, 7.0, 4.3, 4.1 and 1.5 kb inserts were characterized in some detail. The inserts contain several repeating units of smaller size. Nucleic acid hybridization studies showed that the repeat element is present in the Plasmodium DNA at a very high copy number and appears to be distributed widely throughout the genome.  相似文献   

8.
An international consortium has been formed to sequence the entire genome of the human malaria parasite Plasmodium falciparum. We sequenced chromosome 2 of clone 3D7 using a shotgun sequencing strategy. Chromosome 2 is 947 kb in length, has a base composition of 80.2% A + T, and contains 210 predicted genes. In comparison to the Saccharomyces cerevisiae genome, chromosome 2 has a lower gene density, a greater proportion of genes containing introns, and nearly twice as many proteins containing predicted non-globular domains. A group of putative surface proteins was identified, rifins, which are encoded by a gene family comprising up to 7% of the protein-encoding gene in the genome. The rifins exhibit considerable sequence diversity and may play an important role in antigenic variation. Sixteen genes encoded on chromosome 2 showed signs of a plastid or mitochondrial origin, including several genes involved in fatty acid biosynthesis. Completion of the chromosome 2 sequence demonstrated that the A + T-rich genome of P. falciparum can be sequenced by the shotgun approach. Within 2-3 years, the sequence of almost all P. falciparum genes will have been determined, paving the way for genetic, biochemical, and immunological research aimed at developing new drugs and vaccines against malaria.  相似文献   

9.
10.
The genome of Plasmodium falciparum. I: DNA base composition.   总被引:4,自引:7,他引:4       下载免费PDF全文
Some structural properties of the DNA of Plasmodium falciparum were studied thoroughly using several techniques. Its G+C content was found to be extremely low (17-19%), the lowest reported for a living organism. The DNA seems to be composed only of the four major bases as no methylated bases were detected. This DNA had a Tm value of 62.5 degrees C and its denaturation profile showed no marked intramolecular heterogeneity.  相似文献   

11.
SUMMARY: Currently, new bacterial genomes are being published on a monthly basis. With the growing amount of genome sequence data, there is a demand for a flexible and easy-to-maintain structure for storing sequence data and results from bioinformatic analysis. More than 150 sequenced bacterial genomes are now available, and comparisons of properties for taxonomically similar organisms are not readily available to many biologists. In addition to the most basic information, such as AT content, chromosome length, tRNA count and rRNA count, a large number of more complex calculations are needed to perform detailed comparative genomics. DNA structural calculations like curvature and stacking energy, DNA compositions like base skews, oligo skews and repeats at the local and global level are just a few of the analysis that are presented on the CBS Genome Atlas Web page. Complex analysis, changing methods and frequent addition of new models are factors that require a dynamic database layout. Using basic tools like the GNU Make system, csh, Perl and MySQL, we have created a flexible database environment for storing and maintaining such results for a collection of complete microbial genomes. Currently, these results counts to more than 220 pieces of information. The backbone of this solution consists of a program package written in Perl, which enables administrators to synchronize and update the database content. The MySQL database has been connected to the CBS web-server via PHP4, to present a dynamic web content for users outside the center. This solution is tightly fitted to existing server infrastructure and the solutions proposed here can perhaps serve as a template for other research groups to solve database issues. AVAILABILITY: A web based user interface which is dynamically linked to the Genome Atlas Database can be accessed via www.cbs.dtu.dk/services/GenomeAtlas/. SUPPLEMENTARY INFORMATION: This paper has a supplemental information page which links to the examples presented: www.cbs.dtu.dk/services/GenomeAtlas/suppl/bioinfdatabase.  相似文献   

12.
Yeramian E 《Gene》2000,255(2):151-168
A gene identification procedure is formulated, based on large-scale structural analyses of genomic sequences. The structural property is the physical - thermal - stability of the DNA double-helix, as described by the classical helix-coil model. The analyses are detailed for the Plasmodium falciparum genome, which represents one of the most difficult cases for the gene identification problem (notably because of the extreme AT-richness of the genome). In this genome, the coding domains (either uninterrupted genes or exons in split genes) are accurately identified as regions of high thermal stability. The conclusion is based on the study of the available cloned genes, of which 17 examples are described in detail. These examples demonstrate that the physical criterion is valid for the detection of coding regions whose lengths extend from a few base pairs up to several thousand base pairs. Accordingly, the structural analyses can provide a powerful and convenient tool for the identification of complex genes in the P. falciparum genome. The limits of such a scheme are discussed. The gene identification procedure is applied to the completely sequenced chromosomes (2 and 3), and the results are compared with the database annotations. The structural analyses suggest more or less extensive revision to the annotations, and also allow new putative genes to be identified in the chromosome sequences. Several examples of such new genes are described in detail.  相似文献   

13.
'Knobs' at the surface of erythrocytes infected with mature stages of Plasmodium falciparum are believed to be important in adherence of these cells to capillary walls. They contain at least one parasite protein, designated the knob-associated histidine-rich protein (KAHRP). We present here the sequences of a cDNA and chromosomal clone that predict the complete sequence of KAHRP. The gene contains a single intervening sequence, located at the 3' boundary of the hydrophobic core of a putative signal sequence. Exon two encodes a short region that is rich in histidine as well as two separate regions of repetitive sequence, the 5' repeats (five copies related to SKKHKDNEDAESVK) and the 3' repeats (seven copies related to SKGATKEAST). These repeat blocks were both shown to bear epitopes recognized by the human immune system during natural infection by expressing them separately in Escherichia coli, and reacting human antibodies affinity-purified on lysates of the resulting clones with the corresponding synthetic oligopeptides. The 3' end of the molecule, presumably the repetitive region, is a site of size variation in KAHRP from different isolates.  相似文献   

14.
15.
16.
Bartonella florenciae sp. nov. strain R4T is the type strain of B. florenciae sp. nov., a new species within the genus Bartonella. This strain, whose genome is described here, was isolated in France from the spleen of the shrew Crocidura russula. B. florenciae is an aerobic, rod-shaped, Gram-negative bacterium. Here we describe the features of this organism, together with the complete genome sequence and its annotation. The 2,010,844 bp-long genome contains 1,909 protein-coding and 46 RNA genes, including two rRNA operons.  相似文献   

17.
Brevibacterium senegalense strain JC43T sp. nov. is the type strain of Brevibacterium senegalense sp. nov., a new species within the Brevibacterium genus. This strain, whose genome is described here, was isolated from the fecal flora of a healthy Senegalese patient. B. senegalense is an aerobic rod-shaped Gram-positive bacterium. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 3,425,960 bp long genome (1 chromosome but no plasmid) contains 3,064 protein-coding and 49 RNA genes.  相似文献   

18.
Strain EB01T sp. nov. is the type strain of Bacillus massilioalgeriensis, a new species within the genus Bacillus. This strain, whose genome is described here, was isolated from sediment sample of the hypersaline lake Ezzemoul sabkha in northeastern Algeria. B. massilioalgeriensis is a facultative anaerobic Gram-positive bacillus. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 5,269,577 bp long genome contains 5,098 protein-coding and 95 RNA genes, including 12 rRNA genes.  相似文献   

19.
Megasphaera massiliensis strain NP3T sp. nov. is the type strain of Megasphaera massiliensis sp. nov., a new species within the genus Megasphaera. This strain, whose genome is described here, was isolated from the fecal flora of an HIV-infected patient. M. massiliensis is a Gram-negative, obligate anaerobic coccobacillus. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 2,661,757 bp long genome (1 chromosome but no plasmid) contains 2,577 protein-coding and 61 RNA genes, including 5 rRNA genes.  相似文献   

20.
Anaerococcus senegalensis strain JC48T sp. nov. is the type strain of A. senegalensis sp. nov. a new species within the genus Anaerococcus. This strain whose genome is described here was isolated from the fecal flora of a healthy patient. A. senegalensis is an obligate anaerobic coccus. Here we describe the features of this organism together with the complete genome sequence and annotation. The 1,790,835 bp long genome (1 chromosome but no plasmid) contains 1,721 protein-coding and 53 RNA genes including 5 rRNA genes  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号