Similar articles
20 similar articles found (search time: 15 ms)
1.
The germplasm of the genus Nicotiana contains more than 5,000 accessions and plays an important role in modern biological research. Tobacco can be used as a model system to develop methodologies for plant transformation and for investigating gene function. To advance the study of Nicotiana, a large quantity of data on germplasm, sequences, molecular markers and genetically modified tobacco needed in-depth and systematic collation, so it became necessary to establish a dedicated database for tobacco genetics and breeding. The tobacco genetics and breeding (TGB, http://yancao.sdau.edu.cn/tgb) database was developed with the aim of bringing together tobacco genetics and breeding. The database has three main features: (1) a materials database with information on 1,472 Nicotiana germplasm accessions, as well as updated genomic and expressed sequence tag (EST) data available from public databases; (2) a molecular markers database containing a total of 12,388 potential intron polymorphisms, 10,551 EST-simple sequence repeat (EST-SSR) and 66,297 genomic-SSR markers; and (3) an applications database with genetic maps and some genetically modified studies in tobacco. The TGB database also makes Basic Local Alignment Search Tool and primer designing tools publicly available. As far as can be ascertained, the TGB database is the first tobacco genetics and breeding database to be created, and this comprehensive information will aid basic research into Nicotiana and other related plants. It will serve as an excellent resource for the online tobacco research community.
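To make the notion of an SSR marker in the abstract above concrete, here is a minimal Python sketch of how perfect simple sequence repeats can be located in a nucleotide sequence. It is an illustration only: the function name `find_ssrs` and the minimum repeat counts are assumptions chosen for the example, not the criteria used to build the TGB marker sets, which the abstract does not state.

```python
import re

def find_ssrs(seq, min_repeats=None):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs.

    min_repeats maps motif length -> minimum number of tandem copies.
    The defaults are hypothetical thresholds, not those used by TGB.
    """
    if min_repeats is None:
        min_repeats = {2: 6, 3: 5, 4: 4}
    seq = seq.upper()
    hits = []
    for unit_len, min_rep in sorted(min_repeats.items()):
        # One captured motif followed by at least (min_rep - 1) more copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_rep - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are repeats of a shorter unit (e.g. "AA", "ATAT").
            if any(motif == motif[:k] * (unit_len // k)
                   for k in range(1, unit_len) if unit_len % k == 0):
                continue
            hits.append((m.start(), motif, len(m.group(0)) // unit_len))
    return sorted(hits)

example = "GGC" + "AT" * 7 + "CCGT" + "GAA" * 5 + "TTC"
print(find_ssrs(example))
```

The returned start positions are 0-based. Real marker pipelines typically also handle compound and imperfect repeats and design flanking primers, which this sketch leaves out.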

2.
Gene discovery using the maize genome database ZmDB   Total citations: 9, self-citations: 0, citations by others: 9
Zea mays DataBase (ZmDB) is a repository and analysis tool for sequence, expression and phenotype data of the major crop plant maize. The data accessible in ZmDB are mostly generated in a large collaborative project of maize gene discovery, sequencing and phenotypic analysis using a transposon tagging strategy and expressed sequence tag (EST) sequencing. ESTs constitute most of the current content. Database search tools, convenient links to external databases, and novel sequence analysis programs for spliced alignment are provided and together serve as an efficient protocol for gene discovery by sequence inspection. ZmDB can be accessed at http://zmdb.iastate.edu. ZmDB also provides web-based ordering of materials generated in the project, including EST and genomic DNA clones, seeds of mutant plants and microarrays of amplified EST and genomic DNA sequences.

3.
The Olfactory Receptor Database (ORDB) is a WWW-accessible database that stores data on Olfactory Receptor-like molecules (ORs) and has been open to the public since June 1996. It contains a public and a private area. The public area includes published DNA and protein sequence data for ORs, links to OR models and data on their expression, chromosomal localization and source organism, as well as (i) links to bibliography through PubMed and (ii) interactive WWW-based tools, such as BLAST homology searching. The private area functions as a service to laboratories that are actively cloning receptors. Source laboratories enter the sequences of the receptor clones they have characterized into the private database and can search for identical or near-identical OR sequences in both public and private databases. If another laboratory has cloned and deposited an identical or closely matching sequence, there are means for communication between the laboratories to help avoid duplication of work. ORDB is available via the WWW at http://crepe.med.yale.edu/ORDB/HTML

4.
Array-based comparative genomic hybridization (aCGH) is a molecular cytogenetic technique used in detecting and mapping DNA copy number alterations. aCGH is able to interrogate the entire genome at a previously unattainable, high resolution and has directly led to the recent appreciation of a novel class of genomic variation: copy number variation (CNV) in mammalian genomes. All forms of DNA variation/polymorphism are important for studying the basis of phenotypic diversity among individuals. CNV research is still in its infancy, requiring careful collation and annotation of accumulating CNV data that will undoubtedly be useful for accurate interpretation of genomic imbalances identified during cancer research.

5.
Human genome polymorphism is expected to play a key role in defining the etiologic basis of phenotypic differences between individuals in aspects such as drug responses and common disease predisposition. Relevant functional DNA changes will probably be located in or near transcribed sequences, and include many single nucleotide polymorphisms. To aid the future analysis of such genome variation, HGBASE (Human Genic Bi-Allelic SEquences) was constructed as a means to gather human gene-linked polymorphisms from all possible public sources, and show these as a non-redundant set of records in a standardized and user-friendly database endowed with text and sequence based search facilities. After 1 year of presence on the WWW, the HGBASE project has compiled data for over 22 000 records, and this number continues to triple every 6-12 months with data harvested or submitted from all major public genome databases and published literature from the previous decade. Extensive annotation enhancement, internal consistency checking and manual review of every record are undertaken to address potential errors and deficiencies sometimes present in the original source data. The fully polished and comprehensive database is made freely available to all at http://hgbase.cgr.ki.se

6.
7.
DAtA: database of Arabidopsis thaliana annotation   Total citations: 1, self-citations: 0, citations by others: 1
The Database of Arabidopsis thaliana Annotation (DAtA) was created to enable easy access to and analysis of all the Arabidopsis genome project annotation. The database was constructed using the completed A. thaliana genomic sequence data currently in GenBank. An automated annotation process was used to predict coding sequences for GenBank records that do not include annotation. DAtA also contains protein motifs and protein similarities derived from searches of the proteins in DAtA with motif databases and the non-redundant protein database. The database is routinely updated to include new GenBank submissions for Arabidopsis genomic sequences and new BLAST and protein motif search results. A web interface to DAtA allows coding sequences to be searched by name, comment, BLAST similarity or motif field. In addition, browse options present lists of either all the protein names or identified motifs present in the sequenced A. thaliana genome. The database can be accessed at http://baggage.stanford.edu/group/arabprotein/

8.
The genetic variability of the Brazilian physic nut (Jatropha curcas) germplasm bank (117 accessions) was assessed using a combination of phenotypic and molecular data. The joint dissimilarity matrix showed moderate correlation with the original matrices of phenotypic and molecular data. However, the correlation between the phenotypic dissimilarity matrix and the genotypic dissimilarity matrix was low. This finding indicated that molecular markers (RAPD and SSR) did not adequately sample the genomic regions that were relevant for phenotypic differentiation of the accessions. The dissimilarity values of the joint dissimilarity matrix were used to measure phenotypic + molecular diversity. This diversity varied from 0 to 1.29 among the 117 accessions, with an average dissimilarity among genotypes of 0.51. Joint analysis of phenotypic and molecular diversity indicated that the genetic diversity of the physic nut germplasm was 156% and 64% higher than the diversity estimated from phenotypic and molecular data, respectively. These results show that Jatropha genetic variability in Brazil is not as limited as previously thought.

9.
10.
A database of high-mass accuracy tryptic peptides has been created. The database contains 15 897 unique, annotated MS/MS spectra. It is possible to search for peptides according to their mass, number of missed cleavages, and sequence motifs. All of the data contained in the database is downloadable, and each spectrum can be visualized. An example is presented of how the database can be used for studying peptide fragmentation. Fragmentation of different types of missed cleaved peptides has been studied, and the results can be used to improve identification of these types of peptides.
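The search criteria named in the abstract above (peptide mass and number of missed cleavages) can be illustrated with a short Python sketch. It is not the database's own implementation: the function names, the cleavage rule (cut after K or R unless followed by P, a common convention for trypsin) and the ppm tolerance are assumptions made for the example. Residue masses are standard monoisotopic values.

```python
# Standard monoisotopic residue masses (Da); one water is added per peptide.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056

def tryptic_peptides(protein, max_missed=1):
    """Return (peptide, missed_cleavages) pairs for an in-silico digest.

    Assumed rule: cleave C-terminal to K or R, except before P.
    """
    protein = protein.upper()
    cuts = [i + 1 for i in range(len(protein) - 1)
            if protein[i] in "KR" and protein[i + 1] != "P"]
    bounds = [0] + cuts + [len(protein)]
    peptides = []
    for start in range(len(bounds) - 1):
        for missed in range(max_missed + 1):
            end = start + 1 + missed
            if end >= len(bounds):
                break
            peptides.append((protein[bounds[start]:bounds[end]], missed))
    return peptides

def peptide_mass(peptide):
    """Monoisotopic mass of the neutral, unmodified peptide."""
    return sum(RESIDUE_MASS[aa] for aa in peptide) + WATER

def search_by_mass(peptides, target, ppm=10.0):
    """Keep (peptide, missed) pairs whose mass is within ppm of target."""
    tol = target * ppm / 1e6
    return [(p, m) for p, m in peptides if abs(peptide_mass(p) - target) <= tol]

digest = tryptic_peptides("AGKRPLKDER", max_missed=1)
print(digest)
print(search_by_mass(digest, 274.16409))
```

A motif filter could be layered on top by matching a regular expression against each peptide string; modifications and charge states are ignored here for brevity.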

11.
Zea mays DataBase (ZmDB) seeks to provide a comprehensive view of maize (corn) genetics by linking genomic sequence data with gene expression analysis and phenotypes of mutant plants. ZmDB originated in 1999 as the Web portal for a large project of maize gene discovery, sequencing and phenotypic analysis using a transposon tagging strategy and expressed sequence tag (EST) sequencing. Recently, ZmDB has broadened its scope to include all public maize ESTs, genome survey sequences (GSSs), and protein sequences. More than 170 000 ESTs are currently clustered into approximately 20 000 contigs and about an equal number of apparent singlets. These clusters are continuously updated and annotated with respect to potential encoded protein products. More than 100 000 GSSs are similarly assembled and annotated by spliced alignment with EST and protein sequences. The ZmDB interface provides quick access to analytical tools for further sequence analysis. Every sequence record is linked to several display options and similarity search tools, including services for multiple sequence alignment, protein domain determination and spliced alignment. Furthermore, ZmDB provides web-based ordering of materials generated in the project, including ESTs, ordered collections of genomic sequences tagged with the RescueMu transposon and microarrays of amplified ESTs. ZmDB can be accessed at http://zmdb.iastate.edu/.

12.
13.
A data set consisting of DNA sequences from a large-scale shotgun DNA cloning and sequencing project has been collected and posted for public release. The purpose is to propose a standard genomic DNA sequencing data set by which various algorithms and implementations can be tested. This set of data is divided into two subsets, one containing raw DNA sequence data (1023 clones) and the other consisting of the corresponding partially refined or edited DNA sequence data (820 clones). Suggested criteria or guidelines for this data refinement are presented so that algorithms for preprocessing and screening raw sequences may be developed. Development of such preprocessing, screening, aligning, and assembling algorithms will expedite large-scale DNA sequencing projects so that complete, unambiguous consensus DNA sequences can be made available to the general research community more quickly. Smaller scale routine DNA sequencing projects will also be greatly aided by such computational efforts.

14.
HOWDY: an integrated database system for human genome research   Total citations: 1, self-citations: 0, citations by others: 1
HOWDY is an integrated database system for accessing and analyzing human genomic information (http://www-alis.tokyo.jst.go.jp/HOWDY/). HOWDY stores information about relationships between genetic objects and the data extracted from a number of databases. HOWDY consists of an Internet accessible user interface that allows thorough searching of the human genomic databases using the gene symbols and their aliases. It also permits flexible editing of the sequence data. The database can be searched using simple words and the search can be restricted to a specific cytogenetic location. Linear maps displaying markers and genes on contig sequences are available, from which an object can be chosen. Any search starting point identifies all the information matching the query. HOWDY provides a convenient search environment of human genomic data for scientists unsure which database is most appropriate for their search.

15.
16.
PartiGene--constructing partial genomes   Total citations: 4, self-citations: 0, citations by others: 4
Expressed sequence tags (ESTs) offer a low-cost approach to gene discovery and are being used by an increasing number of laboratories to obtain sequence information for a wide variety of organisms. The challenge lies in processing and organizing this data within a genomic context to facilitate large scale analyses. Here we present PartiGene, an integrated sequence analysis suite that uses freely available public domain software to (1) process raw trace chromatograms into sequence objects suitable for submission to dbEST; (2) place these sequences within a genomic context; (3) perform customizable first-pass annotation of the data; and (4) present the data as HTML tables and an SQL database resource. PartiGene has been used to create a number of non-model organism database resources including NEMBASE (http://www.nematodes.org) and LumbriBase (http://www.earthworms.org/). The packages are readily portable, freely available and can be run on simple Linux-based workstations. AVAILABILITY: PartiGene is available from http://www.nematodes.org/PartiGene and also forms part of the EST analysis software, associated with the Natural Environmental Research Council (UK) Bio-Linux project (http://envgen.nox.ac.uk/biolinux.html).

17.
Leukemias are exceptionally well studied at the molecular level and a wealth of high-throughput data has been published. But further utilization of these data by researchers is severely hampered by the lack of accessible integrative tools for viewing and analysis. We developed the Leukemia Gene Atlas (LGA) as a public platform designed to support research and analysis of diverse genomic data published in the field of leukemia. With respect to leukemia research, the LGA is a unique resource with comprehensive search and browse functions. It provides extensive analysis and visualization tools for various types of molecular data. Currently, its database contains data from more than 5,800 leukemia and hematopoiesis samples generated by microarray gene expression, DNA methylation, SNP and next generation sequencing analyses. The LGA allows easy retrieval of large published data sets and thus helps to avoid redundant investigations. It is accessible at www.leukemia-gene-atlas.org.

18.
19.
20.
Jing R  Johnson R  Seres A  Kiss G  Ambrose MJ  Knox MR  Ellis TH  Flavell AJ 《Genetics》2007,177(4):2263-2275
Sequence diversity of 39 dispersed gene loci was analyzed in 48 diverse individuals representative of the genus Pisum. The different genes show large variation in diversity parameters, suggesting widely differing levels of selection and a high overall diversity level for the species. The data set yields a genetic diversity tree whose deep branches, involving wild samples, are preserved in a tree derived from polymorphic retrotransposon insertions in an identical sample set. Thus, gene regions and intergenic "junk DNA" share a consistent picture for the genomic diversity of Pisum, despite low linkage disequilibrium in wild and landrace germplasm, which might be expected to allow independent evolution of these very different DNA classes. Additional lines of evidence indicate that recombination has shuffled gene haplotypes efficiently within Pisum, despite its high level of inbreeding and widespread geographic distribution. Trees derived from individual gene loci show marked differences from each other, and genetic distance values between sample pairs show high standard deviations. Sequence mosaic analysis of aligned sequences identifies nine loci showing evidence for intragenic recombination. Lastly, phylogenetic network analysis confirms the non-treelike structure of Pisum diversity and indicates the major germplasm classes involved. Overall, these data emphasize the artificiality of simple tree structures for representing genomic sequence variation within Pisum and emphasize the need for fine structure haplotype analysis to accurately define the genetic structure of the species.
