期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

The TIGR gene indices: reconstruction and representation of expressed gene sequences 总被引：29，自引：0，他引：29

Quackenbush J Liang F Holt I Pertea G Upton J 《Nucleic acids research》2000,28(1):141-145

相似文献

2.

The Ribosomal Database Project (RDP-II): previewing a new autoaligner that allows regular updates and the new prokaryotic taxonomy 总被引：37，自引：0，他引：37

下载免费PDF全文

Cole JR Chai B Marsh TL Farris RJ Wang Q Kulam SA Chandra S McGarrell DM Schmidt TM Garrity GM Tiedje JM;Ribosomal Database Project 《Nucleic acids research》2003,31(1):442-443

The Ribosomal Database Project-II (RDP-II) pro-vides data, tools and services related to ribosomal RNA sequences to the research community. Through its website (http://rdp.cme.msu.edu), RDP-II offers aligned and annotated rRNA sequence data, analysis services, and phylogenetic inferences (trees) derived from these data. RDP-II release 8.1 contains 16 277 prokaryotic, 5201 eukaryotic, and 1503 mitochondrial small subunit rRNA sequences in aligned and annotated format. The current public beta release of 9.0 debuts a new regularly updated alignment of over 50 000 annotated (eu)bacterial sequences. New analysis services include a sequence search and selection tool (Hierarchy Browser) and a phylogenetic tree building and visualization tool (Phylip Interface). A new interactive tutorial guides users through the basics of rRNA sequence analysis. Other services include probe checking, phylogenetic placement of user sequences, screening of users' sequences for chimeric rRNA sequences, automated alignment, production of similarity matrices, and services to plan and analyze terminal restriction fragment polymorphism (T-RFLP) experiments. The RDP-II email address for questions or comments is rdpstaff@msu.edu. 相似文献

3.

Beyond the 'best' match: machine learning annotation of protein sequences by integration of different sources of information

Tetko IV Rodchenkov IV Walter MC Rattei T Mewes HW 《Bioinformatics (Oxford, England)》2008,24(5):621-628

相似文献

4.

MIPS bacterial genomes functional annotation benchmark dataset

Tetko IV Brauner B Dunger-Kaltenbach I Frishman G Montrone C Fobo G Ruepp A Antonov AV Surmeli D Mewes HW 《Bioinformatics (Oxford, England)》2005,21(10):2520-2521

MOTIVATION: Any development of new methods for automatic functional annotation of proteins according to their sequences requires high-quality data (as benchmark) as well as tedious preparatory work to generate sequence parameters required as input data for the machine learning methods. Different program settings and incompatible protocols make a comparison of the analyzed methods difficult. RESULTS: The MIPS Bacterial Functional Annotation Benchmark dataset (MIPS-BFAB) is a new, high-quality resource comprising four bacterial genomes manually annotated according to the MIPS functional catalogue (FunCat). These resources include precalculated sequence parameters, such as sequence similarity scores, InterPro domain composition and other parameters that could be used to develop and benchmark methods for functional annotation of bacterial protein sequences. These data are provided in XML format and can be used by scientists who are not necessarily experts in genome annotation. AVAILABILITY: BFAB is available at http://mips.gsf.de/proj/bfab 相似文献

5.

Comparative plant genomics resources at PlantGDB

下载免费PDF全文

Dong Q Lawrence CJ Schlueter SD Wilkerson MD Kurtz S Lushbough C Brendel V 《Plant physiology》2005,139(2):610-618

相似文献

6.

Tidying up international nucleotide sequence databases: ecological, geographical and sequence quality annotation of its sequences of mycorrhizal fungi 总被引：1，自引：0，他引：1

Tedersoo L Abarenkov K Nilsson RH Schüssler A Grelet GA Kohout P Oja J Bonito GM Veldre V Jairus T Ryberg M Larsson KH Kõljalg U 《PloS one》2011,6(9):e24940

Sequence analysis of the ribosomal RNA operon, particularly the internal transcribed spacer (ITS) region, provides a powerful tool for identification of mycorrhizal fungi. The sequence data deposited in the International Nucleotide Sequence Databases (INSD) are, however, unfiltered for quality and are often poorly annotated with metadata. To detect chimeric and low-quality sequences and assign the ectomycorrhizal fungi to phylogenetic lineages, fungal ITS sequences were downloaded from INSD, aligned within family-level groups, and examined through phylogenetic analyses and BLAST searches. By combining the fungal sequence database UNITE and the annotation and search tool PlutoF, we also added metadata from the literature to these accessions. Altogether 35,632 sequences belonged to mycorrhizal fungi or originated from ericoid and orchid mycorrhizal roots. Of these sequences, 677 were considered chimeric and 2,174 of low read quality. Information detailing country of collection, geographical coordinates, interacting taxon and isolation source were supplemented to cover 78.0%, 33.0%, 41.7% and 96.4% of the sequences, respectively. These annotated sequences are publicly available via UNITE (http://unite.ut.ee/) for downstream biogeographic, ecological and taxonomic analyses. In European Nucleotide Archive (ENA; http://www.ebi.ac.uk/ena/), the annotated sequences have a special link-out to UNITE. We intend to expand the data annotation to additional genes and all taxonomic groups and functional guilds of fungi. 相似文献

7.

ZmDB,an integrated database for maize genome research

Dong Q Roy L Freeling M Walbot V Brendel V 《Nucleic acids research》2003,31(1):244-247

Zea mays DataBase (ZmDB) seeks to provide a comprehensive view of maize (corn) genetics by linking genomic sequence data with gene expression analysis and phenotypes of mutant plants. ZmDB originated in 1999 as the Web portal for a large project of maize gene discovery, sequencing and phenotypic analysis using a transposon tagging strategy and expressed sequence tag (EST) sequencing. Recently, ZmDB has broadened its scope to include all public maize ESTs, genome survey sequences (GSSs), and protein sequences. More than 170 000 ESTs are currently clustered into approximately 20 000 contigs and about an equal number of apparent singlets. These clusters are continuously updated and annotated with respect to potential encoded protein products. More than 100 000 GSSs are similarly assembled and annotated by spliced alignment with EST and protein sequences. The ZmDB interface provides quick access to analytical tools for further sequence analysis. Every sequence record is linked to several display options and similarity search tools, including services for multiple sequence alignment, protein domain determination and spliced alignment. Furthermore, ZmDB provides web-based ordering of materials generated in the project, including ESTs, ordered collections of genomic sequences tagged with the RescueMu transposon and microarrays of amplified ESTs. ZmDB can be accessed at http://zmdb.iastate.edu/. 相似文献

8.

ANEXdb: an integrated animal ANnotation and microarray EXpression database

Oliver Couture Keith Callenberg Neeraj Koul Sushain Pandit Remy Younes Zhi-Liang Hu Jack Dekkers James Reecy Vasant Honavar Christopher Tuggle 《Mammalian genome》2009,20(11-12):768-777

相似文献

9.

The eukaryotic promoter database (EPD) 总被引：8，自引：0，他引：8

Périer RC Praz V Junier T Bonnard C Bucher P 《Nucleic acids research》2000,28(1):302-303

相似文献

10.

Identification of novel transcribed sequences on human chromosome 22 by expressed sequence tag mapping.

M Hirosawa T Nagase Y Murahashi R Kikuno O Ohara 《DNA research》2001,8(1):1-9

相似文献

11.

Solexa sequencing based transcriptome analysis of Helicoverpa armigera larvae

Jigang Li Xiumin Li Yongli Chen Zhongxiang Yang Sandui Guo 《Molecular biology reports》2012,39(12):11051-11059

相似文献

12.

UniProt蛋白质数据库简介 总被引：1，自引：0，他引：1

下载免费PDF全文

罗静初《生物信息学》2019,17(3):131-144

UniProt(https://www.uniprot.org/)是国际知名蛋白质数据库,主要包括UniProtKB知识库、UniParc归档库和UniRef参考序列集三部分。UniProtKB知识库是UniProt的核心,除蛋白质序列数据外,还包括大量注释信息。UniProtKB知识库分Swiss-Prot和TrEMBL两个子库。Swiss-Prot子库中50多万条序列均由人工审阅和注释,而TrEMBL子库中1.4亿多条序列是由核酸序列数据库EMBL中的蛋白质编码序列翻译所得,并由计算机根据一定规则进行注释。UniParc归档库将存放于不同数据库中的同一个蛋白质归并到一个记录中以避免冗余,并赋予序列唯一性特定标识符。UniRef参考序列集按相似性程度将UniProtKB和UniParc中的序列分为UniRef100、UniRef90和UniRef50三个数据集。UniProt网站为用户提供了高效实用的高级检索系统和大量帮助文档。UniProt数据库每4周发布新版的同时也发布统计报表,用户可通过统计报表了解该数据库的数据量及更新情况、数据类别和物种分布等基本信息,查看常规注释信息、序列特征注释信息和数据库交叉链接等统计数据。UniProt是目前国际上序列数据最完整、注释信息最丰富的非冗余蛋白质序列数据库,自本世纪初创建以来,为生命科学领域提供了宝贵资源。相似文献

13.

Analysis of expressed sequence tags (ESTs) from cocoa (Theobroma cacao L) upon infection with Phytophthora megakarya

Naganeeswaran SA Subbian EA Ramaswamy M 《Bioinformation》2012,8(2):65-69

Phytophthora megakarya, the causative agent of cacao black pod disease in West African countries causes an extensive loss of yield. In this study we have analyzed 4 libraries of ESTs derived from Phytophthora megakarya infected cocoa leaf and pod tissues. Totally 6379 redundant sequences were retrieved from ESTtik database and EST processing was performed using seqclean tool. Clustering and assembling using CAP3 generated 3333 non-redundant (907 contigs and 2426 singletons) sequences. The primary sequence analysis of 3333 non-redundant sequences showed that the GC percentage was 42.7 and the sequence length ranged from 101 - 2576 nucleotides. Further, functional analysis (Blast, Interproscan, Gene ontology and KEGG search) were executed and 1230 orthologous genes were annotated. Totally 272 enzymes corresponding to 114 metabolic pathways were identified. Functional annotation revealed that most of the sequences are related to molecular function, stress response and biological processes. The annotated enzymes are aldehyde dehydrogenase (E.C: 1.2.1.3), catalase (E.C: 1.11.1.6), acetyl-CoA C-acetyltransferase (E.C: 2.3.1.9), threonine ammonia-lyase (E.C: 4.3.1.19), acetolactate synthase (E.C: 2.2.1.6), O-methyltransferase (E.C: 2.1.1.68) which play an important role in amino acid biosynthesis and phenyl propanoid biosynthesis. All this information was stored in MySQL database management system to be used in future for reconstruction of biotic stress response pathway in cocoa. 相似文献

14.

GoSh: a web-based database for goat and sheep EST sequences

Caprera A Lazzari B Stella A Merelli I Caetano AR Mariani P 《Bioinformatics (Oxford, England)》2007,23(8):1043-1045

The GoSh database is a collection of 58 990 Capra hircus and Ovis aries expressed sequence tags. A perl pipeline was prepared to process sequences, and data were collected in a MySQL database. A PHP-based web interface allows browsing and querying the database. Putative single nucleotide polymorphism (SNP) detection, as well as search to repeats were performed, and links to external related resources were provided. Sequences were annotated against three different databases and an algorithm was implemented to create statistics of the distribution of retrieved homologous ontologies in the Gene Ontology categories. The GoSh database is a repository of data and links related to goat and sheep expressed genes. AVAILABILITY: The GoSh database is available at http://www.itb.cnr.it/gosh/ 相似文献

15.

Bioverse: Functional,structural and contextual annotation of proteins and proteomes

McDermott J Samudrala R 《Nucleic acids research》2003,31(13):3736-3737

Functional annotation is routinely performed for large-scale genomics projects and databases. Researchers working on more specific problems, for instance on an individual pathway or complex, also need to be able to quickly, completely and accurately annotate sequences. The Bioverse sequence annotation server (http://bioverse.compbio.washington.edu) provides a web-based interface to allow users to submit protein sequences to the Bioverse framework. Sequences are functionally and structurally annotated and potential contextual annotations are provided. Researchers can also submit candidate genomes for annotation of all proteins encoded by the genome (proteome). 相似文献

16.

The SBASE protein domain library, release 8.0: a collection of annotated protein sequence segments

Murvai J Vlahovicek K Barta E Pongor S 《Nucleic acids research》2001,29(1):58-60

SBASE 8.0 is the eighth release of the SBASE library of protein domain sequences that contains 294 898 annotated structural, functional, ligand-binding and topogenic segments of proteins, cross-referenced to most major sequence databases and sequence pattern collections. The entries are clustered into over 2005 statistically validated domain groups (SBASE-A) and 595 non-validated groups (SBASE-B), provided with several WWW-based search and browsing facilities for online use. A domain-search facility was developed, based on non-parametric pattern recognition methods, including artificial neural networks. SBASE 8.0 is freely available by anonymous 'ftp' file transfer from ftp.icgeb.trieste.it. Automated searching of SBASE can be carried out with the WWW servers http://www.icgeb.trieste.it/sbase/ and http://sbase.abc. hu/sbase/. 相似文献

17.

PipeOnline 2.0: automated EST processing and functional data sorting

Ayoubi P Jin X Leite S Liu X Martajaja J Abduraham A Wan Q Yan W Misawa E Prade RA 《Nucleic acids research》2002,30(21):4761-4769

Expressed sequence tags (ESTs) are generated and deposited in the public domain, as redundant, unannotated, single-pass reactions, with virtually no biological content. PipeOnline automatically analyses and transforms large collections of raw DNA-sequence data from chromatograms or FASTA files by calling the quality of bases, screening and removing vector sequences, assembling and rewriting consensus sequences of redundant input files into a unigene EST data set and finally through translation, amino acid sequence similarity searches, annotation of public databases and functional data. PipeOnline generates an annotated database, retaining the processed unigene sequence, clone/file history, alignments with similar sequences, and proposed functional classification, if available. Functional annotation is automatic and based on a novel method that relies on homology of amino acid sequence multiplicity within GenBank records. Records are examined through a function ordered browser or keyword queries with automated export of results. PipeOnline offers customization for individual projects (MyPipeOnline), automated updating and alert service. PipeOnline is available at http://stress-genomics.org. 相似文献

18.

An automated annotation tool for genomic DNA sequences using GeneScan and BLAST 总被引：1，自引：0，他引：1

Lynn AM Jain CK Kosalai K Barman P Thakur N Batra H Bhattacharya A 《Journal of genetics》2001,80(1):9-16

Genomic sequence data are often available well before the annotated sequence is published. We present a method for analysis of genomic DNA to identify coding sequences using the GeneScan algorithm and characterize these resultant sequences by BLAST. The routines are used to develop a system for automated annotation of genome DNA sequences. 相似文献

19.

Human immunodeficiency virus reverse transcriptase and protease sequence database 总被引：8，自引：0，他引：8

下载免费PDF全文

Shafer RW Jung DR Betts BJ Xi Y Gonzales MJ 《Nucleic acids research》2000,28(1):346-348

相似文献

20.

The TIGR Gene Indices: analysis of gene transcript sequences in highly sampled eukaryotic species

下载免费PDF全文

Quackenbush J Cho J Lee D Liang F Holt I Karamycheva S Parvizi B Pertea G Sultana R White J 《Nucleic acids research》2001,29(1):159-164

相似文献