期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

MBRole: enrichment analysis of metabolomic data

Chagoyen M Pazos F 《Bioinformatics (Oxford, England)》2011,27(5):730-731

相似文献

2.

The Arabidopsis Information Resource (TAIR): a model organism database providing a centralized,curated gateway to Arabidopsis biology,research materials and community

Rhee SY Beavis W Berardini TZ Chen G Dixon D Doyle A Garcia-Hernandez M Huala E Lander G Montoya M Miller N Mueller LA Mundodi S Reiser L Tacklind J Weems DC Wu Y Xu I Yoo D Yoon J Zhang P 《Nucleic acids research》2003,31(1):224-228

Arabidopsis thaliana is the most widely-studied plant today. The concerted efforts of over 11 000 researchers and 4000 organizations around the world are generating a rich diversity and quantity of information and materials. This information is made available through a comprehensive on-line resource called the Arabidopsis Information Resource (TAIR) (http://arabidopsis.org), which is accessible via commonly used web browsers and can be searched and downloaded in a number of ways. In the last two years, efforts have been focused on increasing data content and diversity, functionally annotating genes and gene products with controlled vocabularies, and improving data retrieval, analysis and visualization tools. New information include sequence polymorphisms including alleles, germplasms and phenotypes, Gene Ontology annotations, gene families, protein information, metabolic pathways, gene expression data from microarray experiments and seed and DNA stocks. New data visualization and analysis tools include SeqViewer, which interactively displays the genome from the whole chromosome down to 10 kb of nucleotide sequence and AraCyc, a metabolic pathway database and map tool that allows overlaying expression data onto the pathway diagrams. Finally, we have recently incorporated seed and DNA stock information from the Arabidopsis Biological Resource Center (ABRC) and implemented a shopping-cart style on-line ordering system. 相似文献

3.

An Approach to Function Annotation for Proteins of Unknown Function (PUFs) in the Transcriptome of Indian Mulberry

K. H. Dhanyalakshmi Mahantesha B. N. Naika R. S. Sajeevan Oommen K. Mathew K. Mohamed Shafi Ramanathan Sowdhamini Karaba N. Nataraja 《PloS one》2016,11(3)

相似文献

4.

VIZARD: analysis of Affymetrix Arabidopsis GeneChip data

Moseyko N Feldman LJ 《Bioinformatics (Oxford, England)》2002,18(9):1264-1265

SUMMARY: The Affymetrix GeneChip Arabidopsis genome array has proved to be a very powerful tool for the analysis of gene expression in Arabidopsis thaliana, the most commonly studied plant model organism. VIZARD is a Java program created at the University of California, Berkeley, to facilitate analysis of Arabidopsis GeneChip data. It includes several integrated tools for filtering, sorting, clustering and visualization of gene expression data as well as tools for the discovery of regulatory motifs in upstream sequences. VIZARD also includes annotation and upstream sequence databases for the majority of genes represented on the Affymetrix Arabidopsis GeneChip array. AVAILABILITY: VIZARD is available free of charge for educational, research, and not-for-profit purposes, and can be downloaded at http://www.anm.f2s.com/research/vizard/ CONTACT: moseyko@uclink4.berkeley.edu 相似文献

5.

The SPECIES and ORGANISMS Resources for Fast and Accurate Identification of Taxonomic Names in Text

Evangelos Pafilis Sune P. Frankild Lucia Fanini Sarah Faulwetter Christina Pavloudi Aikaterini Vasileiadou Christos Arvanitidis Lars Juhl Jensen 《PloS one》2013,8(6)

The exponential growth of the biomedical literature is making the need for efficient, accurate text-mining tools increasingly clear. The identification of named biological entities in text is a central and difficult task. We have developed an efficient algorithm and implementation of a dictionary-based approach to named entity recognition, which we here use to identify names of species and other taxa in text. The tool, SPECIES, is more than an order of magnitude faster and as accurate as existing tools. The precision and recall was assessed both on an existing gold-standard corpus and on a new corpus of 800 abstracts, which were manually annotated after the development of the tool. The corpus comprises abstracts from journals selected to represent many taxonomic groups, which gives insights into which types of organism names are hard to detect and which are easy. Finally, we have tagged organism names in the entire Medline database and developed a web resource, ORGANISMS, that makes the results accessible to the broad community of biologists. The SPECIES software is open source and can be downloaded from http://species.jensenlab.org along with dictionary files and the manually annotated gold-standard corpus. The ORGANISMS web resource can be found at http://organisms.jensenlab.org. 相似文献

6.

Using the Semantic Web for Rapid Integration of WikiPathways with Other Biological Online Data Resources

Andra Waagmeester Martina Kutmon Anders Riutta Ryan Miller Egon L. Willighagen Chris T. Evelo Alexander R. Pico 《PLoS computational biology》2016,12(6)

The diversity of online resources storing biological data in different formats provides a challenge for bioinformaticians to integrate and analyse their biological data. The semantic web provides a standard to facilitate knowledge integration using statements built as triples describing a relation between two objects. WikiPathways, an online collaborative pathway resource, is now available in the semantic web through a SPARQL endpoint at http://sparql.wikipathways.org. Having biological pathways in the semantic web allows rapid integration with data from other resources that contain information about elements present in pathways using SPARQL queries. In order to convert WikiPathways content into meaningful triples we developed two new vocabularies that capture the graphical representation and the pathway logic, respectively. Each gene, protein, and metabolite in a given pathway is defined with a standard set of identifiers to support linking to several other biological resources in the semantic web. WikiPathways triples were loaded into the Open PHACTS discovery platform and are available through its Web API (https://dev.openphacts.org/docs) to be used in various tools for drug development. We combined various semantic web resources with the newly converted WikiPathways content using a variety of SPARQL query types and third-party resources, such as the Open PHACTS API. The ability to use pathway information to form new links across diverse biological data highlights the utility of integrating WikiPathways in the semantic web. 相似文献

7.

RNA-CODE: A Noncoding RNA Classification Tool for Short Reads in NGS Data Lacking Reference Genomes

Cheng Yuan Yanni Sun 《PloS one》2013,8(10)

相似文献

8.

Tbl2KnownGene: A command-line program to convert NCBI.tbl to UCSC knownGene.txt data file

Yongsheng Bai 《Bioinformation》2014,10(8):544-547

相似文献

9.

Leveraging hierarchical population structure in discrete association studies

Carlson J Kadie C Mallal S Heckerman D 《PloS one》2007,2(7):e591

相似文献

10.

YPD, PombePD and WormPD: model organism volumes of the BioKnowledge library, an integrated resource for protein information

Costanzo MC Crawford ME Hirschman JE Kranz JE Olsen P Robertson LS Skrzypek MS Braun BR Hopkins KL Kondu P Lengieza C Lew-Smith JE Tillberg M Garrels JI 《Nucleic acids research》2001,29(1):75-79

The BioKnowledge Library is a relational database and web site (http://www.proteome.com) composed of protein-specific information collected from the scientific literature. Each Protein Report on the web site summarizes and displays published information about a single protein, including its biochemical function, role in the cell and in the whole organism, localization, mutant phenotype and genetic interactions, regulation, domains and motifs, interactions with other proteins and other relevant data. This report describes four species-specific volumes of the BioKnowledge Library, concerned with the model organisms Saccharomyces cerevisiae (YPD), Schizosaccharomyces pombe (PombePD) and Caenorhabditis elegans (WormPD), and with the fungal pathogen Candida albicans (CalPD). Protein Reports of each species are unified in format, easily searchable and extensively cross-referenced between species. The relevance of these comprehensively curated resources to analysis of proteins in other species is discussed, and is illustrated by a survey of model organism proteins that have similarity to human proteins involved in disease. 相似文献

11.

SECRETOOL: integrated secretome analysis tool for fungi

Ana R. Cortázar Ana M. Aransay Manuel Alfaro José A. Oguiza José L. Lavín 《Amino acids》2014,46(2):471-473

The secretome (full set of secreted proteins) has been studied in multiple fungal genomes to elucidate the potential role of those protein collections involved in a number of metabolic processes from host infection to wood degradation. Being aminoacid composition a key factor to recognize secretory proteins, SECRETOOL comprises a group of web tools that enable secretome predictions out of aminoacid sequence files, up to complete fungal proteomes, in one step. SECRETOOL is freely available on the web at http://genomics.cicbiogune.es/SECRETOOL/Secretool.php. 相似文献

12.

Transcriptator: An Automated Computational Pipeline to Annotate Assembled Reads and Identify Non Coding RNA

Kumar Parijat Tripathi Daniela Evangelista Antonio Zuccaro Mario Rosario Guarracino 《PloS one》2015,10(11)

相似文献

13.

iRegNet: an integrative Regulatory Network analysis tool for Arabidopsis thaliana

Sangrea Shim Chung-Mo Park Pil Joon Seo 《Plant physiology》2021,187(3):1292

相似文献

14.

TAIR: a resource for integrated Arabidopsis data

Garcia-Hernandez M Berardini TZ Chen G Crist D Doyle A Huala E Knee E Lambrecht M Miller N Mueller LA Mundodi S Reiser L Rhee SY Scholl R Tacklind J Weems DC Wu Y Xu I Yoo D Yoon J Zhang P 《Functional & integrative genomics》2002,2(6):239-253

相似文献

15.

A RESTful API for Accessing Microbial Community Data for MG-RAST

Andreas Wilke Jared Bischof Travis Harrison Tom Brettin Mark D'Souza Wolfgang Gerlach Hunter Matthews Tobias Paczian Jared Wilkening Elizabeth M. Glass Narayan Desai Folker Meyer 《PLoS computational biology》2015,11(1)

Metagenomic sequencing has produced significant amounts of data in recent years. For example, as of summer 2013, MG-RAST has been used to annotate over 110,000 data sets totaling over 43 Terabases. With metagenomic sequencing finding even wider adoption in the scientific community, the existing web-based analysis tools and infrastructure in MG-RAST provide limited capability for data retrieval and analysis, such as comparative analysis between multiple data sets. Moreover, although the system provides many analysis tools, it is not comprehensive. By opening MG-RAST up via a web services API (application programmers interface) we have greatly expanded access to MG-RAST data, as well as provided a mechanism for the use of third-party analysis tools with MG-RAST data. This RESTful API makes all data and data objects created by the MG-RAST pipeline accessible as JSON objects. As part of the DOE Systems Biology Knowledgebase project (KBase, http://kbase.us) we have implemented a web services API for MG-RAST. This API complements the existing MG-RAST web interface and constitutes the basis of KBase''s microbial community capabilities. In addition, the API exposes a comprehensive collection of data to programmers. This API, which uses a RESTful (Representational State Transfer) implementation, is compatible with most programming environments and should be easy to use for end users and third parties. It provides comprehensive access to sequence data, quality control results, annotations, and many other data types. Where feasible, we have used standards to expose data and metadata. Code examples are provided in a number of languages both to show the versatility of the API and to provide a starting point for users. We present an API that exposes the data in MG-RAST for consumption by our users, greatly enhancing the utility of the MG-RAST service. 相似文献

16.

Fission stories: using PomBase to understand Schizosaccharomyces pombe biology

Midori A Harris Kim M Rutherford Jacqueline Hayles Antonia Lock Jürg Bhler Stephen G Oliver Juan Mata Valerie Wood 《Genetics》2022,220(4)

PomBase (www.pombase.org), the model organism database (MOD) for the fission yeast Schizosaccharomyces pombe, supports research within and beyond the S. pombe community by integrating and presenting genetic, molecular, and cell biological knowledge into intuitive displays and comprehensive data collections. With new content, novel query capabilities, and biologist-friendly data summaries and visualization, PomBase also drives innovation in the MOD community. 相似文献

17.

CoCo: a web application to display, store and curate ChIP-on-chip data integrated with diverse types of gene expression data

Girardot C Sklyar O Grosz S Huber W Furlong EE 《Bioinformatics (Oxford, England)》2007,23(6):771-773

相似文献

18.

Sequence Maneuverer: tool for sequence extraction from genomes

Tayyaba Yasmin Inayat Ur Rehman Adnan Ahmad Ansari Khurrum liaqat Muhammad Irfan khan 《Bioinformation》2012,8(25):1277-1279

The availability of genomic sequences of many organisms has opened new challenges in many aspects particularly in terms of genome analysis. Sequence extraction is a vital step and many tools have been developed to solve this issue. These tools are available publically but have limitations with reference to the sequence extraction, length of the sequence to be extracted, organism specificity and lack of user friendly interface. We have developed a java based software package having three modules which can be used independently or sequentially. The tool efficiently extracts sequences from large datasets with few simple steps. It can efficiently extract multiple sequences of any desired length from a genome of any organism. The results are crosschecked by published data.

Availability

URL 1: http://ww3.comsats.edu.pk/bio/ResearchProjects.aspxURL 2: http://ww3.comsats.edu.pk/bio/SequenceManeuverer.aspx 相似文献

19.

PATIKAmad: putting microarray data into pathway context

Babur O Colak R Demir E Dogrusoz U 《Proteomics》2008,8(11):2196-2198

High-throughput experiments, most significantly DNA microarrays, provide us with system-scale profiles. Connecting these data with existing biological networks poses a formidable challenge to uncover facts about a cell's proteome. Studies and tools with this purpose are limited to networks with simple structure, such as protein-protein interaction graphs, or do not go much beyond than simply displaying values on the network. We have built a microarray data analysis tool, named PATIKAmad, which can be used to associate microarray data with the pathway models in mechanistic detail, and provides facilities for visualization, clustering, querying, and navigation of biological graphs related with loaded microarray experiments. PATIKAmad is freely available to noncommercial users as a new module of PATIKAweb at http://web.patika.org. 相似文献

20.

HAlign-II: efficient ultra-large multiple sequence alignment and phylogenetic tree reconstruction with distributed and parallel computing

Shixiang Wan Quan Zou 《Algorithms for molecular biology : AMB》2017,12(1):25

Background

Multiple sequence alignment (MSA) plays a key role in biological sequence analyses, especially in phylogenetic tree construction. Extreme increase in next-generation sequencing results in shortage of efficient ultra-large biological sequence alignment approaches for coping with different sequence types.

Methods

Distributed and parallel computing represents a crucial technique for accelerating ultra-large (e.g. files more than 1 GB) sequence analyses. Based on HAlign and Spark distributed computing system, we implement a highly cost-efficient and time-efficient HAlign-II tool to address ultra-large multiple biological sequence alignment and phylogenetic tree construction.

Results

The experiments in the DNA and protein large scale data sets, which are more than 1GB files, showed that HAlign II could save time and space. It outperformed the current software tools. HAlign-II can efficiently carry out MSA and construct phylogenetic trees with ultra-large numbers of biological sequences. HAlign-II shows extremely high memory efficiency and scales well with increases in computing resource.

Conclusions

THAlign-II provides a user-friendly web server based on our distributed computing infrastructure. HAlign-II with open-source codes and datasets was established at http://lab.malab.cn/soft/halign.

相似文献