Paper2sequences: retrieval of sequences listed in a publication |
| |
Authors: | Mersch Henning Fuellen Georg |
| |
Affiliation: | Technische Fakult?t, Universit?t Bielefeld, Bielefeld, Germany. |
| |
Abstract: | Our web-based tool simplifies the often laborious procedure of retrieving a set of biosequences in a publication or webpage. As a front-end to the Bioperl toolkit, it accepts as an input a list of identifiers. They are specified in an ASCII table (copy-pasted from the publication's PDF or HTML page) and give rise to queries in multiple databases for the protein/nucleic acid data specified. Currently, GenBank, PIR (Protein Information Resource) and Swiss-Prot are supported. For any sequence accession code listed, the database can be specified and, if retrieval fails, automatic lookup for the same code in other databases can be requested. Sequence length information (if specified) and heuristic rules are used to drive the lookup if multiple protein coding sequences (CDS) are part of a single accession. Warnings are issued in cases of ambiguities and inconsistencies. An advanced option enables the user to format the output in whatever format they wish. |
| |
Keywords: | |
本文献已被 PubMed 等数据库收录! |
|