Solving the Problem: Genome Annotation Standards before the Data Deluge |
| |
Authors: | Klimke William O'Donovan Claire White Owen Brister J Rodney Clark Karen Fedorov Boris Mizrachi Ilene Pruitt Kim D Tatusova Tatiana |
| |
Institution: | 1The National Center for Biotechnology Information, National Library of Medicine, NIH, Building 45, Bethesda, MD 20894, USA;2UniProt, The EMBL Outstation, The European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK;3Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA |
| |
Abstract: | The promise of genome sequencing was that the vast undiscovered country would be mapped out by comparison of the multitude of sequences available and would aid researchers in deciphering the role of each gene in every organism. Researchers recognize that there is a need for high quality data. However, different annotation procedures, numerous databases, and a diminishing percentage of experimentally determined gene functions have resulted in a spectrum of annotation quality. NCBI in collaboration with sequencing centers, archival databases, and researchers, has developed the first international annotation standards, a fundamental step in ensuring that high quality complete prokaryotic genomes are available as gold standard references. Highlights include the development of annotation assessment tools, community acceptance of protein naming standards, comparison of annotation resources to provide consistent annotation, and improved tracking of the evidence used to generate a particular annotation. The development of a set of minimal standards, including the requirement for annotated complete prokaryotic genomes to contain a full set of ribosomal RNAs, transfer RNAs, and proteins encoding core conserved functions, is an historic milestone. The use of these standards in existing genomes and future submissions will increase the quality of databases, enabling researchers to make accurate biological discoveries. |
| |
Keywords: | |
本文献已被 PubMed 等数据库收录! |
|