Spectral dictionaries: Integrating de novo peptide sequencing with database search of tandem mass spectra |
| |
Authors: | Sangtae Kim, Nitin Gupta, Nuno Bandeira, Pavel A. Pevzner |
| |
Affiliation: | Department of Computer Science and Engineering and Bioinformatics Program, University of California San Diego, La Jolla, California 92093 |
| |
Abstract: | Database search tools identify peptides by matching tandem mass spectra against a protein database. We study an alternative approach in which all plausible de novo interpretations of a spectrum (its spectral dictionary) are generated and then quickly matched against the database. We present a new algorithm, MS-Dictionary, for efficiently generating spectral dictionaries and demonstrate that it can identify spectra that are missed by database search. We argue that MS-Dictionary enables proteogenomics searches in six-frame translations of genomic sequences, which may be prohibitively time-consuming for existing database search approaches. We show that such searches make it possible to correct sequencing errors and find programmed frameshifts. |
| |
Keywords: | |
This article is indexed in ScienceDirect, PubMed, and other databases. |