Efficient computation of absent words in genomic sequences |
| |
Authors: | Julia Herold, Stefan Kurtz, Robert Giegerich |
| |
Institution: | (1) Center of Biotechnology, Bielefeld University, Postfach 10 01 31, 33501 Bielefeld, Germany; (2) Center for Bioinformatics, University of Hamburg, Bundesstrasse 43, 20146 Hamburg, Germany |
| |
Abstract: | Background: Analysis of sequence composition is a routine task in genome research. Organisms are characterized by their base composition,
dinucleotide relative abundance, codon usage, and so on. Unique subsequences are markers of special interest in genome comparison,
expression profiling, and genetic engineering. Relative to a random sequence of the same length, unique subsequences are overrepresented
in real genomes. Shortest words absent from a genome have been addressed in two recent studies. |
| |
This article is indexed in SpringerLink and other databases. |
|