Background
Recently, Marcus et al. (Bioinformatics 30:3476–83,
2014) proposed to use a compressed de Bruijn graph to describe the relationship between the genomes of many individuals/strains of the same or closely related species. They devised an (O(nlog g)) time algorithm called splitMEM that constructs this graph directly (i.e., without using the uncompressed de Bruijn graph) based on a suffix tree, where
n is the total length of the genomes and
g is the length of the longest genome. Baier et al. (Bioinformatics 32:497–504,
2016) improved their result.